This page links to a variety of text analysis tools and to information on how they might be used in the classroom. Much of the content on this page is adapted from a presentation by Dr. Chris Wells (Macalester College) at the workshop Fostering the Digital Humanities at St. Thomas, June 2015.
(Image credit: Annotation Studio)
There are countless sources of full-text books; included here are but a handful. You might also search UST's extensive ebook holdings. Journal articles and dissertations can serve as sources for text analysis as well.
JSTOR is an archiving service and search platform for thousands of academic journals, newspapers, and ebooks, to which the library subscribes. Constellate is a platform JSTOR has created to allow text mining analysis of items in the JSTOR corpus. Full Constellate access is a subscription service, to which we do not subscribe, but a free tier of service is available that allows you to create and analyze datasets.
HathiTrust is a not-for-profit collaborative of academic and research libraries preserving 17+ million digitized items. It offers reading access to the fullest extent allowable by U.S. copyright law, computational access to the entire corpus for scholarly research, and other emerging services based on the combined collection. The University of St. Thomas is not a member of HathiTrust, so we have access only to digitized items in the public domain, but this is still a huge collection.
The HathiTrust Research Center offers tools that allow researchers to query and interact with data from the HathiTrust Library. The Center is breaking new ground in the areas of text mining and non-consumptive research, allowing scholars to use HathiTrust content while preventing intellectual property misuse within the confines of current U.S. copyright law.
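To make the idea of non-consumptive research a little more concrete, here is a minimal Python sketch. It is an illustration only and not the Research Center's actual tooling. The function name page_token_counts and the two short sample pages are hypothetical. The point it tries to show is that an analysis can work from derived data, such as word counts per page, without handing the readable text itself to the researcher.

```python
import re
from collections import Counter


def page_token_counts(page_text: str) -> Counter:
    """Return lowercase word counts for one page.

    Only the counts are returned; word order, and therefore the
    readable text, is not part of the result.
    """
    tokens = re.findall(r"[a-z']+", page_text.lower())
    return Counter(tokens)


# Hypothetical sample pages, standing in for a digitized volume.
pages = [
    "It was the best of times, it was the worst of times.",
    "It was the age of wisdom, it was the age of foolishness.",
]

# One Counter per page: the kind of derived data a researcher might receive.
features = [page_token_counts(p) for p in pages]

# Combine the per-page counts into counts for the whole volume.
volume_counts = sum(features, Counter())

print(volume_counts.most_common(3))
```

In a classroom setting, students could run something like this on a public domain text they have downloaded, then compare the most frequent words across several books.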