Research and Course Guides: Digital Humanities Resources: Text Analysis

Text Analysis Tools

Voyant
A reading and analysis environment for digital texts
AntConc
A freeware corpus analysis toolkit for concordancing and text analysis
Google Ngram Viewer
A tool for charting word and phrase frequency over time across the works in the Google Books corpus.
MALLET: MAchine Learning for LanguagE Toolkit
Open-source software for topic modeling and text analysis; requires command-line computing. University of Massachusetts Amherst.
TAPoR Portal
Discover tools for textual study. University of Alberta.
Annotation Studio
A suite of tools that helps individuals and classes engage more deeply with texts by reading, annotating, discussing, sharing, and composing. Includes a good user manual, tutorials, and case studies.
TV Corpus
Contains 325 million words of data from 75,000 TV episodes from the 1950s to the present. Every episode is tied to its IMDb entry, so you can create Virtual Corpora using extensive metadata: year, country, series, rating, genre, plot summary, etc.
OpenRefine
A free, open-source tool for working with messy data: cleaning it; transforming it from one format into another; and extending it with web services and external data.
Tableau Public
A free platform to explore, create, and share data visualizations.
Gephi
Open-source software for visualizing and exploring networks. Use it to graph the links between entities (such as people, places, or terms) drawn from textual data.
Google Trends
Search for names, topics, and phrases to see how popular they are in Google searches over a specified period of time.
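
For a sense of what concordancing and frequency tools such as AntConc and Voyant do behind the scenes, here is a minimal Python sketch. It is an illustration only, not how any of the tools above are implemented; the function names (tokenize, word_frequencies, kwic) and the sample sentence are made up for this example, and the tokenizer is deliberately simple (lowercase ASCII words only).

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def word_frequencies(text, top_n=5):
    """Return the top_n most frequent words as (word, count) pairs."""
    return Counter(tokenize(text)).most_common(top_n)


def kwic(text, keyword, width=3):
    """Keyword-in-context: show each hit with `width` words on either side."""
    tokens = tokenize(text)
    lines = []
    for i, tok in enumerate(tokens):
        if tok == keyword.lower():
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            lines.append(f"{left} [{tok}] {right}".strip())
    return lines


if __name__ == "__main__":
    # Hypothetical sample text; replace with the contents of your own file.
    sample = "The whale surfaced. The crew watched the whale dive again."
    print(word_frequencies(sample, top_n=2))
    for line in kwic(sample, "whale", width=2):
        print(line)
```

For real projects the dedicated tools above handle punctuation, non-English text, and large corpora far better than this sketch does.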

Locating Source Texts

There are many sources of full-text books; below are some of the larger and more commonly used ones. You might also search UST's extensive ebook holdings.

HathiTrust
A partnership of academic and research institutions, offering millions of titles digitized from libraries around the world.
Project Gutenberg
Free electronic texts
Internet Archive: Ebooks & Texts
Free downloadable texts in multiple formats
Online Books Page
From the University of Pennsylvania Libraries.
University of Minnesota Text Mining guide
A guide, maintained by the University of Minnesota, to additional sources of text for digital humanities projects.