HathiTrust Research Center: a Tutorial

I will be presenting on the HathiTrust Research Center and how to use its beta research portal.

The HathiTrust Research Center (HTRC) is dedicated to providing computational access to published works in the public domain and, in the future, on limited terms to works in-copyright in the HathiTrust Digital Library (hathitrust.org). The HTRC is a collaborative research center launched jointly by Indiana University and the University of Illinois, along with the HathiTrust Digital Library, to help meet the technical challenges of dealing with massive amounts of digital text that researchers face by developing cutting-edge software tools and cyberinfrastructure to enable advanced computational access to the growing digital record of human knowledge.

This session will provide an introduction to using the HTRC portal for basic text mining investigations (htrc2.pti.indiana.edu/HTRC-UI-Portal2/). Attendees will learn how to build a workset from the HTRC corpus, apply the textual analysis tools provided in the HTRC portal, and generate visualizations such as word clouds and statistical frequencies.


Categories: Data Mining, Text Mining |

About Harriett Green

I'm the English and Digital Humanities Librarian at the University of Illinois at Urbana-Champaign. My research interests include use and users of digital humanities tools, and how humanities scholars integrate digital tools and materials in their research workflows, and humanities data curation.

1 Response to HathiTrust Research Center: a Tutorial

  1. Thanks, Harriet, for walking us through this great project! Does anyone have any specific questions about the HathiTrust that we should address?

Comments are closed.