I will be presenting on the HathiTrust Research Center and how to use its beta research portal.
The HathiTrust Research Center (HTRC) is dedicated to providing computational access to published works in the public domain and, in the future, on limited terms to works in-copyright in the HathiTrust Digital Library (hathitrust.org). The HTRC is a collaborative research center launched jointly by Indiana University and the University of Illinois, along with the HathiTrust Digital Library, to help meet the technical challenges of dealing with massive amounts of digital text that researchers face by developing cutting-edge software tools and cyberinfrastructure to enable advanced computational access to the growing digital record of human knowledge.
This session will provide an introduction to using the HTRC portal for basic text mining investigations (htrc2.pti.indiana.edu/HTRC-UI-Portal2/). Attendees will learn how to build a workset from the HTRC corpus, apply the textual analysis tools provided in the HTRC portal, and generate visualizations such as word clouds and statistical frequencies.