Data Mining – THATCamp Gainesville 2014
http://gainesville2014.thatcamp.org
April 24-25, 2014, at the University of Florida

Humanities Software Development: Data Mining and Writing Studies
http://gainesville2014.thatcamp.org/2014/04/02/humanities-software-development-data-mining-and-writing-studies/
Wed, 02 Apr 2014 17:51:14 +0000

[Image: massmine-in-emacs]

We will give a short introduction to MassMine, an open source software project developed by academic/humanities researchers for use within the academy. The software has been used to mine data from Twitter, and that data is being analyzed as the basis for a publication about trends, media ecology, and the concept of cybernetic “attention.” Our presentation will explain how the project grew out of the limitations of currently available tools for academic research on social media. The goal is for the introduction to lead to an engaging dialogue about the prospects for humanities software development, about how and why data science and data mining may offer useful methods for research in the humanities, and about how software development and data science may be integral to the study of “writing” (any form of inscription or multimodal composition) as it occurs within an ever-changing media ecology.

Nicholas M. Van Horn will co-present and collaborate remotely for this session.

www.massmine.com
HathiTrust Research Center: a Tutorial
http://gainesville2014.thatcamp.org/2014/03/23/hathitrust-research-center-a-tutorial/
Sun, 23 Mar 2014 01:07:14 +0000

I will be presenting on the HathiTrust Research Center and how to use its beta research portal.

The HathiTrust Research Center (HTRC) is dedicated to providing computational access to published works in the public domain and, in the future, on limited terms to in-copyright works in the HathiTrust Digital Library (hathitrust.org). The HTRC is a collaborative research center launched jointly by Indiana University and the University of Illinois, along with the HathiTrust Digital Library. It aims to help researchers meet the technical challenges of working with massive amounts of digital text by developing software tools and cyberinfrastructure that enable advanced computational access to the growing digital record of human knowledge.

This session will introduce the HTRC portal (htrc2.pti.indiana.edu/HTRC-UI-Portal2/) as a tool for basic text mining investigations. Attendees will learn how to build a workset from the HTRC corpus, apply the textual analysis tools provided in the portal, and generate results such as word clouds and word frequency statistics.
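
For readers unfamiliar with word frequency statistics, the following is a minimal Python sketch of the general idea. It does not use the HTRC portal or any HTRC API; the function name `word_frequencies` and the sample sentence are assumptions made purely for illustration.

```python
import re
from collections import Counter


def word_frequencies(text, top_n=5):
    """Return the top_n most common lowercase words in text with their counts.

    Hypothetical helper for illustration only; not part of the HTRC portal.
    """
    # Lowercase the text and keep runs of letters and apostrophes as words.
    words = re.findall(r"[a-z']+", text.lower())
    # Counter tallies each word; most_common sorts by descending count.
    return Counter(words).most_common(top_n)


if __name__ == "__main__":
    # Made-up sample text standing in for a workset of volumes.
    sample = "The whale, the sea, and the ship: the whale swims in the sea."
    for word, count in word_frequencies(sample, top_n=3):
        print(f"{word}\t{count}")
```

A word cloud is a visual rendering of the same counts, with more frequent words drawn larger.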

 
