You are here

DHAsia Hands-On Clinic | Large-Scale Text Analysis of Japanese and Chinese Literature: An Introduction to Text Mining for Humanists, with Richard Jean So and Hoyt Long

In this hands-on workshop, we introduce colleagues in the humanities with little or no experience in computer science or programming to the rudiments of automated text analysis (or colloquially, “text mining”) for literary studies. 
This workshop will teach colleagues how to pursue this work from the very beginning steps: how to identify or build a corpora of texts; how to transform these texts into a format that a computer can interpret; how to input these texts into one’s computer and prepare them for computational and statistical analysis. 
After we teach these basic yet fundamental tasks, we will offer some lessons in introductory-level automated text analysis methods, such as document comparison and clustering analysis.  Throughout, we will provide easy-to-use computer code, so that any previous experience in programming is not necessary. 
Moreover, our workshop will address the particularities of dealing with Japanese and Chinese texts within text mining, and the code we provide works specifically for this type of material.  In sum, we expect participants who have completed this workshop to leave with enough practical skills to immediately begin their own text mining projects.
IMPORTANT NOTE: Although focused on Chinese and Japanese cases, the analytical approaches examined here are valuable for scholars working across Asia, on all time periods.

Details

When:

Thursday, March 3, 2016. 01:30 PM

Where:

CESTA, Wallenberg Hall, 4th Floor

Sponsor:

Wallenberg Hall, Center for Spatial and Textual Analysis (CESTA), History Department, Center for East Asian Studies, Department of English, Department of East Asian Languages and Cultures

Contact:

650-721-1385
tsmullaney@stanford.edu

Admission:

REGISTRATION REQUIRED | SPACE LIMITED | LIMITED TO STANFORD STUDENTS, FACULTY, AND STAFF | Please Contact Tom Mullaney (tsmullaney@stanford.edu)

Audience: