More and more source text in the humanities gets digitized every day, making it accessible to large scale computational analysis. Nevertheless, traditional methods of humanistic analysis are based on detailed arguments built upon on close readings of individual texts. How…
Tagged with: digital humanities
, english literature
, Natural language processing
, search interface
, text mining
, user interfaces
Posted in Digital Collections
, Digital Humanities
, Information Seeking
, Natural Language Processing
This year’s conference of the Association for Computational Linguistics, the most prestigious event in computational linguistics, had a paper that got me very excited. It’s called Extracting Social Networks from Literary Fiction [pdf], and here’s the abstract (emphasis added): We…
Take an example question that a literary scholar might have,
“How is the character Mary talked about in this text from by author X”?
It’s fairly open ended – what does “talked about” mean? How do we translate this into computational terms? In this post, I’ll describe some tools that natural language processing (NLP) has to offer, and show how each can be used to tackle this question along with pointers to sofware and tutorials.
Joseph Turian & co. at MetaOptimize have started a Q+A forum for “data geeks” – people in machine learning or data mining who deal with questions about visualizing, processing, or otherwise making sense of big data sets
Yesterday, I attended a group meeting with the Literature Lab at Stanford University’s English Department, where they presented some very cool new results on mining 19th Century British and American novels. The lab, fresh on its feet, is headed by…
Automated tools that pick up on so-called obvious information are easy to dismiss. We say – oh, it can’t tell us anything more than we already know. What we forget is that sometimes we’re looking for exactly this sort of information: stuff that is obvious… but that we don’t know yet.
Historians seem to use text in two ways, the first is to get an idea of what’s out there, the same way all researchers use the literature in their fields. The second is as evidence – what traces might have an event, personal characteristic, impression, or anything else, have left in textual records from around that time?