Language & Literature
Spring 2009 - Present
The study of intertextuality, the shaping of a text’s meaning by other texts, remains a labor-intensive process for the literary critic. Julia Kristeva, who coined the term intertext, suggested, "Any text is constructed as a mosaic of quotations; any text is the absorption and transformation of another". Such transformations range from direct quotations, representing a simple and overt intertextuality, to more complex references that are intentionally or subconsciously absorbed into a text. In the years since Kristeva first drew attention to the phenomenon, the field of its study has become increasingly - in some cases debilitatingly - complex. As this theoretical complexity grows, so does the burden upon the practicing literary critic to verify suspected instances of intertextuality. The critic must command a large corpus of possible contributing works; meanwhile, objective criteria by which intertext may be measured are lacking. Since, in many cases, the problem is one of pattern recognition, the task of locating new relationships between texts and validating suspected ones is a good candidate for automated assistance by computers.
In this work, we propose the use of machine learning and related statistical methods to improve the process by which intertextuality is studied. Specifically, we bring to bear computational techniques from the field of stylistics in order to examine instances where an author who is familiar with a particular corpus deliberately or subconsciously reflects this in discrete passages within his own work. In the feature space, we are particularly interested in the repetitive stylistic nature of sound oriented texts. Through our analysis, we have established that authors make extensive use of repetitive sound to emphasize ideas or phrases, or to construct poetic forms.
This work is supported by NEH Digital Humanities Start-Up Grant Award No. HD-51570-12 and NEH Digital Humanities Advancement Grant No. HAA-258767-18
- "Practical Text Phylogeny for Real-World Settings,", , ,IEEE Access,December 2018.
- "Coupling Story to Visualization: Using Textual Analysis as a Bridge Between Data, , , ,
and Interpretation,"Proceedings of the ACM International Conference on Intelligent User Interfaces (IUI),March 2018.
- "Authorship Attribution for Social Media Forensics,", , , , , , , ,IEEE Transactions on Information Forensics and Security (T-IFS),January 2017.
- "The Sense of a Connection: Automatic Tracing of Intertextuality by Meaning,", , ,Digital Scholarship in the Humanities (DSH),April 2016.
- "Evidence of Intertextuality: Investigating Paul the Deacon's Angustae Vitae,", , ,Literary & Linguistic Computing (LLC),September 2011.
- "Features from Frequency: Authorship and Stylistic Analysis Using Repetitive Sound,", ,Proceedings of the 4th Annual Chicago Colloquium on Digital Humanities and Computer Science (DHCS),November 2009.
- "Euterpe's Hidden Song: Patterns in Elegy,", ,Digital Humanities 2014 (DH),July 2014.
- "Modelling the Interpretation of Literary Allusion with Machine Learning Techniques,", , , , , , ,Digital Humanities 2013 (DH),July 2013.
- "Revealing Hidden Patterns in the Meter of Homer's Iliad,", ,The 7th Annual Chicago Colloquium on Digital Humanities and Computer Science (DHCS),November 2012.
- "Visualizing Sound as Functional n-grams in Homeric Greek Poetry,", ,Digital Humanities 2011 (DH),June 2011.
- "A Statistical Study of Latin Elegiac Couplets,", ,The 5th Annual Chicago Colloquium on Digital Humanities and Computer Science (DHCS),November 2010.
- The Tesserae code is available on GitHub