Journal of the Japanese Association for Digital Humanities
Online ISSN : 2188-7276
The Long Arc of History: Neural Network Approaches to Diachronic Linguistic Change
Eun Seo Jo, Mark Algee-Hewitt

2018 Volume 3 Issue 1 Pages 1-32


Historians have traditionally relied on close readings of select primary sources to evaluate linguistic and discursive changes over time, but this approach is limited in scope. Numeric representations of language allow us to quantify discursive changes statistically, compare their significance, and capture linguistic relationships over time. Here, we compare two deep learning methods for quantitatively identifying the chronology of linguistic shifts: RNN classification and RNN language modeling. In particular, we examine deep learning methods for isolating stylistic from topical changes, generating “decade embeddings,” and charting the changing average perplexity of a language model trained on chronologically sorted data. We apply these models to a historical diplomatic corpus and find that the two world wars were notable moments of linguistic change in American foreign relations. With this example, we demonstrate applications of text-based deep learning methods in the digital humanities.
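To make the idea of "charting the changing average perplexity" concrete, the following is a minimal sketch, not the authors' implementation. It assumes that some trained language model has already produced a natural-log probability for each token of each dated document. The function names `perplexity` and `average_perplexity_by_decade` are hypothetical, and the toy probabilities are invented purely for illustration.

```python
import math
from collections import defaultdict


def perplexity(token_log_probs):
    """Perplexity is exp of the negative mean natural-log probability per token."""
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    return math.exp(-sum(token_log_probs) / len(token_log_probs))


def average_perplexity_by_decade(documents):
    """Average document-level perplexity within each decade.

    documents: iterable of (year, token_log_probs) pairs, where
    token_log_probs are assumed to come from an already-trained model.
    Returns a dict mapping decade (e.g. 1910) to mean perplexity.
    """
    by_decade = defaultdict(list)
    for year, log_probs in documents:
        decade = (year // 10) * 10  # 1918 -> 1910
        by_decade[decade].append(perplexity(log_probs))
    return {d: sum(v) / len(v) for d, v in sorted(by_decade.items())}


# Toy input (assumed values, not data from the paper): each document has
# four tokens, all assigned the same probability by the hypothetical model.
docs = [
    (1912, [math.log(0.5)] * 4),
    (1918, [math.log(0.25)] * 4),
    (1943, [math.log(0.125)] * 4),
]
result = average_perplexity_by_decade(docs)
```

Under this sketch, a decade whose documents the model finds harder to predict receives a higher average perplexity, so a sharp rise between adjacent decades would be one possible signal of linguistic change.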

© Eun Seo Jo and Mark Algee-Hewitt