Textual Geographies uses named entity recognition and geolocation to extract place names from multilingual
(English, German, Spanish, and Chinese) printed volumes held by the HathiTrust digital library and to
associate those names with detailed geographic information. The project corpus currently includes more
than 5 million volumes in the public domain, but will soon expand to cover an equal number of non-public
The pilot phase of the project has enjoyed generous funding from the American Council of Learned Societies and the University of Notre Dame.
For more information concerning the corpus and research methods, please contact the project director, Matthew Wilkens.
Matthew Wilkens, University of Notre Dame (project director)
Cameron Blevins, Northeastern University
David Chiang, University of Notre Dame
Elizabeth Evans, University of Notre Dame
Marissa Gemma, Max Planck Institute for Empirical Aesthetics
Ryan Heuser, Stanford University
Matthew Sisk, University of Notre Dame
Mads Rosendahl Thomsen, Aarhus University