About


Textual Geographies uses named entity recognition and geolocation to extract place names from multilingual (English, German, Spanish, and Chinese) printed volumes held by the HathiTrust digital library and to associate those names with detailed geographic information. The project corpus currently includes more than 5 million volumes in the public domain, but will soon expand to cover an equal number of non-public domain texts.

The pilot phase of the project has enjoyed generous funding from the American Council of Learned Societies and the University of Notre Dame.

For more information concerning the corpus and research methods, please contact the project director, Matthew Wilkens.

People


Matthew Wilkens, University of Notre Dame (project director)
Cameron Blevins, Northeastern University
David Chiang, University of Notre Dame
Elizabeth Evans, University of Notre Dame
Marissa Gemma, Max Planck Institute for Empirical Aesthetics
Ryan Heuser, Stanford University
Matthew Sisk, University of Notre Dame
Mads Rosendahl Thomsen, Aarhus University

Advisory Board


David Bamman, University of California Berkeley
J. Stephen Downie, University of Illinois Urbana-Champaign
Ian Gregory, Lancaster University
Beth Plale, Indiana University Bloomington

Return to Home Page