Extraction of spatio-temporal data about historical events from text documents
Research output: Contribution to journal › Research article › Contributed › peer-review
Contributors
Abstract
Often, we are faced with questions regarding past events and the answers are hidden in the historical text archives. The growing developments in geographic information retrieval and temporal information retrieval techniques have given new ways to explore digital text archives for spatio-temporal data. The question is how to retrieve the answers from the text documents. This work contributes to a better understanding of spatio-temporal information extraction from text documents. Natural language processing techniques were used to develop an information extraction approach using the GATE language processing software. The developed framework uses gazetteer matching, spatio-temporal relationship extraction and pattern-based rules to recognize and annotate elements in historical text documents. The extracted spatio-temporal data is used as input for GIS studies on the time–geography context of the German–Herero resistance war of 1904 in Namibia. Related issues when analyzing the historical data in current GIS are discussed. Particularly problematic are movement data in small scale with poor temporal density and trajectories that are short or connect very distant locations.
Details
Original language | English |
---|---|
Pages (from-to) | 677-696 |
Journal | Transactions in GIS |
Volume | 22 |
Issue number | 3 |
Publication status | Published - Jun 2018 |
Peer-reviewed | Yes |
External IDs
Scopus | 85051815414 |
---|---|
ORCID | /0000-0002-9016-1996/work/144671012 |