An automated approach for geocoding tabular itineraries

Santos, Rui and Murrieta-Flores, Patricia and Martins, Bruno (2017) An automated approach for geocoding tabular itineraries. In: GIR'17 Proceedings of the 11th Workshop on Geographic Information Retrieval :. Association for Computing Machinery, Inc, DEU. ISBN 9781450353380

[thumbnail of An_Automated_Approach_for_Geocoding_Tabular_Itineraries_PMF]
Preview
PDF (An_Automated_Approach_for_Geocoding_Tabular_Itineraries_PMF)
ACM_GIR_An_Automated_Approach_for_Geocoding_Tabular_Itineraries_PMF.pdf - Accepted Version
Available under License Creative Commons Attribution.

Download (1MB)

Abstract

Historical itineraries, often accessible as lists or tables describing places visited in sequence, are abundant resources and also important objects of study for humanities scholars. This article advances a novel method for automatically geocoding tabular itineraries, combining approximate string matching with a cost optimization algorithm based on dynamic programming. Experiments with a dataset of historical itineraries, with ground-truth geocoding annotations provided by domain experts and leveraging also the GeoNames gazetteer, attest to the effectiveness of the proposed method. The obtained results show that while approximate string matching can already achieve very low median errors, with many toponyms matching exactly against GeoNames entries, the combination with cost optimization can significantly improve results in terms of the average distance towards the correct disambiguations.

Item Type:
Contribution in Book/Report/Proceedings
Uncontrolled Keywords:
/dk/atira/pure/subjectarea/asjc/1700/1709
Subjects:
?? automated geocodingdigital humanitiesdynamic programminggeographic information retrievaltoponym matchinghuman-computer interactioncomputer networks and communicationscomputer vision and pattern recognitionsoftware ??
ID Code:
124083
Deposited By:
Deposited On:
23 May 2018 13:26
Refereed?:
Yes
Published?:
Published
Last Modified:
20 Aug 2024 23:14