Automatic identification of locative expressions from social media text: A comparative analysis

Published:

Fei Liu, Maria Vasardani and Timothy Baldwin (2014) Automatic Identification of Locative Expressions from Social Media Text: A Comparative Analysis, In Proceedings of The 4th International Workshop on Location and the Web (LocWeb 2014), Shanghai, China, pp. 9–16.

@inproceedings{Liu+:2014,
  author    = {Liu, Fei and Vasardani, Maria and Baldwin, Timothy},
  title     = {Automatic Identification of Locative Expressions from Social Media Text: A Comparative Analysis},
  booktitle = {Proceedings of the 4th International Workshop on Location and the Web (LocWeb 2014)},
  year      = {2014},
  address   = {Shanghai, China},
  pages     = {9--16}
} 

Abstract

With the proliferation of smartphones and the increasing popularity of social media, people have developed habits of posting not only their thoughts and opinions, but also con- tent concerning their whereabouts. On such highly-interactive yet informal social media platforms, people make heavy use of informal language, including when it comes to locative expressions. Such usage inhibits the ability of traditional Natural Language Processing approaches to retrieve geospa- tial information from social media text. In this research, we: (1) develop a medium-scale corpus of “locative expres- sions” derived from a variety of social media sources; (2) benchmark the performance of a range of geoparsers over the corpus, with the finding that even the best-performing systems are substantially lacking; and (3) carry out exten- sive error analysis to suggest ways of improving the accuracy and robustness of geoparsers.