Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

Dynamic Spatio-Temporal Tweet Mining for Event Detection : A Case Study of Hurricane Florence

Farnaghi, Mahdi LU ; Ghaemi, Zeinab and Mansourian, Ali LU (2020) In International Journal of Disaster Risk Science 11(3). p.378-393
Abstract

Extracting information about emerging events in large study areas through spatiotemporal and textual analysis of geotagged tweets provides the possibility of monitoring the current state of a disaster. This study proposes dynamic spatio-temporal tweet mining as a method for dynamic event extraction from geotagged tweets in large study areas. It introduces the use of a modified version of ordering points to identify the clustering structure to address the intrinsic heterogeneity of Twitter data. To precisely calculate the textual similarity, three state-of-the-art text embedding methods of Word2vec, GloVe, and FastText were used to capture both syntactic and semantic similarities. The impact of selected embedding algorithms on the... (More)

Extracting information about emerging events in large study areas through spatiotemporal and textual analysis of geotagged tweets provides the possibility of monitoring the current state of a disaster. This study proposes dynamic spatio-temporal tweet mining as a method for dynamic event extraction from geotagged tweets in large study areas. It introduces the use of a modified version of ordering points to identify the clustering structure to address the intrinsic heterogeneity of Twitter data. To precisely calculate the textual similarity, three state-of-the-art text embedding methods of Word2vec, GloVe, and FastText were used to capture both syntactic and semantic similarities. The impact of selected embedding algorithms on the quality of the outputs was studied. Different combinations of spatial and temporal distances with the textual similarity measure were investigated to improve the event detection outcomes. The proposed method was applied to a case study related to 2018 Hurricane Florence. The method was able to precisely identify events of varied sizes and densities before, during, and after the hurricane. The feasibility of the proposed method was qualitatively evaluated using the Silhouette coefficient and qualitatively discussed. The proposed method was also compared to an implementation based on the standard density-based spatial clustering of applications with noise algorithm, where it showed more promising results.

(Less)
Please use this url to cite or link to this publication:
author
; and
organization
publishing date
type
Contribution to journal
publication status
published
subject
keywords
Disaster management, Hurricane Florence, Natural language processing, Spatio-temporal tweet analysis, Tweet clustering, Twitter, Machine Learning (ML)
in
International Journal of Disaster Risk Science
volume
11
issue
3
pages
16 pages
publisher
Springer
external identifiers
  • scopus:85085495747
ISSN
2095-0055
DOI
10.1007/s13753-020-00280-z
language
English
LU publication?
yes
id
22f834cd-4579-4e97-a7d8-9491eadc3e75
date added to LUP
2020-06-08 11:34:16
date last changed
2023-10-08 05:37:27
@article{22f834cd-4579-4e97-a7d8-9491eadc3e75,
  abstract     = {{<p>Extracting information about emerging events in large study areas through spatiotemporal and textual analysis of geotagged tweets provides the possibility of monitoring the current state of a disaster. This study proposes dynamic spatio-temporal tweet mining as a method for dynamic event extraction from geotagged tweets in large study areas. It introduces the use of a modified version of ordering points to identify the clustering structure to address the intrinsic heterogeneity of Twitter data. To precisely calculate the textual similarity, three state-of-the-art text embedding methods of Word2vec, GloVe, and FastText were used to capture both syntactic and semantic similarities. The impact of selected embedding algorithms on the quality of the outputs was studied. Different combinations of spatial and temporal distances with the textual similarity measure were investigated to improve the event detection outcomes. The proposed method was applied to a case study related to 2018 Hurricane Florence. The method was able to precisely identify events of varied sizes and densities before, during, and after the hurricane. The feasibility of the proposed method was qualitatively evaluated using the Silhouette coefficient and qualitatively discussed. The proposed method was also compared to an implementation based on the standard density-based spatial clustering of applications with noise algorithm, where it showed more promising results.</p>}},
  author       = {{Farnaghi, Mahdi and Ghaemi, Zeinab and Mansourian, Ali}},
  issn         = {{2095-0055}},
  keywords     = {{Disaster management; Hurricane Florence; Natural language processing; Spatio-temporal tweet analysis; Tweet clustering; Twitter; Machine Learning (ML)}},
  language     = {{eng}},
  number       = {{3}},
  pages        = {{378--393}},
  publisher    = {{Springer}},
  series       = {{International Journal of Disaster Risk Science}},
  title        = {{Dynamic Spatio-Temporal Tweet Mining for Event Detection : A Case Study of Hurricane Florence}},
  url          = {{http://dx.doi.org/10.1007/s13753-020-00280-z}},
  doi          = {{10.1007/s13753-020-00280-z}},
  volume       = {{11}},
  year         = {{2020}},
}