Aviso: para depositar documentos, por favor, inicia sesión e identifícate con tu cuenta de correo institucional de la UCM con el botón MI CUENTA UCM. No emplees la opción AUTENTICACIÓN CON CONTRASEÑA
 

Using Twitter Data to Monitor Natural Disaster Social Dynamics: A Recurrent Neural Network Approach with Word Embeddings and Kernel Density Estimation

dc.contributor.authorHernández-Suárez, Aldo
dc.contributor.authorSánchez-Pérez, Gabriel
dc.contributor.authorToscano-Medina, Karina
dc.contributor.authorPérez-Meana, Hector
dc.contributor.authorPortillo-Portillo, Jose
dc.contributor.authorSánchez, Victor
dc.contributor.authorGarcía Villalba, Luis Javier
dc.date.accessioned2023-06-17T12:38:38Z
dc.date.available2023-06-17T12:38:38Z
dc.date.issued2019-04-11
dc.description.abstractIn recent years, Online Social Networks (OSNs) have received a great deal of attention for their potential use in the spatial and temporal modeling of events owing to the information that can be extracted from these platforms. Within this context, one of the most latent applications is the monitoring of natural disasters. Vital information posted by OSN users can contribute to relief efforts during and after a catastrophe. Although it is possible to retrieve data from OSNs using embedded geographic information provided by GPS systems, this feature is disabled by default in most cases. An alternative solution is to geoparse specific locations using language models based on Named Entity Recognition (NER) techniques. In this work, a sensor that uses Twitter is proposed to monitor natural disasters. The approach is intended to sense data by detecting toponyms (named places written within the text) in tweets with event-related information, e.g., a collapsed building on a specific avenue or the location at which a person was last seen. The proposed approach is carried out by transforming tokenized tweets into word embeddings: a rich linguistic and contextual vector representation of textual corpora. Pre-labeled word embeddings are employed to train a Recurrent Neural Network variant, known as a Bidirectional Long Short-Term Memory (biLSTM) network, that is capable of dealing with sequential data by analyzing information in both directions of a word (past and future entries). Moreover, a Conditional Random Field (CRF) output layer, which aims to maximize the transition from one NER tag to another, is used to increase the classification accuracy. The resulting labeled words are joined to coherently form a toponym, which is geocoded and scored by a Kernel Density Estimation function. At the end of the process, the scored data are presented graphically to depict areas in which the majority of tweets reporting topics related to a natural disaster are concentrated. A case study on Mexico’s 2017 Earthquake is presented, and the data extracted during and after the event are reported.
dc.description.departmentDepto. de Ingeniería de Software e Inteligencia Artificial (ISIA)
dc.description.facultyFac. de Informática
dc.description.refereedTRUE
dc.description.sponsorshipUnión Europea. Horizonte 2020
dc.description.sponsorshipNational Science and Technology Council of Mexico (CONACyT)
dc.description.statuspub
dc.eprint.idhttps://eprints.ucm.es/id/eprint/67660
dc.identifier.doi10.3390/s19071746
dc.identifier.issn1424-8220
dc.identifier.officialurlhttps://doi.org/10.3390/s19071746
dc.identifier.relatedurlhttps://www.mdpi.com/1424-8220/19/7/1746
dc.identifier.urihttps://hdl.handle.net/20.500.14352/12688
dc.issue.number7
dc.journal.titleSensors
dc.language.isoeng
dc.page.initial1746
dc.publisherMDPI
dc.relation.projectIDRAMSES (700326)
dc.rightsAtribución 3.0 España
dc.rights.accessRightsopen access
dc.rights.urihttps://creativecommons.org/licenses/by/3.0/es/
dc.subject.keywordtwitter
dc.subject.keyworddata mining
dc.subject.keywordword2vec
dc.subject.keywordCRF
dc.subject.keywordLSTM
dc.subject.keywordgeocoding
dc.subject.keywordgeoparsing
dc.subject.ucmInternet (Informática)
dc.subject.ucmRedes
dc.subject.unesco3325 Tecnología de las Telecomunicaciones
dc.titleUsing Twitter Data to Monitor Natural Disaster Social Dynamics: A Recurrent Neural Network Approach with Word Embeddings and Kernel Density Estimation
dc.typejournal article
dc.volume.number19
dspace.entity.typePublication
relation.isAuthorOfPublication0f67f6b3-4d2f-4545-90e1-95b8d9f3e1f0
relation.isAuthorOfPublication.latestForDiscovery0f67f6b3-4d2f-4545-90e1-95b8d9f3e1f0

Download

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Using_Twitter_Data_to_Monitor_Natural_Disaster_Soc.pdf
Size:
2.6 MB
Format:
Adobe Portable Document Format

Collections