research-article

Identifying Sub-events and Summarizing Disaster-Related Information from Microblogs

Authors:
Koustav Rudra, Indian Institute of Technology Kharagpur, Kharagpur, India
Pawan Goyal, Indian Institute of Technology Kharagpur, Kharagpur, India
Niloy Ganguly, Indian Institute of Technology Kharagpur, Kharagpur, India
Prasenjit Mitra, The Pennsylvania State University, University Park, PA, USA
Muhammad Imran, Qatar Computing Research Institute (HBKU), Doha, Qatar

SIGIR '18: The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, June 2018, Pages 265–274
https://doi.org/10.1145/3209978.3210030

Published: 27 June 2018

ABSTRACT

Humanitarian organizations increasingly rely on social media to find information useful for disaster response. Their information needs vary, ranging from general situational awareness (i.e., understanding the bigger picture) to focused needs, e.g., information about infrastructure damage or the urgent needs of affected people. This research proposes a novel approach to help crisis responders fulfill their information needs at different levels of granularity. Specifically, the proposed approach presents simple algorithms to identify sub-events and to generate summaries of the large volume of messages around those events using an Integer Linear Programming (ILP) technique. Extensive evaluation on a large set of real-world Twitter datasets shows that (a) our algorithm can identify important sub-events with high recall, and (b) the summarization scheme achieves 6–30% higher accuracy than many other state-of-the-art techniques. The simplicity of the algorithms ensures that the entire task is done in real time, which is needed for practical deployment of the system.
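The abstract does not state the ILP objective or constraints, so the following is only a generic illustration of ILP-style extractive summarization, not the authors' formulation. It assumes a common setup: pick a subset of tweets that maximizes the total weight of the distinct content words covered, subject to a word budget. The function name `summarize`, the sample tweets, and the `weights` table are all hypothetical, and exhaustive enumeration stands in for a real ILP solver so that the sketch runs without extra dependencies.

```python
from itertools import combinations


def summarize(tweets, weights, budget):
    """Exactly solve a tiny 0-1 selection program by enumeration.

    maximize   sum of weights of the distinct content words covered
    subject to total word count of the selected tweets <= budget

    Assumption: this coverage objective is a generic stand-in; the
    abstract does not specify the paper's actual ILP.
    """
    tokenized = [t.lower().split() for t in tweets]
    best_score, best_subset = 0.0, ()
    for k in range(1, len(tweets) + 1):
        for subset in combinations(range(len(tweets)), k):
            length = sum(len(tokenized[i]) for i in subset)
            if length > budget:
                continue  # violates the length constraint
            covered = set().union(*(tokenized[i] for i in subset))
            score = sum(weights.get(w, 0.0) for w in covered)
            if score > best_score:
                best_score, best_subset = score, subset
    return [tweets[i] for i in best_subset]


# Hypothetical input: four short messages and hand-picked word weights.
tweets = [
    "bridge collapsed near airport",
    "airport closed bridge collapsed",
    "need water and medicine in kathmandu",
    "pray for nepal",
]
weights = {
    "bridge": 2.0, "collapsed": 2.0, "airport": 1.5, "closed": 1.0,
    "water": 2.0, "medicine": 2.0, "kathmandu": 1.0, "nepal": 0.5,
}
summary = summarize(tweets, weights, budget=10)
for line in summary:
    print(line)
```

Because each word is counted once no matter how many selected tweets contain it, the objective discourages redundant picks: the two near-duplicate bridge messages are never worth selecting together. Enumeration is exponential in the number of tweets, so anything beyond a toy input would hand the same program to a dedicated ILP solver.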

Index Terms

1. Information systems
  1. Information retrieval

Published in
SIGIR '18: The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval
June 2018
1509 pages
ISBN: 9781450356572
DOI: 10.1145/3209978
General Chairs:
Kevyn Collins-Thompson, University of Michigan, United States
Qiaozhu Mei, University of Michigan, United States
Program Chairs:
Brian Davison, Lehigh University, United States
Yiqun Liu, Tsinghua University, China
Emine Yilmaz, University College London, United Kingdom
Copyright © 2018 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
class-based summarization
high-level summarization
humanitarian classes
situational information
sub-event detection
Acceptance Rates
SIGIR '18 paper acceptance rate: 86 of 409 submissions (21%). Overall acceptance rate: 792 of 3,983 submissions (20%).