skip to main content
10.1145/3209978.3210030acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Identifying Sub-events and Summarizing Disaster-Related Information from Microblogs

Authors Info & Claims
Published:27 June 2018Publication History

ABSTRACT

In recent times, humanitarian organizations increasingly rely on social media to search for information useful for disaster response. These organizations have varying information needs ranging from general situational awareness (i.e., to understand a bigger picture) to focused information needs e.g., about infrastructure damage, urgent needs of affected people. This research proposes a novel approach to help crisis responders fulfill their information needs at different levels of granularities. Specifically, the proposed approach presents simple algorithms to identify sub-events and generate summaries of big volume of messages around those events using an Integer Linear Programming (ILP) technique. Extensive evaluation on a large set of real world Twitter dataset shows (a). our algorithm can identify important sub-events with high recall (b). the summarization scheme shows (6---30%) higher accuracy of our system compared to many other state-of-the-art techniques. The simplicity of the algorithms ensures that the entire task is done in real time which is needed for practical deployment of the system.

References

  1. Dhekar Abhik and Durga Toshniwal. 2013. Sub-event detection during natural hazards using features of social media data. In Proceedings of the 22nd International Conference on World Wide Web. ACM, 783--788. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Allison Badgett and Ruihong Huang. 2016. Extracting Subevents via an Effective Two-phase Approach.. In EMNLP. 906--911.Google ScholarGoogle Scholar
  3. David M Blei, Andrew Y Ng, and Michael I Jordan. 2003. Latent dirichlet allocation. Journal of machine Learning research 3, Jan (2003), 993--1022. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Dongfeng Cai, Yonghua Hu, Xuelei Miao, and Yan Song. 2009. Dependency Grammar Based English Subject-Verb Agreement Evaluation.. In PACLIC. Citeseer, 63--71. {5} Mark A. Cameron, Robert Power, Bella Robinson, and Jie Yin. 2012. Emergency Situation Awareness from Twitter for Crisis Management. In Proc. WWW. ACM, 695--698. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Carlos Castillo. 2016. Big Crisis Data: Social Media in Disasters and Time-Critical Situations (1st ed.). Cambridge University Press, New York, NY, USA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Gunes Erkan and Dragomir R. Radev. 2004. LexRank:Graph-based lexical centrality as salience in text summarization. Artificial Intelligence Research 22 (2004), 457--479. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Huiji Gao, Geoffrey Barbier, and Rebecca Goolsby. 2011. Harnessing the Crowdsourcing Power of Social Media for Disaster Relief. Intelligent Systems, IEEE 26, 3 (2011), 10--14. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Kevin Gimpel, Nathan Schneider, Brendan O'Connor, Dipanjan Das, Daniel Mills, Jacob Eisenstein, Michael Heilman, Dani Yogatama, Jeffrey Flanigan, and Noah Smith, A. 2011. Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments. In Proc. ACL. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. gurobi 2015. Gurobi -- The overall fastest and best supported solver available. http://www.gurobi.com/.Google ScholarGoogle Scholar
  10. Muhammad Imran, Carlos Castillo, Fernando Diaz, and Sarah Vieweg. 2015. Processing social media messages in mass emergency: a survey. ACM Computing Surveys (CSUR) 47, 4 (2015), 67. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Muhammad Imran, Carlos Castillo, Ji Lucas, Patrick Meier, and Sarah Vieweg. 2014. Aidr: Artificial intelligence for disaster response. In Proc. WWW companion. 159--162. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Chris Kedzie, Kathleen McKeown, and Fernando Diaz. 2015. Predicting Salient Updates for Disaster Summarization. In Proc. ACL. Beijing, China, 1608--1617.Google ScholarGoogle ScholarCross RefCross Ref
  13. Lingpeng Kong, Nathan Schneider, Swabha Swayamdipta, Archna Bhatia, Chris Dyer, and Noah A. Smith. 2014. A Dependency Parser for Tweets. In Proc. EMNLP.Google ScholarGoogle Scholar
  14. Chin-Yew Lin. 2004. ROUGE: A package for automatic evaluation of summaries. In Proc. Workshop on Text Summarization Branches Out (with ACL).Google ScholarGoogle Scholar
  15. Polykarpos Meladianos, Giannis Nikolentzos, François Rousseau, Yannis Stavrakas, and Michalis Vazirgiannis. 2015. Degeneracy-based real-time sub-event detection in twitter stream. In Proc. AAAI ICWSM. 248--257.Google ScholarGoogle Scholar
  16. Minh-Tien Nguyen, Asanobu Kitamoto, and Tri-Thanh Nguyen. 2015. TSum4act: A Framework for Retrieving and Summarizing Actionable Tweets during a Disaster for Reaction. In Proc. PAKDD.Google ScholarGoogle ScholarCross RefCross Ref
  17. Miles Osborne, Sean Moran, Richard McCreadie, Alexander Von Lunen, Martin Sykora, Elizabeth Cano, Neil Ireson, Craig Macdonald, Iadh Ounis, Yulan He, Tom Jackson, Fabio Ciravegna, and Ann OBrien. 2014. Real-Time Detection, Tracking, and Monitoring of Automatically Discovered Events in Social Media. In Proc. ACL.Google ScholarGoogle ScholarCross RefCross Ref
  18. Patrick Pantel and Dekang Lin. 2002. Discovering word senses from text. In Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 613--619. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Daniela Pohl, Abdelhamid Bouchachia, and Hermann Hellwagner. 2012. Automatic sub-event detection in emergency management using social media. In Proc. WWW. ACM, 683--686. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Koustav Rudra, Siddhartha Banerjee, Niloy Ganguly, Pawan Goyal, Muhammad Imran, and Prasenjit Mitra. 2016. Summarizing Situational Tweets in Crisis Scenario. In Proceedings of the 27th ACM Conference on Hypertext and Social Media. ACM, 137--147. {22} Koustav Rudra, Subham Ghosh, Niloy Ganguly, Pawan Goyal, and Saptarshi Ghosh. 2015. Extracting Situational Information from Microblogs during Disaster Events: a Classification-Summarization Approach. In Proc. CIKM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Takeshi Sakaki, Makoto Okazaki, and Yutaka Matsuo. 2010. Earthquake shakes Twitter users: real-time event detection by social sensors. In Proc. WWW. 851--860. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Lidan Shou, Zhenhua Wang, Ke Chen, and Gang Chen. 2013. Sumblr: Continuous Summarization of Evolving Tweet Streams. In Proc. ACM SIGIR. 533--542. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. summary-matrix 2017. Summary Matrix - Kemeny-Young method. https: //en.wikipedia.org/wiki/Kemeny-Young_method.Google ScholarGoogle Scholar
  24. Lynda Tamine, Laure Soulier, Lamjed Ben Jabeur, Frederic Amblard, Chihab Hanachi, Gilles Hubert, and Camille Roth. 2016. Social Media-Based Collaborative Information Access: Analysis of Online Crisis-Related Twitter Conversations. In ACM 27th Conference on Hypertext & Social Media. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Istvan Varga, Motoki Sano, Kentaro Torisawa, Chikara Hashimoto, Kiyonori Ohtake, Takao Kawai, Jong-Hoon Oh, and Stijn De Saeger. 2013. Aid is Out There: Looking for Help from Tweets during a Large Scale Disaster.. In Proc. ACL.Google ScholarGoogle Scholar
  26. Sudha Verma, Sarah Vieweg, William J. Corvey, Leysia Palen, James H. Martin, Martha Palmer, Aaron Schram, and Kenneth M. Anderson. 2011. Natural Language Processing to the Rescue? Extracting "Situational Awareness" Tweets During Mass Emergency. In Proc. AAAI ICWSM.Google ScholarGoogle Scholar
  27. Sarah Vieweg, Carlos Castillo, and Muhammad Imran. 2014. Integrating social media communications into the rapid assessment of sudden onset disasters. In Social Informatics. Springer, 444--461.Google ScholarGoogle Scholar
  28. Xiaohui Yan, Jiafeng Guo, Yanyan Lan, and Xueqi Cheng. 2013. A biterm topic model for short texts. In Proc. WWW. ACM, 1445--1456. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Identifying Sub-events and Summarizing Disaster-Related Information from Microblogs

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in
            • Published in

              cover image ACM Conferences
              SIGIR '18: The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval
              June 2018
              1509 pages
              ISBN:9781450356572
              DOI:10.1145/3209978

              Copyright © 2018 ACM

              Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 27 June 2018

              Permissions

              Request permissions about this article.

              Request Permissions

              Check for updates

              Qualifiers

              • research-article

              Acceptance Rates

              SIGIR '18 Paper Acceptance Rate86of409submissions,21%Overall Acceptance Rate792of3,983submissions,20%

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader