DOI: 10.1145/2806416.2806624
short-paper

Improving Event Detection by Automatically Assessing Validity of Event Occurrence in Text

Published: 17 October 2015

Abstract

Manually inspecting text to assess whether an event occurs in a document collection is an onerous and time-consuming task. Although manually discarding false events would increase the precision of an automatically detected event set, the approach does not scale. In this paper, we automate event validation, defined as the task of determining whether a given event occurs in a given document or corpus. Introducing automatic event validation as a post-processing step of event detection can boost the precision of the detected event set by discarding false events while preserving true ones. We propose a novel automatic method for event validation that relies on a supervised model to predict the occurrence of events in a non-annotated corpus. The training data for the model is gathered through crowdsourcing. Experiments on real-world events and documents show that our method (i) outperforms the state-of-the-art event validation approach and (ii) increases the precision of event detection while preserving recall.
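As a rough illustration of why validation as a post-processing step can raise precision without lowering recall, the following minimal sketch filters a toy set of detected events with a thresholded score. The event identifiers, the scores, the threshold, and the function names are all hypothetical; the scores merely stand in for the output of a trained validation model and are not taken from the paper.

```python
from typing import Callable, Set, Tuple


def precision_recall(detected: Set[str], relevant: Set[str]) -> Tuple[float, float]:
    """Return (precision, recall) of a detected event set against the true events."""
    true_positives = len(detected & relevant)
    precision = true_positives / len(detected) if detected else 0.0
    recall = true_positives / len(relevant) if relevant else 0.0
    return precision, recall


def validate_events(detected: Set[str], is_valid: Callable[[str], bool]) -> Set[str]:
    """Post-processing step: keep only the events the validator accepts."""
    return {event for event in detected if is_valid(event)}


# Toy data (hypothetical): three true events, and a detector that also
# returns two false events, f1 and f2.
relevant = {"e1", "e2", "e3"}
detected = {"e1", "e2", "e3", "f1", "f2"}

# Hypothetical validator scores standing in for a supervised model's output.
scores = {"e1": 0.9, "e2": 0.8, "e3": 0.7, "f1": 0.2, "f2": 0.6}
THRESHOLD = 0.5  # assumed decision threshold

validated = validate_events(detected, lambda event: scores[event] >= THRESHOLD)

before = precision_recall(detected, relevant)
after = precision_recall(validated, relevant)
print("before validation:", before)
print("after validation: ", after)
```

In this toy setting the validator rejects one of the two false events and keeps every true event, so precision rises while recall is unchanged. A validator that also rejected true events would trade recall for precision.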




    Published In

    CIKM '15: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management
    October 2015
    1998 pages
    ISBN:9781450337946
    DOI:10.1145/2806416


    Publisher

    Association for Computing Machinery

    New York, NY, United States


    Author Tags

    1. event detection
    2. event validation
    3. precision boosting

    Qualifiers

    • Short-paper


    Acceptance Rates

    CIKM '15 Paper Acceptance Rate 165 of 646 submissions, 26%;
    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%
