skip to main content
10.1145/3159652.3159712acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
research-article

Logician: A Unified End-to-End Neural Approach for Open-Domain Information Extraction

Authors Info & Claims
Published:02 February 2018Publication History

ABSTRACT

In this paper, we consider the problem of open information extraction (OIE) for extracting entity and relation level intermediate structures from sentences in open-domain. We focus on four types of valuable intermediate structures (Relation, Attribute, Description, and Concept), and propose a unified knowledge expression form, SAOKE, to express them. We publicly release a data set which contains 48,248 sentences and the corresponding facts in the SAOKE format labeled by crowdsourcing. To our knowledge, this is the largest publicly available human labeled data set for open information extraction tasks. Using this labeled SAOKE data set, we train an end-to-end neural model using the sequence-to-sequence paradigm, called Logician, to transform sentences into facts. For each sentence, different to existing algorithms which generally focus on extracting each single fact without concerning other possible facts, Logician performs a global optimization over all possible involved facts, in which facts not only compete with each other to attract the attention of words, but also cooperate to share words. An experimental study on various types of open domain relation extraction tasks reveals the consistent superiority of Logician to other states-of-the-art algorithms. The experiments verify the reasonableness of SAOKE format, the valuableness of SAOKE data set, the effectiveness of the proposed Logician model, and the feasibility of the methodology to apply end-to-end learning paradigm on supervised data sets for the challenging tasks of open information extraction.

References

  1. Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2015. Neural Machine Translation By Jointly Learning To Align and Translate. In Proceedings of ICLR.Google ScholarGoogle Scholar
  2. Michele Banko, Mj Cafarella, and Stephen Soderland. 2007. Open information extraction for the web. In IJCAI. 2670--2676. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Qingqing Cai and Alexander Yates. 2013. Large-scale Semantic Parsing via Schema Matching and Lexicon Extension. In Proceedings of the 51st Annual Meeting of ACL. 423--433.Google ScholarGoogle Scholar
  4. Kaushik Chakrabarti, Surajit Chaudhuri, Tao Cheng, and Dong Xin. 2011. Entity-tagger: automatically tagging entities with descriptive phrases. In Proceedings of the 20th International Conference Companion on WWW. 19--20. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Wanxiang Che, Zhenghua Li, and Ting Liu. 2010. LTP: A Chinese Language Technology Platform. In Proceedings of COLING. 13--16. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Kyunghyun Cho, Bart van Merrienboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. 2014. Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation. In Proceedings of the 2014 Conference on EMNLP. 1724--1734.Google ScholarGoogle ScholarCross RefCross Ref
  7. Janara Christensen, Mausam, Stephen Soderland, and Oren Etzioni. 2011. An analysis of open information extraction based on semantic role labeling. In Proceedings of the sixth International Conference on Knowledge Capture. 113--120. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Janara Christensen, Mausam, Stephen Soderland, Oren Etzioni, Mausam, Stephen Soderland, and Oren Etzioni. 2013. Towards Coherent Multi-Document Summarization. In Proceedings of the 2013 Conference of NAACL: HLT. 1163--1173.Google ScholarGoogle Scholar
  9. Janara Christensen, Stephen Soderland, and Gagan Bansal. 2014. Hierarchical Summarization: Scaling Up Multi-Document Summarization. In Proceedings of the 52nd Annual Meeting of ACL. 902--912.Google ScholarGoogle ScholarCross RefCross Ref
  10. Luciano Del Corro and Rainer Gemulla. 2013. ClausIE: Clause-Based Open Information Extraction. In Proceedings of the 22nd International Conference on WWW. 355--366. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Li Dong and Mirella Lapata. 2016. Language to Logical Form with Neural Attention. In In Proceedings of the Annual Meeting of ACL. 33--43. arXiv:1601.01280Google ScholarGoogle ScholarCross RefCross Ref
  12. Oren Etzioni, Anthony Fader, Janara Christensen, Stephen Soderland, and Mausam. 2011. Open information extraction: The second generation. In Proceed- ings of IJCAI. 3--10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Anthony Fader, Stephen Soderland, and Oren Etzioni. 2011. Identifying Relations for Open Information Extraction. In Proceedings of the Conference on EMNLP. 1535--1545. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Anthony Fader, Luke S. Zettlemoyer, and Oren Etzioni. 2014. Open Question Answering Over Curated and Extracted Knowledge Bases. In Proceedings of the 20th ACM SIGKDD. 1156--1165. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Jiatao Gu, Zhengdong Lu, Hang Li, and Victor O.K. Li. 2016. Incorporating Copying Mechanism in Sequence-to-Sequence Learning. In Proceedings of the 54th Annual Meeting of ACL. 1631--1640.Google ScholarGoogle Scholar
  16. Rahul Gupta and A. Halevy. 2014. Biperpedia: An Ontology for Search Applica- tions. In Proceedings of the VLDB Endowment. 505--516. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Luheng He, Kenton Lee, Mike Lewis, and Luke Zettlemoyer. 2017. Deep Semantic Role Labeling: What Works and What's Next. In Proceedings of the 55th Annual Meeting of the ACL. 473--483.Google ScholarGoogle ScholarCross RefCross Ref
  18. Luheng He, Mike Lewis, and Luke Zettlemoyer. 2015. Question-Answer Driven Semantic Role Labeling: Using Natural Language to Annotate Natural Language. In Proceedings of the 2015 Conference on EMNLP. 643--653.Google ScholarGoogle ScholarCross RefCross Ref
  19. Marti A. Hearst. 1992. Automatic Acquisition of Hyponyms ftom Large Text Corpora. In Proceedings of the 14th conference on Computational Linguistics. 23--28. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Geoffrey Hinton, Nitish Srivastava, and Kevin Swersky. 2012. Overview of mini-batch gradient descent. Technical Report.Google ScholarGoogle Scholar
  21. Nanda Kambhatla. 2004. Combining lexical, syntactic, and semantic features with maximum entropy models for extracting relations. In Proceedings of the ACL: Interactive Poster and Demonstration Sessions. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Rohit J. Kate, Yuk Wah, and Wong Raymond. 2005. Learning to Transform Natural to Formal Languages. In Proceedings of the 20th AAAI. 1062--1068. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Tushar Khot, Ashish Sabharwal, and Peter Clark. 2017. Answering Complex Questions Using Open Information Extraction. In Proceedings of the 55th Annual Meeting of the ACL. 311--316. arXiv:1704.05572Google ScholarGoogle ScholarCross RefCross Ref
  24. Tom Kwiatkowski, Eunsol Choi, Yoav Artzi, and Luke Zettlemoyer. 2013. Scaling Semantic Parsers with On-the-fly Ontology Matching. In Proceedings of the 2013 Conference on EMNLP. 1545--1556.Google ScholarGoogle Scholar
  25. Jinyang Li, Chengyu Wang, Xiaofeng He, Rong Zhang, and Ming Gao. 2015. User Generated Content Oriented Chinese Taxonomy Construction. In Lecture Notes in Computer Science. Vol. 9313. 623--634.Google ScholarGoogle Scholar
  26. Yankai Lin, Shiqi Shen, Zhiyuan Liu, Huanbo Luan, and Maosong Sun. 2016. Neural Relation Extraction with Selective Attention over Instances. In Proceedings of the 54th Annual Meeting of ACL. 2124--2133.Google ScholarGoogle ScholarCross RefCross Ref
  27. Christopher D. Manning, John Bauer, Jenny Finkel, Steven J Bethard, Mihai Surdeanu, and David McClosky. 2014. The Stanford CoreNLP Natural Language Processing Toolkit. In Proceedings of 52nd Annual Meeting of ACL: System Demon- strations. 55--60. arXiv:arXiv:1011.1669v3Google ScholarGoogle ScholarCross RefCross Ref
  28. Mausam. 2016. Open Information Extraction Systems and Downstream Applications. In Proceedings of the 25th IJCAI. 4074--4077. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Mike Mintz, Steven Bills, Rion Snow, and Dan Jurafsky. 2009. Distant supervision for relation extraction without labeled data. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Vol. 2. 1003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Makoto Miwa and Mohit Bansal. 2016. End-to-End Relation Extraction using LSTMs on Sequences and Tree Structures. In Proceedings of the 54th Annual Meeting of ACL. 1105--1116.Google ScholarGoogle ScholarCross RefCross Ref
  31. Harinder Pal and Mausam. 2016. Demonyms and Compound Relational Nouns in Nominal Open IE. In Proceedings of the 5th Workshop on AKBC. 35--39.Google ScholarGoogle ScholarCross RefCross Ref
  32. Likun Qiu and Yue Zhang. 2014. ZORE : A Syntax-based System for Chinese Open Relation Extraction. In Proceedings of the 2014 Conference on EMNLP. 1870--1880.Google ScholarGoogle ScholarCross RefCross Ref
  33. John W. Ratcliff and David E. Metzener. 1988. Pattern Matching: The Gestalt Approach. Dr Dobb's 13, 7 (1988).Google ScholarGoogle Scholar
  34. Sebastian Riedel, Limin Yao, Andrew McCallum, and Benjamin M. Marlin. 2013. Relation Extraction with Matrix Factorization and Universal Schemas. Proceedings of the 2013 Conference of NAACL: HLT June (2013), 74--84.Google ScholarGoogle Scholar
  35. Sebastian Riedel, Limin Yao, Andrew McCallum, and Benjamin M. Marlin. 2013. Relation Extraction with Matrix Factorization and Universal Schemas. In Proceedings of the 2013 Conference of NAACL: HLT. 74--84.Google ScholarGoogle Scholar
  36. Michael Schmitz, Robert Bart, Stephen Soderland, and Oren Etzioni. 2012. Open language learning for information extraction. In Proceedings of the 2012 Joint Conference on EMNLP and CoNLL. 523--534. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Stephen Soderland, Brendan Roof, Bo Qin, and Shi Xu. 2010. Adapting Open Information Extraction to Domain-Specific Relations. AI Magazine 31, 3 (2010), 93--102.Google ScholarGoogle ScholarCross RefCross Ref
  38. Gabriel Stanovsky and Ido Dagan. 2016. Creating a Large Benchmark for Open Information Extraction. In Proceedings of the 2016 Conference on EMNLP. 2300--2305.Google ScholarGoogle ScholarCross RefCross Ref
  39. Gabriel Stanovsky, Ido Dagan, and Mausam. 2015. Open IE as an Intermediate Structure for Semantic Tasks. In Proceedings of the 53rd Annual Meeting of ACL and the 7th IJCNLP. 303--308.Google ScholarGoogle ScholarCross RefCross Ref
  40. Zhaopeng Tu, Zhengdong Lu, Yang Liu, Xiaohua Liu, and Hang Li. 2016. Modeling Coverage for Neural Machine Translation. In Proceedings of the Annual Meeting of ACL (2016), 76--85.Google ScholarGoogle ScholarCross RefCross Ref
  41. Vered Shwartz, Yoav Goldberg, Ido Dagan, Vered Shwartz, Yoav Goldberg, and Ido Dagan. 2016. Improving hypernymy detection with an integrated path-based and distributional method. In Proceedings of the 54th Annual Meeting of ACL. 2389--2398. arXiv:1603.06076Google ScholarGoogle ScholarCross RefCross Ref
  42. Chengyu Wang and Xiaofeng He. 2017. A Short Survey on Taxonomy Learning from Text Corpora : Issues, Resources and Recent Advances. In Proceedings of the Conference on EMNLP.Google ScholarGoogle ScholarCross RefCross Ref
  43. Wikipedia. 2017. Assignment problem-Wikipedia, The Free Encyclopedia. (2017).Google ScholarGoogle Scholar
  44. Fei Wu and Daniel S. Weld. 2010. Open Information Extraction using Wikipedia. In Proceedings of the 48th Annual Meeting of ACL. 118--127. Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. Wentao Wu, Hongsong Li, Haixun Wang, and Kenny Q. Zhu. 2012. Probase: A probabilistic taxonomy for text understanding. In Proceedings of the 2012 ACM SIGMOD. 481--492. Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. Mohamed Yahya, Steven Euijong Whang, Rahul Gupta, and Alon Halevy. 2014. ReNoun : Fact Extraction for Nominal Attributes. In Proceedings of the Conference on EMNLP 2014, Doha, Qatar. 325--335.Google ScholarGoogle ScholarCross RefCross Ref
  47. Pengcheng Yin, Zhengdong Lu, Hang Li, and Ben Kao. 2016. Neural Enquirer: Learning to Query Tables. In In Proceedings of the Annual Meeting of ACL. 29--35.Google ScholarGoogle Scholar
  48. Dmitry Zelenko, Chinatsu Aone, Anthony Richardella, Jaz Kandola, Thomas Hofmann, Tomaso Poggio, and John Shawe-Taylor. 2003. Kernel Methods for Relation Extraction. Journal of Machine Learning Research 3 (2003), 1083--1106. Google ScholarGoogle ScholarDigital LibraryDigital Library
  49. Luke S. Zettlemoyer and Michael Collins. 2005. Learning to Map Sentences to Logical Form : Structured Classification with Probabilistic Categorial Grammars. In Proceedings of the 21st Conference on UAI. 658--666. Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. Suncong Zheng, Feng Wang, Hongyun Bao, Yuexing Hao, Peng Zhou, and Bo Xu. 2017. Joint Extraction of Entities and Relations Based on a Novel Tagging Scheme. Proceedings of the 55th Annual Meeting of the ACL (2017), 1227--1236.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Logician: A Unified End-to-End Neural Approach for Open-Domain Information Extraction

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          WSDM '18: Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining
          February 2018
          821 pages
          ISBN:9781450355810
          DOI:10.1145/3159652

          Copyright © 2018 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 2 February 2018

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          WSDM '18 Paper Acceptance Rate81of514submissions,16%Overall Acceptance Rate498of2,863submissions,17%

          Upcoming Conference

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader