DOI: 10.1145/1518701.1518986
research-article

Amplifying community content creation with mixed initiative information extraction

Published: 04 April 2009

ABSTRACT

Although existing work has explored both information extraction and community content creation, most research has focused on them in isolation. In contrast, we see the greatest leverage in the synergistic pairing of these methods as two interlocking feedback cycles. This paper explores the potential synergy promised if these cycles can be made to accelerate each other by exploiting the same edits to advance both community content creation and learning-based information extraction. We examine our proposed synergy in the context of Wikipedia infoboxes and the Kylin information extraction system. After developing and refining a set of interfaces to present the verification of Kylin extractions as a non primary task in the context of Wikipedia articles, we develop an innovative use of Web search advertising services to study people engaged in some other primary task. We demonstrate our proposed synergy by analyzing our deployment from two complementary perspectives: (1) we show we accelerate community content creation by using Kylin's information extraction to significantly increase the likelihood that a person visiting a Wikipedia article as a part of some other primary task will spontaneously choose to help improve the article's infobox, and (2) we show we accelerate information extraction by using contributions collected from people interacting with our designs to significantly improve Kylin's extraction performance.

Published in

CHI '09: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
April 2009
2426 pages
ISBN: 9781605582467
DOI: 10.1145/1518701

Copyright © 2009 ACM

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery
New York, NY, United States

Acceptance Rates

CHI '09 Paper Acceptance Rate: 277 of 1,130 submissions, 25%
Overall Acceptance Rate: 6,199 of 26,314 submissions, 24%
