research-article

WikiKreator: automatic authoring of Wikipedia content

Authors:
Siddhartha Banerjee

The Pennsylvania State University

The Pennsylvania State University
View Profile

,
Prasenjit Mitra

Qatar Computing Research Institute, Hamad Bin Khalifa University and The Pennsylvania State University

Qatar Computing Research Institute, Hamad Bin Khalifa University and The Pennsylvania State University
View Profile

Authors Info & Claims

AI Matters Volume 2 Issue 1September 2015pp 4–6https://doi.org/10.1145/2813536.2813538

Published:07 October 2015Publication History

AI Matters

Abstract

This article describes ongoing dissertation work on the automatic generation of Wikipedia articles. The goal of this work is to build an AI system to automatically summarize existing web content and utilize the resulting text to improve incomplete Wikipedia articles.

References

Banerjee, S., Caragea, C., & Mitra, P. (2014). Playscript Classification and Automatic Wikipedia Play Articles Generation. In 22nd International Conference on Pattern Recognition (ICPR), Stockholm. Google ScholarDigital Library
Banerjee, S., & Mitra, P. (2015a). Filling the Gaps: Improving Wikipedia Stubs. In the 15th ACM SIGWEB International Symposium on Document Engineering (DocEng). Laussanne, Switzerland: ACM. Google ScholarDigital Library
Banerjee, S., & Mitra, P. (2015b). WikiKreator: Improving Wikipedia Stubs Automatically. In the 53rd Annual Meeting of the Association for Computational Linguistics (ACL). Beijing, China: ACL.Google ScholarDigital Library
Banerjee, S., Mitra, P., & Sugiyama, K. (2015). Multi-Document Abstractive Summarization Using ILP based Multi-Sentence Compression. In 24th International Joint Conference on Artificial Intelligence (IJCAI). Buenos Aires, Argentina: AAAI press. Google ScholarDigital Library
Blei, D.M., Ng, A.Y., & Jordan, M.I. (2003). Latent dirichlet allocation. Journal of Machine Learning Research 3, 993--1022. Google ScholarDigital Library
Filippova, K. (2010). Multi-Sentence Compression?: Finding Shortest Paths in Word Graphs. In Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010) (pp. 322--330). Google ScholarDigital Library
Nenkova, A. (2011). Automatic Summarization. Foundations and Trends in Information Retrieval 5(2), 103--233. doi:10.1561/1500000015Google ScholarCross Ref
Sauper, C., & Barzilay, R. (2009). Automatically Generating Wikipedia Articles?: A Structure-Aware Approach. In ACL 09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP (pp. 208--216). Google ScholarDigital Library

Index Terms

WikiKreator: automatic authoring of Wikipedia content
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
2. Information systems
  1. Information systems applications
    1. Decision support systems
      1. Expert systems

Recommendations

Web personal name disambiguation based on reference entity tables mined from the web
WIDM '09: Proceedings of the eleventh international workshop on Web information and data management

Ambiguous personal names are common on the Web, which pose a challenge for many different tasks. The traditional disambiguation employs the clustering methods. However, without reference entity tables, the clustering method can only identify whether two ...
Read More
Comparison of Methods to Annotate Named Entity Corpora

The authors compared two methods for annotating a corpus for the named entity (NE) recognition task using non-expert annotators: (i) revising the results of an existing NE recognizer and (ii) manually annotating the NEs completely. The annotation time, ...
Read More
Named entity recognition and disambiguation using linked data and graph-based centrality scoring
SWIM '12: Proceedings of the 4th International Workshop on Semantic Web Information Management

Named Entity Recognition (NER) is a subtask of information extraction and aims to identify atomic entities in text that fall into predefined categories such as person, location, organization, etc. Recent efforts in NER try to extract entities and link ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in
AI Matters Volume 2, Issue 1
September 2015
14 pages
EISSN:2372-3483
DOI:10.1145/2813536
Editors:
Eric Eaton
U. Pennsylvania
,
Kiri Wagstaff
JPL/Caltech
Issue’s Table of Contents
Copyright © 2015 Authors
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 7 October 2015
Check for updates
Qualifiers
- research-article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 2
  Total Citations
  View Citations
- 169
  Total Downloads
- Downloads (Last 12 months)2
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

WikiKreator: automatic authoring of Wikipedia content

AI Matters

Abstract

References

Cited By

Index Terms

Recommendations

Web personal name disambiguation based on reference entity tables mined from the web

Comparison of Methods to Annotate Named Entity Corpora

Named entity recognition and disambiguation using linked data and graph-based centrality scoring