Abstract
This article describes ongoing dissertation work on the automatic generation of Wikipedia articles. The goal is to build an AI system that automatically summarizes existing web content and uses the resulting text to improve incomplete Wikipedia articles.
Index Terms
- WikiKreator: automatic authoring of Wikipedia content