ABSTRACT
Manifold-ranking has proved to be an effective method for topic-focused multi-document summarization. As basic manifold-ranking based summarization method constructs the relationships between sentences simply by the bag-of-words cosine similarity, we believe a better similarity metric will further improve the effectiveness of manifold-ranking. In this paper, we propose a joint optimization framework, which integrates the manifold-ranking process with a similarity metric learning process. The joint framework aims at learning better sentence similarity scores and better sentence ranking scores simultaneously. Experiments on DUC datasets show the proposed joint method achieves better performance than the manifold-ranking baselines and several popular methods.
- Xiaoyan Cai and Wenjie Li. Mutually reinforced manifold-ranking based relevance propagation model for query-focused multi-document summarization. Audio, Speech, and Language Processing, 20(5):1597--1607, 2012. Google ScholarDigital Library
- Hoa Trang Dang. Overview of duc 2006. In Document Understanding Conference. New York City, 2006.Google Scholar
- Weiwei Guo and Mona Diab. Modeling sentences in the latent space. In ACL, pages 864--872. Association for Computational Linguistics, 2012. Google ScholarDigital Library
- Zhanying He, Chun Chen, Jiajun Bu, Can Wang, Lijun Zhang, Deng Cai, and Xiaofei He. Document summarization based on data reconstruction. In AAAI, 2012.Google Scholar
- Lei Huang, Yanxiang He, Furu Wei, and Wenjie Li. Modeling document summarization as multi-objective optimization. In IITSI, pages 382--386. IEEE, 2010. Google ScholarDigital Library
- Yong-Deok Kim and Seungjin Choi. Weighted nonnegative matrix factorization. In Acoustics, Speech and Signal Processing, 2009., pages 1541--1544. IEEE, 2009. Google ScholarDigital Library
- Thomas K Landauer, Peter W Foltz, and Darrell Laham. An introduction to latent semantic analysis. Discourse processes, 25(2--3):259--284, 1998.Google Scholar
- Sun Park, Ju-Hong Lee, Chan-Min Ahn, Jun Sik Hong, and Seok-Ju Chun. Query based summarization using non-negative matrix factorization. In Knowledge-Based Intelligent Information and Engineering Systems, pages 84--89. Springer, 2006. Google ScholarDigital Library
- Sun Park, Ju-Hong Lee, Deok-Hwan Kim, and Chan-Min Ahn. Multi-document summarization based on cluster using non-negative matrix factorization. In SOFSEM 2007: Theory and Practice of Computer Science, pages 761--770. Springer, 2007. Google ScholarDigital Library
- Nathan Srebro, Tommi Jaakkola, et al. Weighted low-rank approximations. In ICML, volume 3, pages 720--727, 2003.Google ScholarDigital Library
- Jie Tang, Limin Yao, and Dewei Chen. Multi-topic based query-oriented summarization. In SDM, volume 9, pages 1147--1158. SIAM, 2009.Google ScholarCross Ref
- Xiaojun Wan and Jianguo Xiao. Graph-based multi-modality learning for topic-focused multi-document summarization. In IJCAI, pages 1586--1591, 2009. Google ScholarDigital Library
- Xiaojun Wan, Jianwu Yang, and Jianguo Xiao. Manifold-ranking based topic-focused multi-document summarization. In IJCAI, volume 7, pages 2903--2908, 2007. Google ScholarDigital Library
- Dingding Wang, Tao Li, Shenghuo Zhu, and Chris Ding. Multi-document summarization via sentence-level semantic analysis and symmetric matrix factorization. In SIGIR, pages 307--314. ACM, 2008. Google ScholarDigital Library
Index Terms
- Joint Matrix Factorization and Manifold-Ranking for Topic-Focused Multi-Document Summarization
Recommendations
Manifold-ranking based topic-focused multi-document summarization
IJCAI'07: Proceedings of the 20th international joint conference on Artifical intelligenceTopic-focused multi-document summarization aims to produce a summary biased to a given topic or user profile. This paper presents a novel extractive approach based on manifold-ranking of sentences to this summarization task. The manifold-ranking process ...
Research on Multi-document Summarization Based on LDA Topic Model
IHMSC '14: Proceedings of the 2014 Sixth International Conference on Intelligent Human-Machine Systems and Cybernetics - Volume 02Compared with VSM (Vector Space Model) and graph-ranking models, LDA (Latent Dirichlet Allocation) Model can discover latent topics in the corpus and latent topics are beneficial to use sentence-ranking mechanisms to form a good summary. In the paper, ...
Topic analysis for topic-focused multi-document summarization
CIKM '09: Proceedings of the 18th ACM conference on Information and knowledge managementTopic-focused multi-document summarization has been a challenging task because the created summary is required to be biased to the given topic or query. Existing methods consider the given topic as a single coarse unit and then directly incorporate the ...
Comments