skip to main content
10.1145/2676585.2676604acmotherconferencesArticle/Chapter ViewAbstractPublication PagessoictConference Proceedingsconference-collections
research-article

An interpretable method for text summarization based on simplicial non-negative matrix factorization

Published:04 December 2014Publication History

ABSTRACT

Automatic text summarization plays an important role in information retrieval and text mining. Furthermore, it provides an useful solution to the information overload problem. In this paper, we propose a simplicial NMF-based unsupervised generic document summarization method which can inherit some advantages of simplicial NMF such as easy interpretability, low complexity, convexity and sparsity. By focusing on the major topics contained within every sentence as well as entire document, our method generates better summaries with less repetition. The effectiveness of our method is proved by experimental results. On the summarization performance, our approach obtains mostly higher ROUGE scores than NMF-based method.

References

  1. Kenneth L Clarkson. Coresets, sparse greedy approximation, and the frank-wolfe algorithm. ACM Transactions on Algorithms (TALG), 6(4): 63, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. William B Frakes and Ricardo Baeza-Yates. Information retrieval: data structures and algorithms. 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Yihong Gong and Xin Liu. Generic text summarization using relevance measure and latent semantic analysis. In Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, pages 19--25. ACM, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Patrik O Hoyer. Non-negative matrix factorization with sparseness constraints. The Journal of Machine Learning Research, 5: 1457--1469, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Canasai Kruengkrai and Chuleerat Jaruskulchai. Generic text summarization using local and global properties of sentences. In Web Intelligence, 2003. WI 2003. Proceedings. IEEE/WIC International Conference on, pages 201--206. IEEE, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Ju-Hong Lee, Sun Park, Chan-Min Ahn, and Daeho Kim. Automatic generic document summarization based on non-negative matrix factorization. Information Processing & Management, 45(1): 20--34, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Chin-Yew Lin. Looking for a few good metrics: Automatic summarization evaluation-how many samples are enough. In Proceedings of the NTCIR Workshop, volume 4, pages 1--10, 2004.Google ScholarGoogle Scholar
  8. Inderjeet Mani and Mark T Maybury. Advances in automatic text summarization, volume 293. MIT Press, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Rada Mihalcea. Graph-based ranking algorithms for sentence extraction, applied to text summarization. In Proceedings of the ACL 2004 on Interactive poster and demonstration sessions, page 20. Association for Computational Linguistics, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Gabriel Murray, Steve Renals, and Jean Carletta. Extractive summarization of meeting recordings. In INTERSPEECH, pages 593--596, 2005.Google ScholarGoogle ScholarCross RefCross Ref
  11. Ani Nenkova, Lucy Vanderwende, and Kathleen McKeown. A compositional context sensitive multi-document summarizer: exploring the factors that influence summarization. In Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, pages 573--580. ACM, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Duy Khuong Nguyen, Khoat Than, and Tu Bao Ho. Simplicial nonnegative matrix factorization. In Computing and Communication Technologies, Research, Innovation, and Vision for the Future (RIVF), 2013 IEEE RIVF International Conference on, pages 47--52. IEEE, 2013.Google ScholarGoogle Scholar
  13. Makbule Gulcin Ozsoy, Ilyas Cicekli, and Ferda Nur Alpaslan. Text summarization of turkish texts using latent semantic analysis. In Proceedings of the 23rd international conference on computational linguistics, pages 869--876. Association for Computational Linguistics, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Josef Steinberger and Karel Ježek. Text summarization and singular value decomposition. In Advances in Information Systems, pages 245--254. Springer, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Hongyuan Zha. Generic summarization and keyphrase extraction using mutual reinforcement principle and sentence clustering. In Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, pages 113--120. ACM, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. An interpretable method for text summarization based on simplicial non-negative matrix factorization

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      SoICT '14: Proceedings of the 5th Symposium on Information and Communication Technology
      December 2014
      304 pages
      ISBN:9781450329309
      DOI:10.1145/2676585

      Copyright © 2014 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 4 December 2014

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate147of318submissions,46%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader