skip to main content
10.3115/1118935.1118936dlproceedingsArticle/Chapter ViewAbstractPublication PagesiralConference Proceedingsconference-collections
Article
Free access

Improving summarization performance by sentence compression: a pilot study

Published: 07 July 2003 Publication History

Abstract

In this paper we study the effectiveness of applying sentence compression on an extraction based multi-document summarization system. Our results show that pure syntactic-based compression does not improve system performance. Topic signature-based reranking of compressed sentences does not help much either. However reranking using an oracle showed a significant improvement remains possible.

References

[1]
O. Buyukkokten, H. Garcia-Molina, A. Paepcke. 2001. Seeing the Whole in Parts: Text Summarization for Web Browsing on Handheld Devices. The 10th International WWW Conference (WWW10). Hong Kong, China.
[2]
J. Carroll, G. Minnen, Y. Canning, S. Devlin, and J. Tait. 1998. Practical Simplification of English Newspaper Text to Assist Aphasic Readers. In Proceedings of AAAI-98 Workshop on Integrating Artificial Intelligence and Assistive Technology, Madison, WI, USA.
[3]
H. H. Chen, J. J. Kuo, and T. C. Su 2003. Clustering and Visualization in a Multi-Lingual MultiDocument Summarization System. In Proceedings of 25th European Conference on Information Retrieval Research, Lecture Note in Computer Science, April 14-16, Pisa, Italy.
[4]
S. Corston-Oliver. 2001. Text Compaction for Display on Very Small Screens. In Proceedings of the Workshop on Automatic Summarization (WAS 2001), Pittsburgh, PA, USA.
[5]
DUC. 2002. The Document Understanding Conference. http://duc.nist.gov.
[6]
T. Dunning. 1993. Accurate Methods for the Statistics of Surprise and Coincidence. Computational Linguistics 19, 61--74.
[7]
H. P. Edmundson. 1969. New Methods in Automatic Abstracting. Journal of the Association for Computing Machinery. 16(2).
[8]
T. Fukusima and M. Okumura. 2001. Text Summarization Challenge Text Summarization Evaluation in Japan. In Proceedings of the Workshop on Automatic Summarization (WAS 2001), Pittsburgh, PA, USA.
[9]
J. Goldstein, M. Kantrowitz, V. Mittal, and J. Carbonell. 1999. Summarizing Text Documents: Sentence Selection and Evaluation Metrics. In Proceedings of the 22nd International ACM Conference on Research and Development in Information Retrieval (SIGIR-99), Berkeley, CA, USA, 121--128.
[10]
G. Grefenstette. 1998. Producing Intelligent Telegraphic Text Reduction to Provide an Audio Scanning Service for the Blind. In Working Notes of the AAAI Spring Symposium on Intelligent Text Summarization, Stanford University, CA, USA, 111--118.
[11]
E. Hovy and C.-Y. Lin. 1999. Automatic Text Summarization in SUMMARIST. In I. Mani and M. Maybury (eds), Advances in Automatic Text Summarization, 81--94. MIT Press.
[12]
H. Jing. 2000. Sentence simplification in automatic text summarization. In the Proceedings of the 6th Applied Natural Language Processing Conference (ANLP'00). Seattle, Washington, USA.
[13]
K. Knight and D. Marcu. 2000. Statistics-Based Summarization --- Step One: Sentence Compression. In Proceedings of AAAI-2000, Austin, TX, USA.
[14]
J. Kupiec, J. Pederson, and F. Chen. 1995. A Trainable Document Summarizer. In Proceedings of the 18th International ACM Conference on Research and Development in Information Retrieval (SIGIR-95), Seattle, WA, USA, 68--73.
[15]
C.-Y. Lin and E. Hovy. 2000. The Automated Acquisition of Topic Signatures for Text Summarization. In Proceedings of the 18th International Conference on Computational Linguistics (COLING 2000), Saarbrücken, Germany.
[16]
C.-Y. Lin and E. Hovy. 2002. From Single to Multi-document Summarization: A Prototype System and its Evaluation. In Proceedings of the 40th Anniversary Meeting of the Association for Computational Linguistics (ACL-2002), Philadelphia, PA, U.S.A.
[17]
C.-Y. Lin and E. Hovy. 2002. Manual and Automatic Evaluations of Summaries. In Proceedings of the Workshop on Automatic Summarization, post-conference workshop of ACL-2002, pp. 45--51, Philadelphia, PA, USA.
[18]
C.-Y. Lin and E. Hovy. 2003a. Automatic Evaluation of Summaries Using N-gram Cooccurrence Statistics. In Proceedings of the 2003 Human Language Technology Conference (HLT-NAACL 2003), Edmonton, Canada.
[19]
C.-Y. Lin and E. Hovy 2003b. The Potential and Limitations of Sentence Extraction for Summarization. In Proceedings of the Workshop on Automatic Summarization post-conference workshop of HLT-NAACL-2003, Edmonton, Canada.
[20]
H. P. Luhn. 1969. The Automatic Creation of Literature Abstracts. IBM Journal of Research and Development. 2(2).
[21]
D. Marcu. 1999. Discourse trees are good indicators of importance in text. In I. Mani and M. Maybury (eds), Advances in Automatic Text Summarization, 123--136. MIT Press.
[22]
K. McKeown, Barzilay, D. Evans, V. Hatzivassiloglou, J. L. Klavans, A. Nenkova, C. Sable, B. Schiffman, S. Sigelman. 2002. Tracking and Summarizing News on a Daily Basis with Columbia's Newsblaster. In Proceedings of Human Language Technology Conference 2002 (HLT 2002). San Diego, CA, USA.
[23]
NIST. 2002. Automatic Evaluation of Machine Translation Quality using N-gram Co-Occurrence Statistics.
[24]
P. Over and W. Liggett. 2002. Introduction to DUC-2002: an Intrinsic Evaluation of Generic News Text Summarization Systems. In Proceedings of Workshop on Automatic Summarization (DUC 2002), Philadelphia, PA, USA. http://www-nlpir.nist.gov/projects/duc/pubs/2002slides/overview.02.pdf
[25]
K. Papineni, S. Roukos, T. Ward, W.-J. Zhu. 2001. Bleu: a Method for Automatic Evaluation of Machine Translation. IBM Research Report RC22176 (W0 109--022).
[26]
D. R. Radev and K. R. McKeown. 1998. Generating Natural Language Summaries from Multiple On-line Sources. Computational Linguistics, 24(3):469--500.
[27]
T. Strzalkowski, G. Stein, J. Wang, and B, Wise. A Robust Practical Text Summarizer. 1999. In I. Mani and M. Maybury (eds), Advances in Automatic Text Summarization, 137--154. MIT Press.
[28]
M. White, T. Korelsky, C. Cardie, V. Ng, D. Pierce, and K. Wagstaff. 2001. Multidocument Summarization via Information Extraction. In Proceedings of Human Language Technology Conference 2001 (HLT 2001), San Diego, CA, USA.

Cited By

View all
  • (2024)An AI-Resilient Text Rendering Technique for Reading and Skimming DocumentsProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642699(1-22)Online publication date: 11-May-2024
  • (2018)Open-Schema Event Profiling for Massive News CorporaProceedings of the 27th ACM International Conference on Information and Knowledge Management10.1145/3269206.3271674(587-596)Online publication date: 17-Oct-2018
  • (2017)Recent advances in document summarizationKnowledge and Information Systems10.1007/s10115-017-1042-453:2(297-336)Online publication date: 1-Nov-2017
  • Show More Cited By
  1. Improving summarization performance by sentence compression: a pilot study

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image DL Hosted proceedings
    AsianIR '03: Proceedings of the sixth international workshop on Information retrieval with Asian languages - Volume 11
    July 2003
    175 pages
    • Program Chair:
    • Jun Adachi

    Publisher

    Association for Computational Linguistics

    United States

    Publication History

    Published: 07 July 2003

    Author Tags

    1. evaluation
    2. sentence compression
    3. sentence extraction
    4. text summarization

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)69
    • Downloads (Last 6 weeks)10
    Reflects downloads up to 02 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)An AI-Resilient Text Rendering Technique for Reading and Skimming DocumentsProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642699(1-22)Online publication date: 11-May-2024
    • (2018)Open-Schema Event Profiling for Massive News CorporaProceedings of the 27th ACM International Conference on Information and Knowledge Management10.1145/3269206.3271674(587-596)Online publication date: 17-Oct-2018
    • (2017)Recent advances in document summarizationKnowledge and Information Systems10.1007/s10115-017-1042-453:2(297-336)Online publication date: 1-Nov-2017
    • (2016)Sentence simplification, compression, and disaggregation for summarization of sophisticated documentsJournal of the Association for Information Science and Technology10.1002/asi.2357667:10(2437-2453)Online publication date: 1-Oct-2016
    • (2013)An abstractive approach to sentence compressionACM Transactions on Intelligent Systems and Technology10.1145/2483669.24836744:3(1-35)Online publication date: 1-Jul-2013
    • (2013)Discursive sentence compressionProceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume 210.1007/978-3-642-37256-8_33(394-407)Online publication date: 24-Mar-2013
    • (2012)Framework of automatic text summarization using reinforcement learningProceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning10.5555/2390948.2390980(256-265)Online publication date: 12-Jul-2012
    • (2011)Evaluating sentence compressionProceedings of the Workshop on Monolingual Text-To-Text Generation10.5555/2107679.2107690(91-97)Online publication date: 24-Jun-2011
    • (2011)Jointly learning to extract and compressProceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 110.5555/2002472.2002534(481-490)Online publication date: 19-Jun-2011
    • (2011)Discourse segmentation for sentence compressionProceedings of the 10th Mexican international conference on Advances in Artificial Intelligence - Volume Part I10.1007/978-3-642-25324-9_27(316-327)Online publication date: 26-Nov-2011
    • Show More Cited By

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Login options

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media