Multi-Source Pointer Network for Product Title Summarization

Authors:
Fei Sun, Peng Jiang, Hanxiao Sun, Changhua Pei, Wenwu Ou, Xiaobo Wang
Alibaba Group, Beijing, China

CIKM '18: Proceedings of the 27th ACM International Conference on Information and Knowledge Management, October 2018, Pages 7–16
https://doi.org/10.1145/3269206.3271722

Published: 17 October 2018

ABSTRACT

In this paper, we study the product title summarization problem in E-commerce applications for display on mobile devices. Compared with conventional sentence summarization, product title summarization has some extra and essential constraints. For example, factual errors or loss of key information are intolerable for E-commerce applications. Therefore, we abstract two more constraints for product title summarization: (i) do not introduce irrelevant information; (ii) retain the key information (e.g., brand name and commodity name). To address these issues, we propose a novel multi-source pointer network by adding a new knowledge encoder to the pointer network. The first constraint is handled by the pointer mechanism. For the second constraint, we restore the key information by copying words from the knowledge encoder with the help of a soft gating mechanism. For evaluation, we build a large collection of real-world product titles along with human-written short titles. Experimental results demonstrate that our model significantly outperforms the baselines. Finally, online deployment of our proposed model has yielded a significant business impact, as measured by the click-through rate.
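The copying-with-soft-gating idea can be illustrated with a small sketch. The abstract does not give the exact formulation, so the code below rests on assumptions: each source (title and knowledge) gets its own attention distribution, a scalar gate in [0, 1] mixes them, and the gate is passed in directly rather than computed from a decoder state. The function names (`softmax`, `copy_distribution`, `multi_source_pointer_step`) and the example tokens and scores are hypothetical.

```python
import math


def softmax(scores):
    """Turn raw attention scores into a probability distribution."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def copy_distribution(tokens, attention):
    """Sum attention mass over repeated tokens to get a per-word copy probability."""
    dist = {}
    for tok, a in zip(tokens, attention):
        dist[tok] = dist.get(tok, 0.0) + a
    return dist


def multi_source_pointer_step(title_tokens, title_scores,
                              knowledge_tokens, knowledge_scores, gate):
    """One decoding step: mix two copy distributions with a soft gate.

    Assumption: `gate` weights the title source and (1 - gate) weights the
    knowledge source. In a trained model the gate would be predicted from
    the decoder state; here it is supplied by the caller.
    """
    if not 0.0 <= gate <= 1.0:
        raise ValueError("gate must lie in [0, 1]")
    p_title = copy_distribution(title_tokens, softmax(title_scores))
    p_know = copy_distribution(knowledge_tokens, softmax(knowledge_scores))
    words = set(p_title) | set(p_know)
    # Only words present in one of the sources can receive probability,
    # so no out-of-source word can be introduced.
    return {w: gate * p_title.get(w, 0.0) + (1.0 - gate) * p_know.get(w, 0.0)
            for w in words}


if __name__ == "__main__":
    # Hypothetical example: a noisy title plus brand/commodity knowledge.
    title = ["nike", "mens", "running", "shoes", "free", "shipping"]
    knowledge = ["nike", "shoes"]
    step = multi_source_pointer_step(
        title, [1.0, 0.2, 0.8, 1.1, -1.0, -1.0],
        knowledge, [0.5, 0.5],
        gate=0.6,
    )
    for word, prob in sorted(step.items(), key=lambda kv: -kv[1]):
        print(f"{word}\t{prob:.3f}")
```

Because the output distribution is supported only on words from the two sources, this sketch mirrors constraint (i), and lowering the gate shifts probability toward the knowledge tokens, which is one way to favor constraint (ii).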

Index Terms

Information systems > Information retrieval > Retrieval tasks and goals > Summarization


Published in
CIKM '18: Proceedings of the 27th ACM International Conference on Information and Knowledge Management
October 2018, 2362 pages
ISBN: 9781450360142
DOI: 10.1145/3269206
General Chair:
Alfredo Cuzzocrea, University of Trieste, Italy
Program Chairs:
James Allan, University of Massachusetts, USA
Norman Paton, University of Manchester, United Kingdom
Divesh Srivastava, AT&T Labs Research, USA
Rakesh Agrawal, Data Insights Lab, USA
Andrei Broder, Google Research, USA
Mohammed Zaki, Rensselaer Polytechnic Institute, USA
Selcuk Candan, Arizona State University, USA
Alexandros Labrinidis, University of Pittsburgh, USA
Assaf Schuster, Technion, Israel
Haixun Wang, Google Research, USA
Copyright © 2018 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
extractive summarization
pointer network
title summarization
Qualifiers
- research-article
Acceptance Rates
CIKM '18 Paper Acceptance Rate: 147 of 826 submissions, 18%
Overall Acceptance Rate: 1,861 of 8,427 submissions, 22%