ABSTRACT
The number of natural language queries submitted to search engines is increasing as search environments get diversified. However, legacy search engines are still optimized for short keyword queries. Thus, the use of natural language queries at legacy search engines degrades the retrieval performance of the engines. This paper proposes a novel method to translate a natural language query into a keyword query relevant to the natural language query for retrieving better search results without change of the engines. The proposed method formulates the translation as a generation task. That is, the method generates a keyword query from a natural language query by preserving the semantics of the natural language query. A recurrent neural network encoder-decoder architecture is adopted as a generator of keyword queries from natural language queries. In addition, an attention mechanism is also used to cope with long natural language queries.
- I. Antonellis, H. G. Molina, and C. C. Chang. 2008. Simrank++: Query Rewriting Through Link Analysis of the Click Graph. Proceedings of VLDB, Vol. 1, 1 (2008), 408--421. Google ScholarDigital Library
- D. Bahdanau, K. Cho, and Y. Bengio. 2015. Neural Machine Translation by Jointly Learning to Align and Translate. Proceedings of ICLR.Google Scholar
- K. Cho, B. van Merrienboer, C. Gulcehre, D. Bahdanau, F. Bougares, H. Schwenk, and Y. Bengio. 2014. Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation. In Proceedings of EMNLP. 1724--1734. Google ScholarCross Ref
- S. Chopra, M. Auli, and A. M. Rush. 2016. Abstractive Sentence Summarization with Attentive Recurrent Neural Networks. Proceedings of NAACL. 93--98. Google ScholarCross Ref
- D. Ferrucci, E. Brown, J. Chu-Carroll, J. Fan, D. Gondek, A. A Kalyanpur, A. Lally, J. W. Murdock, E. Nyberg, and J. Prager. 2010. Building Watson: An overview of the DeepQA project. AI magazine, Vol. 31, 3 (2010), 59--79. Google ScholarCross Ref
- J. Gao and J.-Y. Nie. 2012. Towards Concept-Based Translation Models Using Search Logs for Query Expansion. Proceedings of CIKM. 1:1--1:10.Google Scholar
- A. Graves and J. Schmidhuber. 2005. Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Networks, Vol. 18, 5 (2005), 602--610. Google ScholarDigital Library
- Y. He, J. Tang, H. Ouyang, C. Kang, D. Yin, and Y. Chang. 2016. Learning to Rewrite Queries. In Proceedings of CIKM. 1443--1452. Google ScholarDigital Library
- S. Hochreiter and J. Schmidhuber. 1997. Long short-term memory. Neural computation, Vol. 9, 8 (1997), 1735--1780. Google ScholarDigital Library
- S. Huston and W. B. Croft. 2010. Evaluating Verbose Query Processing Techniques. In Proceedings of SIGIR. 291--298. Google ScholarDigital Library
- R. Jones, B. Rey, O. Madani, and W. Greiner. 2006. Generating query substitutions. In Proceedings of WWW. 387--396.Google Scholar
- G. Kumaran and V. R. Carvalho. 2009. Reducing Long Queries Using Query Quality Predictors Proceedings of SIGIR. 564--571.Google Scholar
- B. Li and I. King. 2010. Routing Questions to Appropriate Answerers in Community Question Answering Services. Proceedings of CIKM. 1585--1588. Google ScholarDigital Library
- T. Luong, H. Pham, and C. D. Manning. 2015. Effective Approaches to Attention-based Neural Machine Translation. Proceedings of EMNLP. 1412--1421. Google ScholarCross Ref
- K. T. Maxwell and W. B. Croft. 2013. Compact Query Term Selection Using Topically Related Text. Proceedings of SIGIR. 583--592. Google ScholarDigital Library
- A. Otegi, X. Arregi, O. Ansa, and E. Agirre. 2015. Using knowledge-based relatedness for information retrieval. Knowledge and Information Systems Vol. 44, 3 (2015), 689--718. Google ScholarDigital Library
- J. H. Park and W. B. Croft. 2010. Query Term Ranking Based on Dependency Parsing of Verbose Queries. Proceedings of SIGIR. 829--830. Google ScholarDigital Library
- S. Riezler and Y. Liu. 2010. Query Rewriting Using Monolingual Statistical Machine Translation. Computational Linguistics Vol. 36, 3 (2010), 569--582. Google ScholarDigital Library
- I. Sutskever, O. Vinyals, and Q. V Le. 2014. Sequence to sequence learning with neural networks. Proceedings of NIPS. 3104--3112.Google Scholar
- X. Wang and C. Zhai. 2008. Mining Term Association Patterns from Search Logs for Effective Query Reformulation. Proceedings of CIKM. 479--488. Google ScholarDigital Library
- G. Weikum and M. Theobald. 2010. From Information to Knowledge: Harvesting Entities and Relationships from Web Sources. Proceedings of SIGMOD/PODS. 65--76.Google Scholar
- J. Xu and W. B. Croft. 1996. Query Expansion Using Local and Global Document Analysis. Proceedings of SIGIR. 4--11. Google ScholarDigital Library
Index Terms
- Translation of Natural Language Query Into Keyword Query Using a RNN Encoder-Decoder
Recommendations
Neural Attention Learning for Legal Query Reformulation
ICAIL '19: Proceedings of the Seventeenth International Conference on Artificial Intelligence and LawQuery reformulation is the process of iteratively modifying a query to improve the quality of search engine results. In recent years, the task of reformulating natural language (NL) queries has received considerable diligence from both industry and ...
Location-aware query reformulation for search engines
Query reformulation, including query recommendation and query auto-completion, is a popular add-on feature of search engines, which provide related and helpful reformulations of a keyword query. Due to the dropping prices of smartphones and the ...
Evaluating verbose query processing techniques
SIGIR '10: Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrievalVerbose or long queries are a small but significant part of the query stream in web search, and are common in other applications such as collaborative question answering (CQA). Current search engines perform well with keyword queries but are not, in ...
Comments