skip to main content
10.1145/2959100.2959180acmconferencesArticle/Chapter ViewAbstractPublication PagesrecsysConference Proceedingsconference-collections
research-article
Public Access

Ask the GRU: Multi-task Learning for Deep Text Recommendations

Published: 07 September 2016 Publication History

Abstract

In a variety of application domains the content to be recommended to users is associated with text. This includes research papers, movies with associated plot summaries, news articles, blog posts, etc. Recommendation approaches based on latent factor models can be extended naturally to leverage text by employing an explicit mapping from text to factors. This enables recommendations for new, unseen content, and may generalize better, since the factors for all items are produced by a compactly-parametrized model. Previous work has used topic models or averages of word embeddings for this mapping. In this paper we present a method leveraging deep recurrent neural networks to encode the text sequence into a latent vector, specifically gated recurrent units (GRUs) trained end-to-end on the collaborative filtering task. For the task of scientific paper recommendation, this yields models with significantly higher accuracy. In cold-start scenarios, we beat the previous state-of-the-art, all of which ignore word order. Performance is further improved by multi-task learning, where the text encoder network is trained for a combination of content recommendation and item metadata prediction. This regularizes the collaborative filtering model, ameliorating the problem of sparsity of the observed rating matrix.

Supplementary Material

MP4 File (p107.mp4)

References

[1]
Ido Guy, Naama Zwerdling, Inbal Ronen, David Carmel, and Erel Uziel. Social media recommendation based on people and tags. In SIGIR, 2010.
[2]
Owen Phelan, Kevin McCarthy, and Barry Smyth. Using twitter to recommend real-time topical news. In RecSys, 2009.
[3]
Trapit Bansal, Mrinal Das, and Chiranjib Bhattacharyya. Content driven user profiling for comment-worthy recommendations of news and blog articles. In RecSys, 2015.
[4]
Julian McAuley and Jure Leskovec. Hidden factors and hidden topics: understanding rating dimensions with review text. In RecSys, 2013.
[5]
Chong Wang and David M Blei. Collaborative topic modeling for recommending scientific articles. In SIGKDD, 2011.
[6]
Yehuda Koren, Robert Bell, and Chris Volinsky. Matrix factorization techniques for recommender systems. Computer, (8): 30--37, 2009.
[7]
Andriy Mnih and Ruslan Salakhutdinov. Probabilistic matrix factorization. In NIPS, 2007.
[8]
nd Shoham(1997)}balabanovic1997fabMarko Balabanović and Yoav Shoham. Fab: content-based, collaborative recommendation. Communications of the ACM, 40 (3): 66--72, 1997.
[9]
Raymond J Mooney and Loriene Roy. Content-based book recommending using learning for text categorization. In ACM conference on Digital libraries, 2000.
[10]
Chumki Basu, Haym Hirsh, William Cohen, et al. Recommendation as classification: Using social and content-based information in recommendation. In AAAI, 1998.
[11]
Andrew I Schein, Alexandrin Popescul, Lyle H Ungar, and David M Pennock. Methods and metrics for cold-start recommendations. In SIGIR, 2002.
[12]
Justin Basilico and Thomas Hofmann. Unifying collaborative and content-based filtering. In ICML, 2004.
[13]
Hao Wang, Naiyan Wang, and Dit-Yan Yeung. Collaborative deep learning for recommender systems. In SIGKDD, 2015.
[14]
Prem Melville, Raymond J Mooney, and Ramadass Nagarajan. Content-boosted collaborative filtering for improved recommendations. In AAAI, 2002.
[15]
Prem K Gopalan, Laurent Charlin, and David Blei. Content-based recommendations with poisson factorization. In NIPS, 2014.
[16]
Deepak Agarwal and Bee-Chung Chen. Regression-based latent factor models. In SIGKDD, 2009.
[17]
Hanna M Wallach. Topic modeling: beyond bag-of-words. In ICML, 2006.
[18]
Paul J Werbos. Backpropagation through time: what it does and how to do it. Proceedings of the IEEE, 78 (10): 1550--1560, 1990.
[19]
Burget, Cernockỳ, and Khudanpur}mikolov2010recurrentTomas Mikolov, Martin Karafiát, Lukas Burget, Jan Cernockỳ, and Sanjeev Khudanpur. Recurrent neural network based language model. INTERSPEECH, 2010.
[20]
Kyunghyun Cho, Bart van Merrienboer, Caglar Gulcehre, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. Learning phrase representations using rnn encoder-decoder for statistical machine translation. In EMNLP, 2014.
[21]
Robert M Bell and Yehuda Koren. Lessons from the netflix prize challenge. SIGKDD Explorations Newsletter, 9 (2): 75--79, 2007.
[22]
Guang Ling, Michael R Lyu, and Irwin King. Ratings meet reviews, a combined approach to recommend. In RecSys, 2014.
[23]
Amjad Almahairi, Kyle Kastner, Kyunghyun Cho, and Aaron Courville. Learning distributed representations from reviews for collaborative filtering. In RecSys, 2015.
[24]
Jason Weston, Samy Bengio, and Nicolas Usunier. Wsabie: Scaling up to large vocabulary image annotation. In IJCAI, 2011.
[25]
Yifan Hu, Yehuda Koren, and Chris Volinsky. Collaborative filtering for implicit feedback datasets. In ICDM, 2008.
[26]
Steffen Rendle, Christoph Freudenthaler, Zeno Gantner, and Lars Schmidt-Thieme. Bpr: Bayesian personalized ranking from implicit feedback. In UAI, 2009.
[27]
Yue Shi, Martha Larson, and Alan Hanjalic. Collaborative filtering beyond the user-item matrix: A survey of the state of the art and future challenges. ACM Computing Surveys, 47 (1): 3, 2014.
[28]
Steffen Rendle. Factorization machines. In ICDM, 2010.
[29]
Zeno Gantner, Lucas Drumond, Christoph Freudenthaler, Steffen Rendle, and Lars Schmidt-Thieme. Learning attribute-to-feature mappings for cold-start recommendations. In ICDM, 2010.
[30]
Rich Caruana. Multitask learning. Machine learning, 28 (1): 41--75, 1997.
[31]
Ajit P Singh and Geoffrey J Gordon. Relational learning via collective matrix factorization. In SIGKDD, 2008.
[32]
Hao Ma, Haixuan Yang, Michael R Lyu, and Irwin King. Sorec: social recommendation using probabilistic matrix factorization. In CIKM, 2008.
[33]
Ralf Krestel, Peter Fankhauser, and Wolfgang Nejdl. Latent dirichlet allocation for tag recommendation. In RecSys, 2009.
[34]
Yoshua Bengio Ian Goodfellow and Aaron Courville. Deep learning. Book in prep. for MIT Press, 2016.
[35]
Ruslan Salakhutdinov, Andriy Mnih, and Geoffrey Hinton. Restricted boltzmann machines for collaborative filtering. In ICML, 2007.
[36]
Suvash Sedhain, Aditya Krishna Menon, Scott Sanner, and Lexing Xie. Autorec: Autoencoders meet collaborative filtering. In WWW, 2015.
[37]
Yao Wu, Christopher DuBois, Alice X. Zheng, and Martin Ester. Collaborative denoising auto-encoders for top-n recommender systems. In WSDM, 2016.
[38]
Ali Mamdouh Elkahky, Yang Song, and Xiaodong He. A multi-view deep learning approach for cross domain user modeling in recommendation systems. In WWW, 2015.
[39]
Gintare Karolina Dziugaite and Daniel M Roy. Neural network matrix factorization. arXiv preprint arXiv:1511.06443, 2015.
[40]
Aaron Van den Oord, Sander Dieleman, and Benjamin Schrauwen. Deep content-based music recommendation. In NIPS, 2013.
[41]
Xinxi Wang and Ye Wang. Improving content-based and hybrid music recommendation using deep learning. In International Conference on Multimedia, 2014.
[42]
Jason Weston, Sumit Chopra, and Keith Adams.# tagspace: Semantic embeddings from hashtags. 2014.
[43]
R. He and J. McAuley. VBPR: visual bayesian personalized ranking from implicit feedback. In AAAI, 2016.
[44]
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean. Distributed representations of words and phrases and their compositionality. In NIPS, pages 3111--3119, 2013.
[45]
Ronan Collobert, Jason Weston, Léon Bottou, Michael Karlen, Koray Kavukcuoglu, and Pavel Kuksa. Natural language processing (almost) from scratch. JMLR, 12: 2493--2537, 2011.
[46]
Ilya Sutskever, Oriol Vinyals, and Quoc V Le. Sequence to sequence learning with neural networks. In NIPS, pages 3104--3112, 2014.
[47]
Andrew M Dai and Quoc V Le. Semi-supervised sequence learning. In NIPS, pages 3061--3069, 2015.
[48]
Yoshua Bengio, Patrice Simard, and Paolo Frasconi. Learning long-term dependencies with gradient descent is difficult. Neural Networks, 5 (2): 157--166, 1994.
[49]
Sepp Hochreiter and Jürgen Schmidhuber. Long short-term memory. Neural computation, 9 (8): 1735--1780, 1997.
[50]
Junyoung Chung, Caglar Gulcehre, KyungHyun Cho, and Yoshua Bengio. Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv preprint arXiv:1412.3555, 2014.
[51]
Rafal Jozefowicz, Wojciech Zaremba, and Ilya Sutskever. An empirical exploration of recurrent network architectures. In ICML, 2015.
[52]
Mike Schuster and Kuldip K Paliwal. Bidirectional recurrent neural networks. Signal Processing, 45 (11): 2673--2681, 1997.
[53]
Arthur P Dempster, Nan M Laird, and Donald B Rubin. Maximum likelihood from incomplete data via the em algorithm. Journal of the royal statistical society., pages 1--38, 1977.
[54]
Diederik Kingma and Jimmy Ba. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.
[55]
Misha Denil, Alban Demiraj, and Nando de Freitas. Extraction of salient sentences from labelled documents. arXiv preprint arXiv:1412.6815, 2014.
[56]
Jiwei Li, Xinlei Chen, Eduard Hovy, and Dan Jurafsky. Visualizing and understanding neural models in nlp. 2016.

Cited By

View all
  • (2024)Fair resource allocation in multi-task learningProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3692179(2715-2731)Online publication date: 21-Jul-2024
  • (2024)Secured Smart Meal Delivery System for Women's SafetyAI Tools and Applications for Women’s Safety10.4018/979-8-3693-1435-7.ch017(275-290)Online publication date: 19-Jan-2024
  • (2024)ICA-CRMAS: Intelligent Context-Awareness Approach for Citation Recommendation based on Multi-Agent SystemACM Transactions on Management Information Systems10.1145/368028715:3(1-52)Online publication date: 1-Aug-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
RecSys '16: Proceedings of the 10th ACM Conference on Recommender Systems
September 2016
490 pages
ISBN:9781450340359
DOI:10.1145/2959100
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 September 2016

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. cold start
  2. deep learning
  3. multi-task learning
  4. neural networks
  5. recommender systems

Qualifiers

  • Research-article

Funding Sources

Conference

RecSys '16
Sponsor:
RecSys '16: Tenth ACM Conference on Recommender Systems
September 15 - 19, 2016
Massachusetts, Boston, USA

Acceptance Rates

RecSys '16 Paper Acceptance Rate 29 of 159 submissions, 18%;
Overall Acceptance Rate 254 of 1,295 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)643
  • Downloads (Last 6 weeks)72
Reflects downloads up to 08 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Fair resource allocation in multi-task learningProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3692179(2715-2731)Online publication date: 21-Jul-2024
  • (2024)Secured Smart Meal Delivery System for Women's SafetyAI Tools and Applications for Women’s Safety10.4018/979-8-3693-1435-7.ch017(275-290)Online publication date: 19-Jan-2024
  • (2024)ICA-CRMAS: Intelligent Context-Awareness Approach for Citation Recommendation based on Multi-Agent SystemACM Transactions on Management Information Systems10.1145/368028715:3(1-52)Online publication date: 1-Aug-2024
  • (2024)Breaking News Identification Using a Learning-Based Approach2024 Parul International Conference on Engineering and Technology (PICET)10.1109/PICET60765.2024.10716115(1-6)Online publication date: 3-May-2024
  • (2024)Application and Research Based on Exponential Weighted Moving Averages and GRU Modeling2024 IEEE 2nd International Conference on Sensors, Electronics and Computer Engineering (ICSECE)10.1109/ICSECE61636.2024.10729530(795-800)Online publication date: 29-Aug-2024
  • (2024)GCTransNet: Combining Graph Convolutional Networks and Transformers for High-Performance and Rapidly Converging Link Prediction2024 IEEE 4th International Conference on Electronic Technology, Communication and Information (ICETCI)10.1109/ICETCI61221.2024.10594469(377-383)Online publication date: 24-May-2024
  • (2024)Hybrid Filtering-Based Product Recommendation System Integrating GRU and BFGS Optimization2024 International Conference on Electronic Systems and Intelligent Computing (ICESIC)10.1109/ICESIC61777.2024.10846187(302-307)Online publication date: 22-Nov-2024
  • (2024)An Intelligent Risk Assessment Model Based on NLP2024 4th International Conference on Consumer Electronics and Computer Engineering (ICCECE)10.1109/ICCECE61317.2024.10504168(375-382)Online publication date: 12-Jan-2024
  • (2024)Research on Short Term Load Forecasting of User Side of Combined Cooling Heating and Power System2024 IEEE 13th Data Driven Control and Learning Systems Conference (DDCLS)10.1109/DDCLS61622.2024.10606699(941-946)Online publication date: 17-May-2024
  • (2024)Fault Diagnosis of Rolling Bearings Based on Cross Attention Network with Multi-Scale Feature Fusion2024 Cross Strait Radio Science and Wireless Technology Conference (CSRSWTC)10.1109/CSRSWTC64338.2024.10811569(1-3)Online publication date: 4-Nov-2024
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media