research-article

Public Access

Ask the GRU: Multi-task Learning for Deep Text Recommendations

Authors:

David Belanger,

Andrew McCallumAuthors Info & Claims

RecSys '16: Proceedings of the 10th ACM Conference on Recommender Systems

Pages 107 - 114

https://doi.org/10.1145/2959100.2959180

Published: 07 September 2016 Publication History

Abstract

In a variety of application domains the content to be recommended to users is associated with text. This includes research papers, movies with associated plot summaries, news articles, blog posts, etc. Recommendation approaches based on latent factor models can be extended naturally to leverage text by employing an explicit mapping from text to factors. This enables recommendations for new, unseen content, and may generalize better, since the factors for all items are produced by a compactly-parametrized model. Previous work has used topic models or averages of word embeddings for this mapping. In this paper we present a method leveraging deep recurrent neural networks to encode the text sequence into a latent vector, specifically gated recurrent units (GRUs) trained end-to-end on the collaborative filtering task. For the task of scientific paper recommendation, this yields models with significantly higher accuracy. In cold-start scenarios, we beat the previous state-of-the-art, all of which ignore word order. Performance is further improved by multi-task learning, where the text encoder network is trained for a combination of content recommendation and item metadata prediction. This regularizes the collaborative filtering model, ameliorating the problem of sparsity of the observed rating matrix.

Supplementary Material

MP4 File (p107.mp4)

Download
1141.02 MB

References

[1]

Ido Guy, Naama Zwerdling, Inbal Ronen, David Carmel, and Erel Uziel. Social media recommendation based on people and tags. In SIGIR, 2010.

Digital Library

[2]

Owen Phelan, Kevin McCarthy, and Barry Smyth. Using twitter to recommend real-time topical news. In RecSys, 2009.

Digital Library

[3]

Trapit Bansal, Mrinal Das, and Chiranjib Bhattacharyya. Content driven user profiling for comment-worthy recommendations of news and blog articles. In RecSys, 2015.

Digital Library

[4]

Julian McAuley and Jure Leskovec. Hidden factors and hidden topics: understanding rating dimensions with review text. In RecSys, 2013.

Digital Library

[5]

Chong Wang and David M Blei. Collaborative topic modeling for recommending scientific articles. In SIGKDD, 2011.

Digital Library

[6]

Yehuda Koren, Robert Bell, and Chris Volinsky. Matrix factorization techniques for recommender systems. Computer, (8): 30--37, 2009.

Digital Library

[7]

Andriy Mnih and Ruslan Salakhutdinov. Probabilistic matrix factorization. In NIPS, 2007.

Digital Library

[8]

nd Shoham(1997)}balabanovic1997fabMarko Balabanović and Yoav Shoham. Fab: content-based, collaborative recommendation. Communications of the ACM, 40 (3): 66--72, 1997.

Digital Library

[9]

Raymond J Mooney and Loriene Roy. Content-based book recommending using learning for text categorization. In ACM conference on Digital libraries, 2000.

Digital Library

[10]

Chumki Basu, Haym Hirsh, William Cohen, et al. Recommendation as classification: Using social and content-based information in recommendation. In AAAI, 1998.

Digital Library

[11]

Andrew I Schein, Alexandrin Popescul, Lyle H Ungar, and David M Pennock. Methods and metrics for cold-start recommendations. In SIGIR, 2002.

Digital Library

[12]

Justin Basilico and Thomas Hofmann. Unifying collaborative and content-based filtering. In ICML, 2004.

Digital Library

[13]

Hao Wang, Naiyan Wang, and Dit-Yan Yeung. Collaborative deep learning for recommender systems. In SIGKDD, 2015.

Digital Library

[14]

Prem Melville, Raymond J Mooney, and Ramadass Nagarajan. Content-boosted collaborative filtering for improved recommendations. In AAAI, 2002.

Digital Library

[15]

Prem K Gopalan, Laurent Charlin, and David Blei. Content-based recommendations with poisson factorization. In NIPS, 2014.

Digital Library

[16]

Deepak Agarwal and Bee-Chung Chen. Regression-based latent factor models. In SIGKDD, 2009.

Digital Library

[17]

Hanna M Wallach. Topic modeling: beyond bag-of-words. In ICML, 2006.

Digital Library

[18]

Paul J Werbos. Backpropagation through time: what it does and how to do it. Proceedings of the IEEE, 78 (10): 1550--1560, 1990.

[19]

Burget, Cernockỳ, and Khudanpur}mikolov2010recurrentTomas Mikolov, Martin Karafiát, Lukas Burget, Jan Cernockỳ, and Sanjeev Khudanpur. Recurrent neural network based language model. INTERSPEECH, 2010.

[20]

Kyunghyun Cho, Bart van Merrienboer, Caglar Gulcehre, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. Learning phrase representations using rnn encoder-decoder for statistical machine translation. In EMNLP, 2014.

[21]

Robert M Bell and Yehuda Koren. Lessons from the netflix prize challenge. SIGKDD Explorations Newsletter, 9 (2): 75--79, 2007.

Digital Library

[22]

Guang Ling, Michael R Lyu, and Irwin King. Ratings meet reviews, a combined approach to recommend. In RecSys, 2014.

Digital Library

[23]

Amjad Almahairi, Kyle Kastner, Kyunghyun Cho, and Aaron Courville. Learning distributed representations from reviews for collaborative filtering. In RecSys, 2015.

Digital Library

[24]

Jason Weston, Samy Bengio, and Nicolas Usunier. Wsabie: Scaling up to large vocabulary image annotation. In IJCAI, 2011.

Digital Library

[25]

Yifan Hu, Yehuda Koren, and Chris Volinsky. Collaborative filtering for implicit feedback datasets. In ICDM, 2008.

Digital Library

[26]

Steffen Rendle, Christoph Freudenthaler, Zeno Gantner, and Lars Schmidt-Thieme. Bpr: Bayesian personalized ranking from implicit feedback. In UAI, 2009.

Digital Library

[27]

Yue Shi, Martha Larson, and Alan Hanjalic. Collaborative filtering beyond the user-item matrix: A survey of the state of the art and future challenges. ACM Computing Surveys, 47 (1): 3, 2014.

Digital Library

[28]

Steffen Rendle. Factorization machines. In ICDM, 2010.

Digital Library

[29]

Zeno Gantner, Lucas Drumond, Christoph Freudenthaler, Steffen Rendle, and Lars Schmidt-Thieme. Learning attribute-to-feature mappings for cold-start recommendations. In ICDM, 2010.

Digital Library

[30]

Rich Caruana. Multitask learning. Machine learning, 28 (1): 41--75, 1997.

Digital Library

[31]

Ajit P Singh and Geoffrey J Gordon. Relational learning via collective matrix factorization. In SIGKDD, 2008.

Digital Library

[32]

Hao Ma, Haixuan Yang, Michael R Lyu, and Irwin King. Sorec: social recommendation using probabilistic matrix factorization. In CIKM, 2008.

Digital Library

[33]

Ralf Krestel, Peter Fankhauser, and Wolfgang Nejdl. Latent dirichlet allocation for tag recommendation. In RecSys, 2009.

Digital Library

[34]

Yoshua Bengio Ian Goodfellow and Aaron Courville. Deep learning. Book in prep. for MIT Press, 2016.

[35]

Ruslan Salakhutdinov, Andriy Mnih, and Geoffrey Hinton. Restricted boltzmann machines for collaborative filtering. In ICML, 2007.

Digital Library

[36]

Suvash Sedhain, Aditya Krishna Menon, Scott Sanner, and Lexing Xie. Autorec: Autoencoders meet collaborative filtering. In WWW, 2015.

Digital Library

[37]

Yao Wu, Christopher DuBois, Alice X. Zheng, and Martin Ester. Collaborative denoising auto-encoders for top-n recommender systems. In WSDM, 2016.

Digital Library

[38]

Ali Mamdouh Elkahky, Yang Song, and Xiaodong He. A multi-view deep learning approach for cross domain user modeling in recommendation systems. In WWW, 2015.

Digital Library

[39]

Gintare Karolina Dziugaite and Daniel M Roy. Neural network matrix factorization. arXiv preprint arXiv:1511.06443, 2015.

[40]

Aaron Van den Oord, Sander Dieleman, and Benjamin Schrauwen. Deep content-based music recommendation. In NIPS, 2013.

Digital Library

[41]

Xinxi Wang and Ye Wang. Improving content-based and hybrid music recommendation using deep learning. In International Conference on Multimedia, 2014.

Digital Library

[42]

Jason Weston, Sumit Chopra, and Keith Adams.# tagspace: Semantic embeddings from hashtags. 2014.

[43]

R. He and J. McAuley. VBPR: visual bayesian personalized ranking from implicit feedback. In AAAI, 2016.

Digital Library

[44]

Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean. Distributed representations of words and phrases and their compositionality. In NIPS, pages 3111--3119, 2013.

Digital Library

[45]

Ronan Collobert, Jason Weston, Léon Bottou, Michael Karlen, Koray Kavukcuoglu, and Pavel Kuksa. Natural language processing (almost) from scratch. JMLR, 12: 2493--2537, 2011.

Digital Library

[46]

Ilya Sutskever, Oriol Vinyals, and Quoc V Le. Sequence to sequence learning with neural networks. In NIPS, pages 3104--3112, 2014.

Digital Library

[47]

Andrew M Dai and Quoc V Le. Semi-supervised sequence learning. In NIPS, pages 3061--3069, 2015.

Digital Library

[48]

Yoshua Bengio, Patrice Simard, and Paolo Frasconi. Learning long-term dependencies with gradient descent is difficult. Neural Networks, 5 (2): 157--166, 1994.

Digital Library

[49]

Sepp Hochreiter and Jürgen Schmidhuber. Long short-term memory. Neural computation, 9 (8): 1735--1780, 1997.

Digital Library

[50]

Junyoung Chung, Caglar Gulcehre, KyungHyun Cho, and Yoshua Bengio. Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv preprint arXiv:1412.3555, 2014.

[51]

Rafal Jozefowicz, Wojciech Zaremba, and Ilya Sutskever. An empirical exploration of recurrent network architectures. In ICML, 2015.

Digital Library

[52]

Mike Schuster and Kuldip K Paliwal. Bidirectional recurrent neural networks. Signal Processing, 45 (11): 2673--2681, 1997.

Digital Library

[53]

Arthur P Dempster, Nan M Laird, and Donald B Rubin. Maximum likelihood from incomplete data via the em algorithm. Journal of the royal statistical society., pages 1--38, 1977.

[54]

Diederik Kingma and Jimmy Ba. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.

[55]

Misha Denil, Alban Demiraj, and Nando de Freitas. Extraction of salient sentences from labelled documents. arXiv preprint arXiv:1412.6815, 2014.

[56]

Jiwei Li, Xinlei Chen, Eduard Hovy, and Dan Jurafsky. Visualizing and understanding neural models in nlp. 2016.

Cited By

Ban HJi KSalakhutdinov RKolter ZHeller KWeller AOliver NScarlett JBerkenkamp F(2024)Fair resource allocation in multi-task learningProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3692179(2715-2731)Online publication date: 21-Jul-2024
https://dl.acm.org/doi/10.5555/3692070.3692179
Kumar CSakthivelu UNaresh RSenthil Kumar S(2024)Secured Smart Meal Delivery System for Women's SafetyAI Tools and Applications for Women’s Safety10.4018/979-8-3693-1435-7.ch017(275-290)Online publication date: 19-Jan-2024
https://doi.org/10.4018/979-8-3693-1435-7.ch017
Degha HLaallam F(2024)ICA-CRMAS: Intelligent Context-Awareness Approach for Citation Recommendation based on Multi-Agent SystemACM Transactions on Management Information Systems10.1145/368028715:3(1-52)Online publication date: 1-Aug-2024
https://dl.acm.org/doi/10.1145/3680287
Show More Cited By

Index Terms

Ask the GRU: Multi-task Learning for Deep Text Recommendations
1. Human-centered computing
  1. Collaborative and social computing
    1. Collaborative and social computing theory, concepts and paradigms
      1. Collaborative filtering
2. Information systems
  1. World Wide Web
    1. Web searching and information discovery
      1. Personalization

Recommendations

User-Specific Feature-Based Similarity Models for Top-n Recommendation of New Items
Survey Paper, Regular Papers and Special Section on Participatory Sensing and Crowd Intelligence

Recommending new items for suitable users is an important yet challenging problem due to the lack of preference history for the new items. Noncollaborative user modeling techniques that rely on the item features can be used to recommend new items. ...
Naïve filterbots for robust cold-start recommendations
KDD '06: Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining

The goal of a recommender system is to suggest items of interest to a user based on historical behavior of a community of users. Given detailed enough history, item-based collaborative filtering (CF) often performs as well or better than almost any ...
Merging trust in collaborative filtering to alleviate data sparsity and cold start

Providing high quality recommendations is important for e-commerce systems to assist users in making effective selection decisions from a plethora of choices. Collaborative filtering is a widely accepted technique to generate recommendations based on ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

RecSys '16: Proceedings of the 10th ACM Conference on Recommender Systems

September 2016

490 pages

ISBN:9781450340359

DOI:10.1145/2959100

General Chairs:
Shilad Sen
Macalester College, USA
,
Werner Geyer
IBM Research Cambridge, USA
,
Program Chairs:
Jill Freyne
CSIRO, Australia
,
Pablo Castells
Universidad Autónoma de Madrid, Spain

Copyright © 2016 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

SIGCHI: ACM Special Interest Group on Computer-Human Interaction

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 September 2016

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

NSF DMREF
The Allen Institute for Artificial Intelligence
National Science Foundation, Mallet
Defense Advanced Research Projects Agency

Conference

RecSys '16

Sponsor:

SIGCHI

RecSys '16: Tenth ACM Conference on Recommender Systems

September 15 - 19, 2016

Massachusetts, Boston, USA

Acceptance Rates

RecSys '16 Paper Acceptance Rate 29 of 159 submissions, 18%;

Overall Acceptance Rate 254 of 1,295 submissions, 20%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

239
Total Citations
View Citations
6,328
Total Downloads

Downloads (Last 12 months)643
Downloads (Last 6 weeks)72

Reflects downloads up to 08 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

Ban HJi KSalakhutdinov RKolter ZHeller KWeller AOliver NScarlett JBerkenkamp F(2024)Fair resource allocation in multi-task learningProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3692179(2715-2731)Online publication date: 21-Jul-2024
https://dl.acm.org/doi/10.5555/3692070.3692179
Kumar CSakthivelu UNaresh RSenthil Kumar S(2024)Secured Smart Meal Delivery System for Women's SafetyAI Tools and Applications for Women’s Safety10.4018/979-8-3693-1435-7.ch017(275-290)Online publication date: 19-Jan-2024
https://doi.org/10.4018/979-8-3693-1435-7.ch017
Degha HLaallam F(2024)ICA-CRMAS: Intelligent Context-Awareness Approach for Citation Recommendation based on Multi-Agent SystemACM Transactions on Management Information Systems10.1145/368028715:3(1-52)Online publication date: 1-Aug-2024
https://dl.acm.org/doi/10.1145/3680287
Rout DPanda MSahoo PPanda S(2024)Breaking News Identification Using a Learning-Based Approach2024 Parul International Conference on Engineering and Technology (PICET)10.1109/PICET60765.2024.10716115(1-6)Online publication date: 3-May-2024
https://doi.org/10.1109/PICET60765.2024.10716115
Huang XWang ZAi YFeng J(2024)Application and Research Based on Exponential Weighted Moving Averages and GRU Modeling2024 IEEE 2nd International Conference on Sensors, Electronics and Computer Engineering (ICSECE)10.1109/ICSECE61636.2024.10729530(795-800)Online publication date: 29-Aug-2024
https://doi.org/10.1109/ICSECE61636.2024.10729530
Chen M(2024)GCTransNet: Combining Graph Convolutional Networks and Transformers for High-Performance and Rapidly Converging Link Prediction2024 IEEE 4th International Conference on Electronic Technology, Communication and Information (ICETCI)10.1109/ICETCI61221.2024.10594469(377-383)Online publication date: 24-May-2024
https://doi.org/10.1109/ICETCI61221.2024.10594469
Suresh AKumar RNagaraju DSundaram KAnandan B(2024)Hybrid Filtering-Based Product Recommendation System Integrating GRU and BFGS Optimization2024 International Conference on Electronic Systems and Intelligent Computing (ICESIC)10.1109/ICESIC61777.2024.10846187(302-307)Online publication date: 22-Nov-2024
https://doi.org/10.1109/ICESIC61777.2024.10846187
Yang XZhang HHuang X(2024)An Intelligent Risk Assessment Model Based on NLP2024 4th International Conference on Consumer Electronics and Computer Engineering (ICCECE)10.1109/ICCECE61317.2024.10504168(375-382)Online publication date: 12-Jan-2024
https://doi.org/10.1109/ICCECE61317.2024.10504168
Guo YWang JLi XFan SKang JChen H(2024)Research on Short Term Load Forecasting of User Side of Combined Cooling Heating and Power System2024 IEEE 13th Data Driven Control and Learning Systems Conference (DDCLS)10.1109/DDCLS61622.2024.10606699(941-946)Online publication date: 17-May-2024
https://doi.org/10.1109/DDCLS61622.2024.10606699
Tao TJia XSu DZhang XGao JXu RMo LWang X(2024)Fault Diagnosis of Rolling Bearings Based on Cross Attention Network with Multi-Scale Feature Fusion2024 Cross Strait Radio Science and Wireless Technology Conference (CSRSWTC)10.1109/CSRSWTC64338.2024.10811569(1-3)Online publication date: 4-Nov-2024
https://doi.org/10.1109/CSRSWTC64338.2024.10811569
Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Figures

Tables

Media

View Table of Conten