short-paper

Efficient online recommendation via low-rank ensemble sampling

Authors:
Xiuyuan Lu

Stanford University

Stanford University
View Profile

,
Zheng Wen

Adobe Research

Adobe Research
View Profile

,
Branislav Kveton

Google Research

Google Research
View Profile

RecSys '18: Proceedings of the 12th ACM Conference on Recommender SystemsSeptember 2018Pages 460–464https://doi.org/10.1145/3240323.3240408

Published:27 September 2018Publication History

RecSys '18: Proceedings of the 12th ACM Conference on Recommender Systems

Pages 460–464

ABSTRACT

The low-rank structure is one of the most prominent features in modern recommendation problems. In this paper, we consider an online learning problem with a low-rank expected reward matrix where both row features and column features are unknown a priori, and the agent aims to learn to choose the best row-column pair (i.e. the maximum entry) in the matrix. We develop a novel online recommendation algorithm based on ensemble sampling, a recently developed computationally efficient approximation of Thompson sampling. Our computational results show that our algorithm consistently achieves order-of-magnitude improvements over the baselines in both synthetic and real-world experiments.

References

S. Agrawal and N. Goyal. 2013. Thompson Sampling for Contextual Bandits with Linear Payoffs. In Proceedings of The 30th International Conference on Machine Learning. 127--135. Google ScholarDigital Library
Samuel Burer and Renato D.C. Monteiro. 2005. Local Minima and Convergence in Low-Rank Semidefinite Programming. Mathematical Programming 103, 3 (01 Jul 2005), 427--444. Google ScholarDigital Library
Emmanuel J. Candès and Benjamin Recht. 2009. Exact Matrix Completion via Convex Optimization. Foundations of Computational Mathematics 9, 6 (03 Apr 2009), 717.Google Scholar
Olivier Chapelle and Lihong Li. 2011. An Empirical Evaluation of Thompson Sampling. In Advances in Neural Information Processing Systems 24. 2249--2257. Google ScholarDigital Library
M. A. Davenport and J. Romberg. 2016. An Overview of Low-Rank Matrix Recovery From Incomplete Observations. IEEE Journal of Selected Topics in Signal Processing 10, 4 (June 2016), 608--622.Google ScholarCross Ref
D. Gross. 2011. Recovering Low-Rank Matrices From Few Coefficients in Any Basis. IEEE Trans. Inf. Theor. 57, 3 (March 2011), 1548--1566. Google ScholarDigital Library
F. Maxwell Harper and Joseph A. Konstan. 2015. The Movie Lens Datasets: History and Context. ACM Trans. Interact. Intell. Syst. 5, 4, Article 19 (Dec. 2015), 19 pages. Google ScholarDigital Library
Prateek Jain, Praneeth Netrapalli, and Sujay Sanghavi. 2013. Low-rank Matrix Completion Using Alternating Minimization. In Proceedings of the Forty-fifth Annual ACM Symposium on Theory of Computing (STOC '13). ACM, New York, NY, USA, 665--674. Google ScholarDigital Library
Dietmar Jannach, Markus Zanker, Alexander Felfernig, and Gerhard Friedrich. 2010. Recommender Systems: An Introduction (1st ed.). Cambridge University Press, New York, NY, USA. Google ScholarDigital Library
Sumeet Katariya, Branislav Kveton, Csaba Szepesvári, Claire Vernade, and Zheng Wen. 2017. Bernoulli Rank-1 Bandits for Click Feedback. In Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, IJCAI 2017, Melbourne, Australia, August 19--25, 2017. 2001--2007. Google ScholarDigital Library
Sumeet Katariya, Branislav Kveton, Csaba Szepesvari, Claire Vernade, and Zheng Wen. 2017. Stochastic Rank-1 Bandits. In Proceedings of the 20th International Conference on Artificial Intelligence and Statistics. PMLR, Fort Lauderdale, FL, USA.Google Scholar
Jaya Kawale, Hung H Bui, Branislav Kveton, Long Tran-Thanh, and Sanjay Chawla. 2015. Efficient Thompson Sampling for Online Matrix-Factorization Recommendation. In Advances in Neural Information Processing Systems 28, C. Cortes, N. D. Lawrence, D. D. Lee, M. Sugiyama, and R. Garnett (Eds.). Curran Associates, Inc., 1297--1305. Google ScholarDigital Library
R. H. Keshavan, A. Montanari, and S. Oh. 2010. Matrix Completion From a Few Entries. IEEE Transactions on Information Theory 56, 6 (June 2010), 2980--2998. Google ScholarDigital Library
Branislav Kveton, Csaba Szepesvári, Anup Rao, Zheng Wen, Yasin Abbasi-Yadkori, and S. Muthukrishnan. 2017. Stochastic Low-Rank Bandits. CoRR abs/1712.04644 (2017). http://arxiv.org/abs/1712.04644Google Scholar
Shuai Li, Alexandros Karatzoglou, and Claudio Gentile. 2016. Collaborative Filtering Bandits. In Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '16). ACM, New York, NY, USA, 539--548. Google ScholarDigital Library
Xiuyuan Lu and Benjamin Van Roy. 2017. Ensemble Sampling. In Advances in Neural Information Processing Systems 30, I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett (Eds.). Curran Associates, Inc., 3258--3266.Google Scholar
Andriy Mnih and Ruslan R Salakhutdinov. 2008. Probabilistic Matrix Factorization. In Advances in Neural Information Processing Systems 20, J. C. Platt, D. Koller, Y. Singer, and S. T. Roweis (Eds.). Curran Associates, Inc., 1257--1264. Google ScholarDigital Library
Benjamin Recht and Christopher Ré. 2013. Parallel stochastic gradient algorithms for large-scale matrix completion. Mathematical Programming Computation 5, 2 (01 Jun 2013), 201--226.Google Scholar
Daniel Russo, Benjamin Van Roy, Abbas Kazerouni, Ian Osband, and Zheng Wen. 2018. A Tutorial on Thompson Sampling. CoRR (2018). Google ScholarDigital Library
D. Russo and B. Van Roy. 2014. Learning to optimize via posterior sampling. Mathematics of Operations Research 39, 4 (2014), 1221--1243.Google ScholarDigital Library
Daniel Russo and Benjamin Van Roy. 2016. An Information-theoretic Analysis of Thompson Sampling. J. Mach. Learn. Res. 17, 1 (Jan. 2016), 2442--2471. Google ScholarDigital Library
Ruslan Salakhutdinov and Andriy Mnih. 2008. Bayesian Probabilistic Matrix Factorization Using Markov Chain Monte Carlo. In Proceedings of the 25th International Conference on Machine Learning (ICML '08). ACM, New York, NY, USA, 880--887. Google ScholarDigital Library
Rajat Sen, Karthikeyan Shanmugam, Murat Kocaoglu, Alexandros G. Dimakis, and Sanjay Shakkottai. 2016. Latent Contextual Bandits: A Non-Negative Matrix Factorization Approach. CoRR abs/1606.00119 (2016).Google Scholar
Xiaoxue Zhao, Weinan Zhang, and Jun Wang. 2013. Interactive Collaborative Filtering. In Proceedings of the 22Nd ACM International Conference on Information & Knowledge Management (CIKM '13). 1411--1420. Google ScholarDigital Library

Recommendations

Ensemble Recommendations via Thompson Sampling: an Experimental Study within e-Commerce
IUI '18: Proceedings of the 23rd International Conference on Intelligent User Interfaces

This work presents an extension of Thompson Sampling bandit policy for orchestrating the collection of base recommendation algorithms for e-commerce. We focus on the problem of item-to-item recommendations, for which multiple behavioral and attribute-...
Read More
Compressing Rank-Structured Matrices via Randomized Sampling

Randomized sampling has recently been proven a highly efficient technique for computing approximate factorizations of matrices that have low numerical rank. This paper describes an extension of such techniques to a wider class of matrices that are not ...
Read More
Generating virtual ratings from chinese reviews to augment online recommendations
Special section on twitter and microblogging services, social recommender systems, and CAMRa2010: Movie recommendation in context

Collaborative filtering (CF) recommenders based on User-Item rating matrix as explicitly obtained from end users have recently appeared promising in recommender systems. However, User-Item rating matrix is not always available or very sparse in some web ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
RecSys '18: Proceedings of the 12th ACM Conference on Recommender Systems
September 2018
600 pages
ISBN:9781450359016
DOI:10.1145/3240323
General Chairs:
Sole Pera
Boise State University
,
Michael Ekstrand
Boise State University
,
Program Chairs:
Xavier Amatriain
Curai, USA
,
John O'Donovan
University of California, Santa Barbara
Copyright © 2018 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 27 September 2018
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
ensemble sampling
low rank
multi-armed bandit
online recommendation
thompson sampling
Qualifiers
- short-paper
Conference

Acceptance Rates
RecSys '18 Paper Acceptance Rate32of181submissions,18%Overall Acceptance Rate254of1,295submissions,20%
More
Upcoming Conference
RecSys '24

Sponsor:

sigchi

18th ACM Conference on Recommender Systems

October 14 - 18, 2024

Bari , Italy
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 8
  Total Citations
  View Citations
- 491
  Total Downloads
- Downloads (Last 12 months)36
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Efficient online recommendation via low-rank ensemble sampling

RecSys '18: Proceedings of the 12th ACM Conference on Recommender Systems

ABSTRACT

References

Cited By

Recommendations

Ensemble Recommendations via Thompson Sampling: an Experimental Study within e-Commerce

Compressing Rank-Structured Matrices via Randomized Sampling

Generating virtual ratings from chinese reviews to augment online recommendations