ABSTRACT
We analyze the application of ensemble learning to recommender systems on the Netflix Prize dataset. For our analysis we use a set of diverse state-of-the-art collaborative filtering (CF) algorithms, which include: SVD, Neighborhood Based Approaches, Restricted Boltzmann Machine, Asymmetric Factor Model and Global Effects. We show that linearly combining (blending) a set of CF algorithms increases the accuracy and outperforms any single CF algorithm. Furthermore, we show how to use ensemble methods for blending predictors in order to outperform a single blending algorithm. The dataset and the source code for the ensemble blending are available online.
Supplemental Material
- R. M. Bell and Y. Koren. Scalable collaborative filtering with jointly derived neighborhood interpolation weights. In ICDM '07: Proceedings of the 2007 Seventh IEEE International Conference on Data Mining, pages 43--52, Washington, DC, USA, 2007. IEEE Computer Society. Google ScholarDigital Library
- J. Bennet and S. Lanning. The netflix prize. KDD Cup workshop, 2007. "http://www.netflixprize.com".Google Scholar
- L. Breiman. Bagging predictors. In Machine Learning, pages 123--140, 1996. Google ScholarDigital Library
- L. Breiman. Random forests. Machine Learning, 45:5--32, 2001. Google ScholarDigital Library
- R. Caruana, A. Niculescu-Mizil, G. Crew, and A. Ksikes. Ensemble selection from libraries of models. In In Proceedings of the 21st International Conference on Machine Learning, pages 137--144. ACM Press, 2004. Google ScholarDigital Library
- D. A. Davis, N. V. Chawla, N. A. Christakis, and A.-L. Barabási. Time to CARE: a collaborative engine for practical disease prediction. Springer, November 2009.Google Scholar
- J. Friedman. Stochastic gradient boosting. Computational Statistics and Data Analysis, 2002. Google ScholarDigital Library
- P. Geurts, D. Ernst, and L. Wehenkel. Extremely randomized trees. Mach. Learn., 63(1):3--42, 2006. Google ScholarDigital Library
- M. Jahrer. ELF - Ensemble Learning Framework. An open source C++ framework for supervised learning. http://elf-project.sourceforge.net, 2010.Google Scholar
- Y. Koren. Factorization meets the neighborhood: a multifaceted collaborative filtering model. In KDD '08: Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 426--434. ACM, 2008. Google ScholarDigital Library
- Y. Koren. The BellKor solution to the Netflix Grand Prize, 2009.Google Scholar
- Y. Koren. Collaborative filtering with temporal dynamics. In KDD '09: Proceeding of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 2009. Google ScholarDigital Library
- Y. Koren. Factor in the neighbors: Scalable and accurate collaborative filtering. In KDD: Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, 2009.Google Scholar
- A. Paterek. Improving regularized singular value decomposition for collaborative filtering. Proceedings of KDD Cup and Workshop, 2007.Google Scholar
- M. Piotte and M. Chabbert. The Pragmatic theory solution to the Netflix Grand Prize, 2009.Google Scholar
- R. Salakhutdinov, A. Mnih, and G. E. Hinton. Restricted boltzmann machines for collaborative filtering. In ICML, pages 791--798, 2007. Google ScholarDigital Library
- J. Sill, G. Takacs, L. Mackey, and D. Lin. Feature-weighted linear stacking. arXiv:0911.0460v2, 2009.Google Scholar
- G. Takács, I. Pilászy, B. Németh, and D. Tikk. Matrix factorization and neighbor based algorithms for the netflix prize problem. In RecSys '08: Proceedings of the 2008 ACM conference on Recommender systems, pages 267--274. ACM, 2008. Google ScholarDigital Library
- A. Töscher and M. Jahrer. The BigChaos solution to the Netflix Prize 2008. Technical report, commendo research & consulting, October 2008.Google Scholar
- A. Töscher, M. Jahrer, and R. M. Bell. The BigChaos solution to the Netflix Grand Prize, 2009.Google Scholar
- A. Töscher, M. Jahrer, and R. Legenstein. Improved neighborhood-based algorithms for large-scale recommender systems. In KDD Workshop at SIGKDD 08, August 2008. Google ScholarDigital Library
Index Terms
- Combining predictions for accurate recommender systems
Recommendations
A Scalable, Accurate Hybrid Recommender System
WKDD '10: Proceedings of the 2010 Third International Conference on Knowledge Discovery and Data MiningRecommender systems apply machine learning techniques for filtering unseen information and can predict whether a user would like a given resource. There are three main types of recommender systems: collaborative filtering, content-based filtering, and ...
Combining Trust Propagation and Topic-Level User Interest Expansion in Recommender Systems
With the development of E-commerce and Internet, items are becoming more and more, which brings a so called information overload problem that it is hard for users to find the items they would be interested in. Recommender systems emerge to response to ...
Acquiring User Information Needs for Recommender Systems
WI-IAT '13: Proceedings of the 2013 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) - Volume 03Most recommender systems attempt to use collaborative filtering, content-based filtering or hybrid approach to recommend items to new users. Collaborative filtering recommends items to new users based on their similar neighbours, and content-based ...
Comments