skip to main content
research-article

Improving Top-N Recommendation for Cold-Start Users via Cross-Domain Information

Published:01 June 2015Publication History
Skip Abstract Section

Abstract

Making accurate recommendations for cold-start users is a challenging yet important problem in recommendation systems. Including more information from other domains is a natural solution to improve the recommendations. However, most previous work in cross-domain recommendations has focused on improving prediction accuracy with several severe limitations. In this article, we extend our previous work on clustering-based matrix factorization in single domains into cross domains. In addition, we utilize recent results on unobserved ratings. Our new method can more effectively utilize data from auxiliary domains to achieve better recommendations, especially for cold-start users. For example, our method improves the recall to 21% on average for cold-start users, whereas previous methods result in only 15% recall in the cross-domain Amazon dataset. We also observe almost the same improvements in the Epinions dataset. Considering that it is often difficult to make even a small improvement in recommendations, for cold- start users in particular, our result is quite significant.

References

  1. Gediminas Adomavicius and Alexander Tuzhilin. 2005. Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions. IEEE Transactions on Knowledge and Data Engineering 17, 6, 734--749. DOI:http://dx.doi.org/10.1109/TKDE.2005.99 Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Wei Chen, Wynne Hsu, and Mong Li Lee. 2013. Making recommendations from multiple domains. In Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’13). ACM, New York, NY, 892--900. DOI:http://dx.doi.org/10.1145/2487575.2487638 Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Paolo Cremonesi, Yehuda Koren, and Roberto Turrin. 2010. Performance of recommender algorithms on top-N recommendation tasks. In Proceedings of the 4th ACM Conference on Recommender Systems (RecSys’10). ACM, New York, NY, 39--46. DOI:http://dx.doi.org/10.1145/1864708.1864721 Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Paolo Cremonesi, Antonio Tripodi, and Roberto Turrin. 2011. Cross-domain recommender systems. In Proceedings of the 2011 IEEE 11th International Conference on Data Mining Workshops (ICDMW’11). 496--503. DOI:http://dx.doi.org/10.1109/ICDMW.2011.57 Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Sheng Gao, Hao Luo, Da Chen, Shantao Li, Patrick Gallinari, and Jun Guo. 2013. Cross-domain recommendation via cluster-level latent factor model. In Machine Learning and Knowledge Discovery in Databases. Lecture Notes in Computer Science, Vol. 8189. Springer, 161--176. DOI:http://dx.doi.org/10.1007/978-3-642-40991-2_11.Google ScholarGoogle Scholar
  6. Liang Hu, Jian Cao, Guandong Xu, Longbing Cao, Zhiping Gu, and Can Zhu. 2013. Personalized recommendation via cross-domain triadic factorization. In Proceedings of the 22nd International Conference on World Wide Web (WWW’13). 595--606. http://dl.acm.org/citation.cfm?id=2488388.2488441. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Mohsen Jamali and Martin Ester. 2010. A matrix factorization technique with trust propagation for recommendation in social networks. In Proceedings of the 4th ACM Conference on Recommender Systems (RecSys’10). ACM, New York, NY, 135--142. DOI:http://dx.doi.org/10.1145/1864708.1864736 Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Meng Jiang, Peng Cui, Fei Wang, Qiang Yang, Wenwu Zhu, and Shiqiang Yang. 2012. Social recommendation across multiple relational domains. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management (CIKM’12). ACM, New York, NY, 1422--1431. DOI:http://dx.doi.org/10.1145/2396761.2398448 Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Yehuda Koren. 2008. Factorization meets the neighborhood: A multifaceted collaborative filtering model. In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’08). ACM, New York, NY, 426--434. DOI:http://dx.doi.org/10.1145/1401890.1401944 Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Yehuda Koren, Robert Bell, and Chris Volinsky. 2009. Matrix factorization techniques for recommender systems. Computer 42, 8, 30--37. DOI:http://dx.doi.org/10.1109/MC.2009.263 Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Jure Leskovec, Lada A. Adamic, and Bernardo A. Huberman. 2007. The dynamics of viral marketing. ACM Transactions on the Web 1, 1, Article No. 5. DOI:http://dx.doi.org/10.1145/1232722.1232727 Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Bin Li. 2011. Cross-domain collaborative filtering: A brief survey. In Proceedings of the 2011 23rd IEEE International Conference on Tools with Artificial Intelligence (ICTAI’11). 1085--1086. DOI:http://dx.doi.org/10.1109/ICTAI.2011.184 Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Bin Li, Qiang Yang, and Xiangyang Xue. 2009. Can movies and books collaborate? Cross-domain collaborative filtering for sparsity reduction. In Proceedings of the 21st International Joint Conference on Artificial Intelligence (IJCAI’09). 2052--2057. http://dl.acm.org/citation.cfm?id=1661445.1661773 Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Zhongqi Lu, ErHeng Zhong, Lili Zhao, Evan Wei Xiang, Weike Pan, and Qiang Yang 0001. 2012. Selective transfer learning for cross domain recommendation. arXiv:1210.7056. Available at http://dblp.uni-trier.de/db/journals/corr/corr1210.html#abs-1210-7056.Google ScholarGoogle Scholar
  15. Paolo Massa and Paolo Avesani. 2006. Trust-aware bootstrapping of recommender systems. In Proceedings of the ECAI Workshop on Recommender Systems (ECAI’06). 29--33.Google ScholarGoogle Scholar
  16. Simon Meyffret, Emmanuel Guillot, Lionel Medini, and Fredrique Laforest. 2012. RED: A Rich Epinions Dataset for Recommender Systems. Technical Report RR-LIRIS-2012-014. LIRIS UMR 5205 CNRS/INSA de Lyon/Universit Claude Bernard Lyon 1/Universit Lumire Lyon 2/cole Centrale de Lyon. http://liris.cnrs.fr/publis/?id=5787.Google ScholarGoogle Scholar
  17. Nima Mirbakhsh and Charles X. Ling. 2013. Clustering-based factorized collaborative filtering. In Proceedings of the 7th ACM Conference on Recommender Systems (RecSys’13). ACM, New York, NY, 315--318. DOI:http://dx.doi.org/10.1145/2507157.2507233 Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Nima Mirbakhsh and Charles X. Ling. 2014. Leveraging clustering to improve collaborative filtering (submitted and under review).Google ScholarGoogle Scholar
  19. Orly Moreno, Bracha Shapira, Lior Rokach, and Guy Shani. 2012. TALMUD: Transfer learning for multiple domains. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management (CIKM’12). ACM, New York, NY, 425--434. DOI:http://dx.doi.org/10.1145/2396761.2396817 Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Xia Ning and George Karypis. 2012. Sparse linear methods with side information for top-N recommendations. In Proceedings of the 6th ACM Conference on Recommender Systems (RecSys’12). ACM, New York, NY, 155--162. DOI:http://dx.doi.org/10.1145/2365952.2365983 Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Sinno Jialin Pan and Qiang Yang. 2010. A survey on transfer learning. IEEE Transactions on Knowledge and Data Engineering 22, 10, 1345--1359. DOI:http://dx.doi.org/10.1109/TKDE.2009.191 Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Al Mamunur Rashid, George Karypis, and John Riedl. 2008. Learning preferences of new users in recommender systems: An information theoretic approach. SIGKDD Exploration Newsletter 10, 2, 90--100. DOI:http://dx.doi.org/10.1145/1540276.1540302 Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Steffen Rendle, Christoph Freudenthaler, Zeno Gantner, and Lars Schmidt-Thieme. 2009. BPR: Bayesian personalized ranking from implicit feedback. In Proceedings of the 25th Conference on Uncertainty in Artificial Intelligence (UAI’09). 452--461. http://dl.acm.org/citation.cfm?id=1795114.1795167 Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Andrew I. Schein, Alexandrin Popescul, Lyle H. Ungar, and David M. Pennock. 2002. Methods and metrics for cold-start recommendations. In Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’02). ACM, New York, NY, 253--260. DOI:http://dx.doi.org/10.1145/564376.564421 Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Yue Shi, Martha Larson, and Alan Hanjalic. 2014. Collaborative filtering beyond the user-item matrix: A survey of the state of the art and future challenges. ACM Computing Surveys 47, 1, Article No. 3. DOI:http://dx.doi.org/10.1145/2556270 Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Harald Steck. 2010. Training and testing of recommender systems on data missing not at random. In Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’10). ACM, New York, NY, 713--722. DOI:http://dx.doi.org/10.1145/1835804.1835895 Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Harald Steck. 2013. Evaluation of recommendations: Rating-prediction and ranking. In Proceedings of the 7th ACM Conference on Recommender Systems (RecSys’13). ACM, New York, NY, 213--220. DOI:http://dx.doi.org/10.1145/2507157.2507160 Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Jie Tang, Sen Wu, Jimeng Sun, and Hang Su. 2012. Cross-domain collaboration recommendation. In Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’12). ACM, New York, NY, 1285--1293. DOI:http://dx.doi.org/10.1145/2339530.2339730 Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Yin Zhu, Yuqiang Chen, Zhongqi Lu, Sinno Jialin Pan, Gui Rong Xue, Yong Yu, and Qiang Yang. 2010. Heterogeneous transfer learning for image classification. In Proceedings of the 24th AAAI Conference on Artificial Intelligence (AAAI’10).Google ScholarGoogle Scholar

Index Terms

  1. Improving Top-N Recommendation for Cold-Start Users via Cross-Domain Information

      Recommendations

      Reviews

      A. Squassabia

      Collaborative recommender systems often provide disappointing suggestions to new users who volunteered very few or no ratings of their own for processing: this is known as the cold-start problem. Mitigating such shortcomings with cross-domain information helps; this paper builds on previous work by the authors to introduce incremental improvements to the cold-start problem. The authors previously published an approach to single-domain recommendations based on clustering of latent factors using matrix factorization and k -means. Here, they translate the same principle into seeding cold-start recommendations with cross-domain information, exploiting multiple domains where each domain is endowed with shared users, hence with some measure of user overlap. Validation was carried out using two datasets, one from Amazon comprising ratings for media (video, music, DVD) and goods (electronics, kitchen, toys) and another from Epinions comprising ten disparate categories of items. Validation compared results for single-domain, traditional cross-domain, and their new clustered cross-domain top- N ratings using recall for N of 5, 10, 15 or 20 as a metric. Cross-domain cold-start performed better than single-domain; clustered cross-domain performed as well as traditional cross-domain for low N , and better than traditional for larger N . The main contribution of this paper is the novelty of a relatively simple implementation for the underlying idea, which is not entirely original but new in this form. Its main limitation is in the difficulty of assessing impact on the basis of a single machine-driven metric on only two datasets; albeit traditionally acceptable, validation would be more informative if carried out with more data, multiple metrics, and ideally before a live audience. Online Computing Reviews Service

      Access critical reviews of Computing literature here

      Become a reviewer for Computing Reviews.

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image ACM Transactions on Knowledge Discovery from Data
        ACM Transactions on Knowledge Discovery from Data  Volume 9, Issue 4
        June 2015
        261 pages
        ISSN:1556-4681
        EISSN:1556-472X
        DOI:10.1145/2786971
        Issue’s Table of Contents

        Copyright © 2015 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 1 June 2015
        • Accepted: 1 January 2015
        • Revised: 1 December 2014
        • Received: 1 June 2014
        Published in tkdd Volume 9, Issue 4

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader