research-article

Improving Top-N Recommendation for Cold-Start Users via Cross-Domain Information

Authors:
Nima Mirbakhsh

Western University, London, ON, Canada

Western University, London, ON, Canada
View Profile

,
Charles X. Ling

Western University, London, ON, Canada

Western University, London, ON, Canada
View Profile

Authors Info & Claims

ACM Transactions on Knowledge Discovery from Data Volume 9 Issue 4Article No.: 33pp 1–19https://doi.org/10.1145/2724720

Published:01 June 2015Publication History

ACM Transactions on Knowledge Discovery from Data

Abstract

Making accurate recommendations for cold-start users is a challenging yet important problem in recommendation systems. Including more information from other domains is a natural solution to improve the recommendations. However, most previous work in cross-domain recommendations has focused on improving prediction accuracy with several severe limitations. In this article, we extend our previous work on clustering-based matrix factorization in single domains into cross domains. In addition, we utilize recent results on unobserved ratings. Our new method can more effectively utilize data from auxiliary domains to achieve better recommendations, especially for cold-start users. For example, our method improves the recall to 21% on average for cold-start users, whereas previous methods result in only 15% recall in the cross-domain Amazon dataset. We also observe almost the same improvements in the Epinions dataset. Considering that it is often difficult to make even a small improvement in recommendations, for cold- start users in particular, our result is quite significant.

References

Gediminas Adomavicius and Alexander Tuzhilin. 2005. Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions. IEEE Transactions on Knowledge and Data Engineering 17, 6, 734--749. DOI:http://dx.doi.org/10.1109/TKDE.2005.99 Google ScholarDigital Library
Wei Chen, Wynne Hsu, and Mong Li Lee. 2013. Making recommendations from multiple domains. In Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’13). ACM, New York, NY, 892--900. DOI:http://dx.doi.org/10.1145/2487575.2487638 Google ScholarDigital Library
Paolo Cremonesi, Yehuda Koren, and Roberto Turrin. 2010. Performance of recommender algorithms on top-N recommendation tasks. In Proceedings of the 4th ACM Conference on Recommender Systems (RecSys’10). ACM, New York, NY, 39--46. DOI:http://dx.doi.org/10.1145/1864708.1864721 Google ScholarDigital Library
Paolo Cremonesi, Antonio Tripodi, and Roberto Turrin. 2011. Cross-domain recommender systems. In Proceedings of the 2011 IEEE 11th International Conference on Data Mining Workshops (ICDMW’11). 496--503. DOI:http://dx.doi.org/10.1109/ICDMW.2011.57 Google ScholarDigital Library
Sheng Gao, Hao Luo, Da Chen, Shantao Li, Patrick Gallinari, and Jun Guo. 2013. Cross-domain recommendation via cluster-level latent factor model. In Machine Learning and Knowledge Discovery in Databases. Lecture Notes in Computer Science, Vol. 8189. Springer, 161--176. DOI:http://dx.doi.org/10.1007/978-3-642-40991-2&lowbar;11.Google Scholar
Liang Hu, Jian Cao, Guandong Xu, Longbing Cao, Zhiping Gu, and Can Zhu. 2013. Personalized recommendation via cross-domain triadic factorization. In Proceedings of the 22nd International Conference on World Wide Web (WWW’13). 595--606. http://dl.acm.org/citation.cfm&quest;id=2488388.2488441. Google ScholarDigital Library
Mohsen Jamali and Martin Ester. 2010. A matrix factorization technique with trust propagation for recommendation in social networks. In Proceedings of the 4th ACM Conference on Recommender Systems (RecSys’10). ACM, New York, NY, 135--142. DOI:http://dx.doi.org/10.1145/1864708.1864736 Google ScholarDigital Library
Meng Jiang, Peng Cui, Fei Wang, Qiang Yang, Wenwu Zhu, and Shiqiang Yang. 2012. Social recommendation across multiple relational domains. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management (CIKM’12). ACM, New York, NY, 1422--1431. DOI:http://dx.doi.org/10.1145/2396761.2398448 Google ScholarDigital Library
Yehuda Koren. 2008. Factorization meets the neighborhood: A multifaceted collaborative filtering model. In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’08). ACM, New York, NY, 426--434. DOI:http://dx.doi.org/10.1145/1401890.1401944 Google ScholarDigital Library
Yehuda Koren, Robert Bell, and Chris Volinsky. 2009. Matrix factorization techniques for recommender systems. Computer 42, 8, 30--37. DOI:http://dx.doi.org/10.1109/MC.2009.263 Google ScholarDigital Library
Jure Leskovec, Lada A. Adamic, and Bernardo A. Huberman. 2007. The dynamics of viral marketing. ACM Transactions on the Web 1, 1, Article No. 5. DOI:http://dx.doi.org/10.1145/1232722.1232727 Google ScholarDigital Library
Bin Li. 2011. Cross-domain collaborative filtering: A brief survey. In Proceedings of the 2011 23rd IEEE International Conference on Tools with Artificial Intelligence (ICTAI’11). 1085--1086. DOI:http://dx.doi.org/10.1109/ICTAI.2011.184 Google ScholarDigital Library
Bin Li, Qiang Yang, and Xiangyang Xue. 2009. Can movies and books collaborate&quest; Cross-domain collaborative filtering for sparsity reduction. In Proceedings of the 21st International Joint Conference on Artificial Intelligence (IJCAI’09). 2052--2057. http://dl.acm.org/citation.cfm&quest;id=1661445.1661773 Google ScholarDigital Library
Zhongqi Lu, ErHeng Zhong, Lili Zhao, Evan Wei Xiang, Weike Pan, and Qiang Yang 0001. 2012. Selective transfer learning for cross domain recommendation. arXiv:1210.7056. Available at http://dblp.uni-trier.de/db/journals/corr/corr1210.html#abs-1210-7056.Google Scholar
Paolo Massa and Paolo Avesani. 2006. Trust-aware bootstrapping of recommender systems. In Proceedings of the ECAI Workshop on Recommender Systems (ECAI’06). 29--33.Google Scholar
Simon Meyffret, Emmanuel Guillot, Lionel Medini, and Fredrique Laforest. 2012. RED: A Rich Epinions Dataset for Recommender Systems. Technical Report RR-LIRIS-2012-014. LIRIS UMR 5205 CNRS/INSA de Lyon/Universit Claude Bernard Lyon 1/Universit Lumire Lyon 2/cole Centrale de Lyon. http://liris.cnrs.fr/publis/&quest;id=5787.Google Scholar
Nima Mirbakhsh and Charles X. Ling. 2013. Clustering-based factorized collaborative filtering. In Proceedings of the 7th ACM Conference on Recommender Systems (RecSys’13). ACM, New York, NY, 315--318. DOI:http://dx.doi.org/10.1145/2507157.2507233 Google ScholarDigital Library
Nima Mirbakhsh and Charles X. Ling. 2014. Leveraging clustering to improve collaborative filtering (submitted and under review).Google Scholar
Orly Moreno, Bracha Shapira, Lior Rokach, and Guy Shani. 2012. TALMUD: Transfer learning for multiple domains. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management (CIKM’12). ACM, New York, NY, 425--434. DOI:http://dx.doi.org/10.1145/2396761.2396817 Google ScholarDigital Library
Xia Ning and George Karypis. 2012. Sparse linear methods with side information for top-N recommendations. In Proceedings of the 6th ACM Conference on Recommender Systems (RecSys’12). ACM, New York, NY, 155--162. DOI:http://dx.doi.org/10.1145/2365952.2365983 Google ScholarDigital Library
Sinno Jialin Pan and Qiang Yang. 2010. A survey on transfer learning. IEEE Transactions on Knowledge and Data Engineering 22, 10, 1345--1359. DOI:http://dx.doi.org/10.1109/TKDE.2009.191 Google ScholarDigital Library
Al Mamunur Rashid, George Karypis, and John Riedl. 2008. Learning preferences of new users in recommender systems: An information theoretic approach. SIGKDD Exploration Newsletter 10, 2, 90--100. DOI:http://dx.doi.org/10.1145/1540276.1540302 Google ScholarDigital Library
Steffen Rendle, Christoph Freudenthaler, Zeno Gantner, and Lars Schmidt-Thieme. 2009. BPR: Bayesian personalized ranking from implicit feedback. In Proceedings of the 25th Conference on Uncertainty in Artificial Intelligence (UAI’09). 452--461. http://dl.acm.org/citation.cfm&quest;id=1795114.1795167 Google ScholarDigital Library
Andrew I. Schein, Alexandrin Popescul, Lyle H. Ungar, and David M. Pennock. 2002. Methods and metrics for cold-start recommendations. In Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’02). ACM, New York, NY, 253--260. DOI:http://dx.doi.org/10.1145/564376.564421 Google ScholarDigital Library
Yue Shi, Martha Larson, and Alan Hanjalic. 2014. Collaborative filtering beyond the user-item matrix: A survey of the state of the art and future challenges. ACM Computing Surveys 47, 1, Article No. 3. DOI:http://dx.doi.org/10.1145/2556270 Google ScholarDigital Library
Harald Steck. 2010. Training and testing of recommender systems on data missing not at random. In Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’10). ACM, New York, NY, 713--722. DOI:http://dx.doi.org/10.1145/1835804.1835895 Google ScholarDigital Library
Harald Steck. 2013. Evaluation of recommendations: Rating-prediction and ranking. In Proceedings of the 7th ACM Conference on Recommender Systems (RecSys’13). ACM, New York, NY, 213--220. DOI:http://dx.doi.org/10.1145/2507157.2507160 Google ScholarDigital Library
Jie Tang, Sen Wu, Jimeng Sun, and Hang Su. 2012. Cross-domain collaboration recommendation. In Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’12). ACM, New York, NY, 1285--1293. DOI:http://dx.doi.org/10.1145/2339530.2339730 Google ScholarDigital Library
Yin Zhu, Yuqiang Chen, Zhongqi Lu, Sinno Jialin Pan, Gui Rong Xue, Yong Yu, and Qiang Yang. 2010. Heterogeneous transfer learning for image classification. In Proceedings of the 24th AAAI Conference on Artificial Intelligence (AAAI’10).Google Scholar

Index Terms

Improving Top-N Recommendation for Cold-Start Users via Cross-Domain Information
1. Information systems
  1. Information retrieval
  2. Information storage systems

Recommendations

An effective recommendation method for cold start new users using trust and distrust networks

Recommendation systems analyze the purchasing behavior (e.g., item ratings) of users to learn about their preferences and recommend products or services that may be of interest to them. However, as new users require time to become familiar with ...
Read More
Naïve filterbots for robust cold-start recommendations
KDD '06: Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining

The goal of a recommender system is to suggest items of interest to a user based on historical behavior of a community of users. Given detailed enough history, item-based collaborative filtering (CF) often performs as well or better than almost any ...
Read More
Merging trust in collaborative filtering to alleviate data sparsity and cold start

Providing high quality recommendations is important for e-commerce systems to assist users in making effective selection decisions from a plethora of choices. Collaborative filtering is a widely accepted technique to generate recommendations based on ...
Read More

Reviews

Reviewer: A. Squassabia

Collaborative recommender systems often provide disappointing suggestions to new users who volunteered very few or no ratings of their own for processing: this is known as the cold-start problem. Mitigating such shortcomings with cross-domain information helps; this paper builds on previous work by the authors to introduce incremental improvements to the cold-start problem. The authors previously published an approach to single-domain recommendations based on clustering of latent factors using matrix factorization and k -means. Here, they translate the same principle into seeding cold-start recommendations with cross-domain information, exploiting multiple domains where each domain is endowed with shared users, hence with some measure of user overlap. Validation was carried out using two datasets, one from Amazon comprising ratings for media (video, music, DVD) and goods (electronics, kitchen, toys) and another from Epinions comprising ten disparate categories of items. Validation compared results for single-domain, traditional cross-domain, and their new clustered cross-domain top- N ratings using recall for N of 5, 10, 15 or 20 as a metric. Cross-domain cold-start performed better than single-domain; clustered cross-domain performed as well as traditional cross-domain for low N , and better than traditional for larger N . The main contribution of this paper is the novelty of a relatively simple implementation for the underlying idea, which is not entirely original but new in this form. Its main limitation is in the difficulty of assessing impact on the basis of a single machine-driven metric on only two datasets; albeit traditionally acceptable, validation would be more informative if carried out with more data, multiple metrics, and ideally before a live audience. Online Computing Reviews Service

Access critical reviews of Computing literature here

Become a reviewer for Computing Reviews.

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in
ACM Transactions on Knowledge Discovery from Data Volume 9, Issue 4
June 2015
261 pages
ISSN:1556-4681
EISSN:1556-472X
DOI:10.1145/2786971
Editor:
Philip S. Yu
University of Illinois at Chicago, USA
Issue’s Table of Contents
Copyright © 2015 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 1 June 2015
- Accepted: 1 January 2015
- Revised: 1 December 2014
- Received: 1 June 2014
Published in tkdd Volume 9, Issue 4

Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Collaborative filtering
cold start
matrix factorization
recommendation system
Qualifiers
- research-article
- Research
- Refereed
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 62
  Total Citations
  View Citations
- 1,382
  Total Downloads
- Downloads (Last 12 months)64
- Downloads (Last 6 weeks)7
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Improving Top-N Recommendation for Cold-Start Users via Cross-Domain Information

ACM Transactions on Knowledge Discovery from Data

Abstract

References

Cited By

Index Terms

Recommendations

An effective recommendation method for cold start new users using trust and distrust networks

Naïve filterbots for robust cold-start recommendations

Merging trust in collaborative filtering to alleviate data sparsity and cold start

Reviews

Access critical reviews of Computing literature here

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Improving Top-N Recommendation for Cold-Start Users via Cross-Domain Information

ACM Transactions on Knowledge Discovery from Data

Abstract

References

Cited By

Index Terms

Recommendations

An effective recommendation method for cold start new users using trust and distrust networks

Naïve filterbots for robust cold-start recommendations

Merging trust in collaborative filtering to alleviate data sparsity and cold start

Reviews

Access critical reviews of Computing literature here

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media