research-article

CCCF: Improving Collaborative Filtering via Scalable User-Item Co-Clustering

Authors:
Yao Wu

Simon Fraser University, Burnaby, BC, Canada

Simon Fraser University, Burnaby, BC, Canada
View Profile

,
Xudong Liu

Institute of Automation, Chinese Academy of Sciences, Beijing, China

Institute of Automation, Chinese Academy of Sciences, Beijing, China
View Profile

,
Min Xie

Walmart Labs, San Bruno, CA, USA

Walmart Labs, San Bruno, CA, USA
View Profile

,
Martin Ester

Simon Fraser University, Burnaby, BC, Canada

Simon Fraser University, Burnaby, BC, Canada
View Profile

,
Qing Yang

Institute of Automation, Chinese Academy of Sciences, Beijing, China

Institute of Automation, Chinese Academy of Sciences, Beijing, China
View Profile

WSDM '16: Proceedings of the Ninth ACM International Conference on Web Search and Data MiningFebruary 2016Pages 73–82https://doi.org/10.1145/2835776.2835836

Published:08 February 2016Publication History

WSDM '16: Proceedings of the Ninth ACM International Conference on Web Search and Data Mining

Pages 73–82

ABSTRACT

Collaborative Filtering (CF) is the most popular method for recommender systems. The principal idea of CF is that users might be interested in items that are favorited by similar users, and most of the existing CF methods measure users' preferences by their behaviours over all the items. However, users might have different interests over different topics, thus might share similar preferences with different groups of users over different sets of items. In this paper, we propose a novel and scalable method CCCF which improves the performance of CF methods via user-item co-clustering. CCCF first clusters users and items into several subgroups, where each subgroup includes a set of like-minded users and a set of items in which these users share their interests. Then, traditional CF methods can be easily applied to each subgroup, and the recommendation results from all the subgroups can be easily aggregated. Compared with previous works, CCCF has several advantages including scalability, flexibility, interpretability and extensibility. Experimental results on four real world data sets demonstrate that the proposed method significantly improves the performance of several state-of-the-art recommendation algorithms.

References

D. Agarwal and B.-C. Chen. Regression-based latent factor models. In Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 19--28. ACM, 2009. Google ScholarDigital Library
E. M. Airoldi, D. M. Blei, S. E. Fienberg, and E. P. Xing. Mixed membership stochastic blockmodels. In Advances in Neural Information Processing Systems, pages 33--40, 2009.Google ScholarDigital Library
A. Beutel, A. Ahmed, and A. J. Smola. Accams: Additive co-clustering to approximate matrices succinctly. In Proceedings of the 24th International Conference on World Wide Web, pages 119--129, 2015. Google ScholarDigital Library
A. Beutel, K. Murray, C. Faloutsos, and A. J. Smola. Cobafi: collaborative bayesian filtering. In Proceedings of the 23rd international conference on World wide web, pages 97--108, 2014. Google ScholarDigital Library
J. Foulds, L. Boyles, C. DuBois, P. Smyth, and M. Welling. Stochastic collapsed variational bayesian inference for latent dirichlet allocation. In Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 446--454. ACM, 2013. Google ScholarDigital Library
T. George and S. Merugu. A scalable collaborative filtering framework based on co-clustering. In ICDM'05. IEEE, 2005. Google ScholarDigital Library
L. Hong, A. S. Doumith, and B. D. Davison. Co-factorization machines: modeling user interests and predicting individual decisions in twitter. In Proceedings of the sixth ACM international conference on Web search and data mining, pages 557--566. ACM, 2013. Google ScholarDigital Library
Y. Koren, R. Bell, and C. Volinsky. Matrix factorization techniques for recommender systems. Computer, (8):30--37, 2009. Google ScholarDigital Library
J. Lee, S. Bengio, S. Kim, G. Lebanon, and Y. Singer. Local collaborative ranking. In Proceedings of the 23rd international conference on World wide web, pages 85--96, 2014. Google ScholarDigital Library
J. Lee, S. Kim, G. Lebanon, and Y. Singer. Local low-rank matrix approximation. In Proceedings of The 30th International Conference on Machine Learning, pages 82--90, 2013.Google ScholarDigital Library
L. W. Mackey, M. I. Jordan, and A. Talwalkar. Divide-and-conquer matrix factorization. In Advances in Neural Information Processing Systems, pages 1134--1142, 2011.Google ScholarDigital Library
A. Mnih and R. Salakhutdinov. Probabilistic matrix factorization. In Advances in neural information processing systems, pages 1257--1264, 2007.Google ScholarDigital Library
M. O'Connor and J. Herlocker. Clustering items for collaborative filtering. In Proceedings of the ACM SIGIR workshop on recommender systems, volume 128, 1999.Google Scholar
V. Y. Pan and Z. Q. Chen. The complexity of the matrix eigenproblem. In Proceedings of the Thirty-first Annual ACM Symposium on Theory of Computing, STOC '99, pages 507--516. ACM, 1999. Google ScholarDigital Library
S. Rendle, C. Freudenthaler, Z. Gantner, and L. Schmidt-Thieme. Bpr: Bayesian personalized ranking from implicit feedback. In Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence, pages 452--461, 2009. Google ScholarDigital Library
F. Ricci, L. Rokach, and B. Shapira. Introduction to recommender systems handbook. Springer, 2011.Google ScholarCross Ref
B. Sarwar, G. Karypis, J. Konstan, and J. Riedl. Item-based collaborative filtering recommendation algorithms. In Proceedings of the 10th International Conference on World Wide Web, pages 285--295. ACM, 2001. Google ScholarDigital Library
H. Shan and A. Banerjee. Bayesian co-clustering. In ICDM'08, pages 530--539. IEEE, 2008. Google ScholarDigital Library
H. Steck. Training and testing of recommender systems on data missing not at random. In Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 713--722. ACM, 2010. Google ScholarDigital Library
P. Wang, C. Domeniconi, and K. B. Laskey. Latent dirichlet bayesian co-clustering. In Machine Learning and Knowledge Discovery in Databases, pages 522--537. Springer, 2009. Google ScholarDigital Library
P. Wang, K. B. Laskey, C. Domeniconi, and M. I. Jordan. Nonparametric bayesian co-clustering ensembles. In SDM. SIAM, 2011.Google ScholarCross Ref
J. Weston, S. Bengio, and N. Usunier. Wsabie: Scaling up to large vocabulary image annotation. In IJCAI, pages 2764--2770, 2011. Google ScholarDigital Library
J. Weston, C. Wang, R. Weiss, and A. Berenzweig. Latent collaborative retrieval. In Proceedings of the 29th International Conference on Machine Learning, pages 9--16, 2012.Google Scholar
J. Weston, R. J. Weiss, and H. Yee. Nonlinear latent factorization by embedding multiple user interests. In Proceedings of the 7th ACM conference on Recommender systems, pages 65--68. ACM, 2013. Google ScholarDigital Library
B. Xu, J. Bu, C. Chen, and D. Cai. An exploration of improving collaborative recommender systems via user-item subgroups. In Proceedings of the 21st International Conference on World Wide Web, pages 21--30. ACM, 2012. Google ScholarDigital Library
G.-R. Xue, C. Lin, Q. Yang, W. Xi, H.-J. Zeng, Y. Yu, and Z. Chen. Scalable collaborative filtering using cluster-based smoothing. In Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, pages 114--121. ACM, 2005. Google ScholarDigital Library
J. Yang and J. Leskovec. Community-affiliation graph model for overlapping network community detection. In ICDM'12, pages 1170--1175. IEEE, 2012. Google ScholarDigital Library
J. Yang and J. Leskovec. Overlapping community detection at scale: A nonnegative matrix factorization approach. In Proceedings of the Sixth ACM International Conference on Web Search and Data Mining, pages 587--596. ACM, 2013. Google ScholarDigital Library
J. Yang, J. McAuley, and J. Leskovec. Community detection in networks with node attributes. In ICDM'13, pages 1151--1156. IEEE, 2013.Google ScholarCross Ref
S.-H. Yang, B. Long, A. J. Smola, H. Zha, and Z. Zheng. Collaborative competitive filtering: learning recommender using context of user choice. In Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval, pages 295--304. ACM, 2011. Google ScholarDigital Library
Y. Zhang, M. Zhang, Y. Liu, and S. Ma. Improve collaborative filtering through bordered block diagonal form matrices. In Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval, pages 313--322. ACM, 2013. Google ScholarDigital Library
Y. Zhang, M. Zhang, Y. Liu, S. Ma, and S. Feng. Localized matrix factorization for recommendation based on matrix block diagonal forms. In Proceedings of the 22Nd International Conference on World Wide Web, pages 1511--1520. ACM, 2013. Google ScholarDigital Library

Index Terms

CCCF: Improving Collaborative Filtering via Scalable User-Item Co-Clustering
1. Information systems
  1. Information retrieval
    1. Retrieval tasks and goals
      1. Document filtering
      2. Information extraction

Recommendations

A Scalable, Accurate Hybrid Recommender System
WKDD '10: Proceedings of the 2010 Third International Conference on Knowledge Discovery and Data Mining

Recommender systems apply machine learning techniques for filtering unseen information and can predict whether a user would like a given resource. There are three main types of recommender systems: collaborative filtering, content-based filtering, and ...
Read More
A framework for diversifying recommendation lists by user interest expansion

Recommender systems have been widely used to discover users' preferences and recommend interesting items to users during this age of information overload. Researchers in the field of recommender systems have realized that the quality of a top-N ...
Read More
Cross-representation mediation of user models

Personalization is considered a powerful methodology for improving the effectiveness of information search and decision making. It has led to the dissemination of systems capable of suggesting relevant and personalized information (or items) to the users,...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
WSDM '16: Proceedings of the Ninth ACM International Conference on Web Search and Data Mining
February 2016
746 pages
ISBN:9781450337168
DOI:10.1145/2835776
General Chairs:
Paul N. Bennett
Microsoft Research
,
Vanja Josifovski
Pinterest
,
Program Chairs:
Jennifer Neville
Purdue University
,
Filip Radlinski
Microsoft
Copyright © 2016 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 8 February 2016
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
co-clustering
collaborative filtering
recommender systems
Qualifiers
- research-article
Conference

Acceptance Rates
WSDM '16 Paper Acceptance Rate67of368submissions,18%Overall Acceptance Rate498of2,863submissions,17%
More
Upcoming Conference
WSDM '25

Sponsor:

sigir

sigir

sigir

sigir

The Eighteenth ACM International Conference on Web Search and Data Mining

April 7 - 11, 2025

Hannover , Germany
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 39
  Total Citations
  View Citations
- 821
  Total Downloads
- Downloads (Last 12 months)31
- Downloads (Last 6 weeks)7
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

CCCF: Improving Collaborative Filtering via Scalable User-Item Co-Clustering

WSDM '16: Proceedings of the Ninth ACM International Conference on Web Search and Data Mining

ABSTRACT

References

Cited By

Index Terms

Recommendations

A Scalable, Accurate Hybrid Recommender System

A framework for diversifying recommendation lists by user interest expansion

Cross-representation mediation of user models