skip to main content
10.1145/2566486.2567991acmotherconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
research-article

Personalized collaborative clustering

Published:07 April 2014Publication History

ABSTRACT

We study the problem of learning personalized user models from rich user interactions. In particular, we focus on learning from clustering feedback (i.e., grouping recommended items into clusters), which enables users to express similarity or redundancy between different items. We propose and study a new machine learning problem for personalization, which we call collaborative clustering. Analogous to collaborative filtering, in collaborative clustering the goal is to leverage how existing users cluster or group items in order to predict similarity models for other users' clustering tasks. We propose a simple yet effective latent factor model to learn the variability of similarity functions across a user population. We empirically evaluate our approach using data collected from a clustering interface we developed for a goal-oriented data exploration (or sensemaking) task: asking users to explore and organize attractions in Paris. We evaluate using several realistic use cases, and show that our approach learns more effective user models than conventional clustering and metric learning approaches.

References

  1. E. Acar, D. M. Dunlavy, T. G. Kolda, and M. Mørup. Scalable tensor factorizations with missing data. In SIAM Conference on Data Mining (SDM), 2010.Google ScholarGoogle ScholarCross RefCross Ref
  2. S. Amershi, J. Fogarty, and D. Weld. Regroup: Interactive machine learning for on-demand group creation in social networks. In ACM Conference on Human Factors in Computing Systems (CHI), 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. M. Balcan and A. Blum. Clustering with interactive feedback. In International Conference on Algorithmic Learning Theory (ALT), 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. S. Basu, M. Bilenko, and R. J. Mooney. A probabilistic framework for semi-supervised clustering. In ACM Conference on Knowledge Discovery and Data Mining (KDD), 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. S. Basu, D. Fisher, S. Drucker, and H. Lu. Assisting users with clustering tasks by combining metric learning and classification. In National Conference on Artificial Intelligence (AAAI), 2010.Google ScholarGoogle Scholar
  6. J. Blitzer and J. Weston. Latent structured ranking. In Conference on Uncertainty in Artificial Intelligence (UAI), 2012.Google ScholarGoogle Scholar
  7. S. Boyd, N. Parikh, E. Chu, B. Peleato, and J. Eckstein. Distributed optimization and statistical learning via the alternatiing direction method of multipliers. Foundations and Trends in Machine Learning, 3(1):1--122, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. C. Brandt, T. Joachims, Y. Yue, and J. Bank. Dynamic ranked retrieval. In ACM Conference on Web Search and Data Mining (WSDM), 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. D. H. Chau, A. Kittur, J. I. Hong, and C. Faloutsos. Apolo: Making sense of large network data by combining rich user interaction and machine learning. In ACM Conference on Human Factors in Computing Systems (CHI), 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. J. Davis, B. Kulis, P. Jain, S. Sra, and I. Dhillon. Information-theoretic metric learning. In International Conference on Machine Learning (ICML), 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. T. Evgeniou and M. Pontil. Regularized multi-task learning. In ACM Conference on Knowledge Discovery and Data Mining (KDD), 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. G. Forestier, P. Gançarski, and C. Wemmert. Collaborative clustering with background knowledge. Journal of Data & Knowledge Engineering, 69(2):211--228, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. R. Gomes, P. Welinder, A. Krause, and P. Perona. Crowdclustering. In Neural Information Processing Systems (NIPS), 2011.Google ScholarGoogle Scholar
  14. K. Hammouda and M. Kamel. Collaborative document clustering. In SIAM Conference on Data Mining (SDM), 2006.Google ScholarGoogle ScholarCross RefCross Ref
  15. Y. Koren and R. Bell. Advances in collaborative filtering. In Recommender Systems Handbook, pages 145--186. Springer, 2011.Google ScholarGoogle ScholarCross RefCross Ref
  16. Y. Koren, R. Bell, and C. Volinsky. Matrix factorization techniques for recommender systems. IEEE Computer, 42(8):30--37, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. L. Li, W. Chu, J. Langford, and R. Schapire. A contextual-bandit approach to personalized news article recommendation. In World Wide Web Conference (WWW), 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. N. Nello Cristianini and J. Shawe-Taylor. An Introduction to Support Vector Machines and other kernel-based learning methods. Cambridge University Press, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. D. Niu, J. Dy, and M. Jordan. Multiple non-redundant spectral clustering views. In International Conference on Machine Learning (ICML), 2010.Google ScholarGoogle Scholar
  20. S. Parameswaran and K. Weinberger. Large margin multi-task metric learning. In Neural Information Processing Systems (NIPS), 2010.Google ScholarGoogle Scholar
  21. D. M. Russell, M. J. Stefik, P. Pirolli, and S. K. Card. The cost structure of sensemaking. In Proceedings of the INTERACT'93 and CHI'93 conference on Human factors in computing systems, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. R. Salakhutdinov and A. Mnih. Probabilistic matrix factorization. In Neural Information Processing Systems (NIPS), 2008.Google ScholarGoogle Scholar
  23. G. Salton and C. Buckley. Term-weighting approaches in automatic text retrieval. Information processing & management, 24(5):513--523, 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. M. Schultz and T. Joachims. Learning a distance metric from relative comparisons. In Neural Information Processing Systems (NIPS), 2003.Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. D. Shahaf, J. Yang, C. Suen, J. Jacobs, H. Wang, and J. Leskovec. Information cartography: Creating zoomable, large-scale maps of information. In ACM Conference on Knowledge Discovery and Data Mining (KDD), 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. N. Srebro. Learning with Matrix Factorizations. PhD thesis, Massachusetts Institute of Technology, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. I. Sutskever, R. Salakhutdinov, and J. Tenenbaum. Modelling relational data using Bayesian clustered tensor factorization. In Neural Information Processing Systems (NIPS), 2009.Google ScholarGoogle Scholar
  28. O. Tamuz, C. Liu, S. Belongie, O. Shamir, and A. T. Kalai. Adaptively learning the crowd kernel. In International Conference on Machine Learning (ICML), 2011.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. K. Wagstaff and C. Cardie. Clustering with instance-level constraints. In National Conference on Artificial Intelligence (AAAI), 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. C. Wang and D. M. Blei. Collaborative topic modeling for recommending scientific articles. In ACM Conference on Knowledge Discovery and Data Mining (KDD), 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. E. Xing, A. Ng, M. Jordan, and S. Russell. Distance metric learning, with application to clustering with side-information. In Neural Information Processing Systems (NIPS), 2002.Google ScholarGoogle Scholar
  32. Y. Zhang and D. Yeung. Transfer metric learning by learning task relationships. In ACM Conference on Knowledge Discovery and Data Mining (KDD), 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Personalized collaborative clustering

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Other conferences
          WWW '14: Proceedings of the 23rd international conference on World wide web
          April 2014
          926 pages
          ISBN:9781450327442
          DOI:10.1145/2566486

          Copyright © 2014 Copyright is held by the International World Wide Web Conference Committee (IW3C2).

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 7 April 2014

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          WWW '14 Paper Acceptance Rate84of645submissions,13%Overall Acceptance Rate1,899of8,196submissions,23%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader