skip to main content
10.1145/1008992.1009053acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
Article

A nonparametric hierarchical bayesian framework for information filtering

Published: 25 July 2004 Publication History

Abstract

Information filtering has made considerable progress in recent years. The predominant approaches are content-based methods and collaborative methods. Researchers have largely concentrated on either of the two approaches since a principled unifying framework is still lacking. This paper suggests that both approaches can be combined under a hierarchical Bayesian framework. Individual content-based user profiles are generated and collaboration between various user models is achieved via a common learned prior distribution. However, it turns out that a parametric distribution (e.g. Gaussian) is too restrictive to describe such a common learned prior distribution. We thus introduce a nonparametric common prior, which is a sample generated from a Dirichlet process which assumes the role of a hyper prior. We describe effective means to learn this nonparametric distribution, and apply it to learn users' information needs. The resultant algorithm is simple and understandable, and offers a principled solution to combine content-based filtering and collaborative filtering. Within our framework, we are now able to interpret various existing techniques from a unifying point of view. Finally we demonstrate the empirical success of the proposed information filtering methods.

References

[1]
C. E. Antoniak. Mixtures of dirichlet processes with applications to Bayesian nonparametric problems. Annals of Statistics, 2(6), Nov. 1974.
[2]
M. Balabanovic and Y. Shoham. Fab: Content-based, collaborative recommendation. Communications of the ACM, 40(3):66--72, 1997.
[3]
C. Basu, H. Hirsh, and W. W. Cohen. Recommendation as classification: Using social and content-based information in recommendation. In Proceedings of the Fifteenth National Conference on Artificial Intelligencen AAAI/IAAI, pages 714--720, 1998.
[4]
D. Billsus and M. J. Pazzani. Learning collaborative information filters. In Proceedings of the 15th International Conference on Machine Learning, pages 46--54. Morgan Kaufmann, San Francisco, CA, 1998.
[5]
D. Blei, T. L. Griffiths, M. I. Jordan, and J. B. Tenenbaum. Hierarichical topic models and the nested chinese restaurant process. In Advances in Neural Information Processing Systems 16. MIT Press, 2004.
[6]
J. S. Breese, D. Heckerman, and C. Kadie. Empirical analysis of predictive algorithms for collaborative filtering. In Proceedings of the 14th Conference on Uncertainty in Artificial Intelligence, pages 43--52, 1998.
[7]
M. Claypool, A. Gokhale, T. Miranda, P. Murnikov, D. Netes, and M. Sartin. Combining content-based and collaborative filtering in an online newspaper. In Proceedings of ACM SIGIR Workshop on Recommender Systems, August 1999.
[8]
M. D. Escobar and M. West. Bayesian density estimation and inference using mixtures. Journal of the American Statistical Association, 90(430), June 1995.
[9]
P. Melville, R. J. Mooney, and R. Nagarajan. Content-boosted collaborative filtering for improved recommendations. In Proceedings of the Eighteenth National Conference on Artificial Intelligence (AAAI-2002), pages 187--192, Edmonton, Canada, 2002.
[10]
R. Mooney and L. Roy. Content-based book recommending using learning for text categorization. In Proceedings of the Fifth ACM Conference on Digital Libaries, pages 195--204, San Antonio, US, 2000. ACM Press, New York, US.
[11]
R. M. Neal. Markov chain sampling methods for dirichlet process mixture models. Technical Report 9815, Dept. of Statistics, University of Toronto.
[12]
M. Pazzani. A framework for collaborative, content-based and demographic filtering. Artificial Intelligence Review, 13(5--6):393--408, 1999.
[13]
M. Pazzani, J. Muramastsu, and D. Billsus. Syskill and webert: Identifying interesting web sites. In Proceedings of the Thirteenth National Conference on Artificial Intelligence, pages 54--61, Portland, OR, August 1996.
[14]
D. M. Pennock, E. Horvitz, S. Lawrence, and C. Giles. Collaborative filtering by personality diagnosis: A hybrid memory- and model-based approach. In Proc. of the 16th Conference on Uncertainty in Artificial Intelligence, pages 473--480, 2000.
[15]
J. C. Platt. Probabilities for SV machines. In A. Smola, P. Bartlett, B. Scholkopf, and D. Schuurmans, editors, Advances in Large Margin Classifiers, pages 61--74, Cambridge, MA, 1999. MIT Press.
[16]
A. Popescul, L. Ungar, D. Pennock, and S. Lawrence. Probabilistic models for unified collaborative and content-based recommendation in sparse-data environments. In 17th Conference on Uncertainty in Artificial Intelligence, pages 437--444, Seattle, Washington, August 2--5 2001.
[17]
C. E. Rasmussen and Z. Ghahramani. Infinite mixtures of gaussian process experts. In T. G. Dietterich, S. Becker, and Z. Ghahramani, editors, Advances in Neural Information Processing Systems 14, 2002.
[18]
P. Resnick, N. Iacovou, M. Sushak, P. Bergstrom, and J. Riedl. Grouplens: An open architecture for collaborative filtering of netnews. In Proceedings of the 1994 Computer Supported Collaborative Work Conference, pages 175--186. ACM, 1994.
[19]
J. J. Rocchio. Relevance feedback in information retrieval. In The SMART Retrieval System: Experiments in Automatic Document Processing, pages 313--323. Prentice Hall, 1971.
[20]
U. Shardanand and P. Maes. Social information filtering algorithms for automating `word of mouth'. In Proceedings of ACM CHI'95 Conference, 1995.
[21]
K. Yu, A. Schwaighofer, V. Tresp, W.-Y. Ma, and H. Zhang. Collaborative ensemble learning: Combining collaborative and content-based information filtering via hierarchical Bayes. In Proceedings of the 19th Conference on Uncertainty in Artificial Intelligence (UAI), 2003.

Cited By

View all
  • (2024)Knowledge-Based Commercial Real Estate Recommender SystemAdvances in Artificial Intelligence-Empowered Decision Support Systems10.1007/978-3-031-62316-5_8(197-224)Online publication date: 28-Jun-2024
  • (2019)Recommendation-based Team Formation for On-demand Taxi-calling PlatformsProceedings of the 28th ACM International Conference on Information and Knowledge Management10.1145/3357384.3357869(59-68)Online publication date: 3-Nov-2019
  • (2019)A Continuously Updated, Computationally Efficient Stress Recognition Framework Using Electroencephalogram (EEG) by Applying Online Multitask Learning Algorithms (OMTL)IEEE Journal of Biomedical and Health Informatics10.1109/JBHI.2018.287096323:5(1928-1939)Online publication date: Sep-2019
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGIR '04: Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
July 2004
624 pages
ISBN:1581138814
DOI:10.1145/1008992
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 July 2004

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. collaborative filtering
  2. content-based filtering
  3. dirichlet process
  4. nonparametric bayesian modelling

Qualifiers

  • Article

Conference

SIGIR04
Sponsor:

Acceptance Rates

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)8
  • Downloads (Last 6 weeks)1
Reflects downloads up to 17 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Knowledge-Based Commercial Real Estate Recommender SystemAdvances in Artificial Intelligence-Empowered Decision Support Systems10.1007/978-3-031-62316-5_8(197-224)Online publication date: 28-Jun-2024
  • (2019)Recommendation-based Team Formation for On-demand Taxi-calling PlatformsProceedings of the 28th ACM International Conference on Information and Knowledge Management10.1145/3357384.3357869(59-68)Online publication date: 3-Nov-2019
  • (2019)A Continuously Updated, Computationally Efficient Stress Recognition Framework Using Electroencephalogram (EEG) by Applying Online Multitask Learning Algorithms (OMTL)IEEE Journal of Biomedical and Health Informatics10.1109/JBHI.2018.287096323:5(1928-1939)Online publication date: Sep-2019
  • (2015)Recommender system application developmentsDecision Support Systems10.1016/j.dss.2015.03.00874:C(12-32)Online publication date: 1-Jun-2015
  • (2015)Data Mining Methods for Recommender SystemsRecommender Systems Handbook10.1007/978-1-4899-7637-6_7(227-262)Online publication date: 2015
  • (2014)Hand-shape classification with a wrist contour sensorInternational Journal of Robotics Research10.1177/027836491350798433:4(658-671)Online publication date: 1-Apr-2014
  • (2014)Learning with dual heterogeneityProceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining10.1145/2623330.2623727(582-590)Online publication date: 24-Aug-2014
  • (2014)Exploiting User Preference for Online Learning in Web Content Optimization SystemsACM Transactions on Intelligent Systems and Technology10.1145/24932595:2(1-23)Online publication date: 30-Apr-2014
  • (2014)Predicting Multiple Attributes via Relative Multi-task LearningProceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition10.1109/CVPR.2014.135(1027-1034)Online publication date: 23-Jun-2014
  • (2013)User Action Interpretation for Online Content OptimizationIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2012.13025:9(2161-2174)Online publication date: 1-Sep-2013
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media