ACM Home Page
Please provide us with feedback. Feedback
Collaborative filtering on skewed datasets
Full text pdf formatPdf (71 KB)
Source
International World Wide Web Conference archive
Proceeding of the 17th international conference on World Wide Web table of contents
Beijing, China
POSTER SESSION: Posters table of contents
Pages 1135-1136  
Year of Publication: 2008
ISBN:978-1-60558-085-2
Authors
Somnath Banerjee  Hewlett-Packard Labs, Bangalore, India
Krishnan Ramanathan  Hewlett-Packard Labs, Bangalore, India
Sponsor
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 0,   Downloads (12 Months): 0,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1367497.1367693
What is a DOI?

ABSTRACT

Many real life datasets have skewed distributions of events when the probability of observing few events far exceeds the others. In this paper, we observed that in skewed datasets the state of the art collaborative filtering methods perform worse than a simple probabilistic model. Our test bench includes a real ad click stream dataset which is naturally skewed. The same conclusion is obtained even from the popular movie rating dataset when we pose a binary prediction problem of whether a user will give maximum rating to a movie or not.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Adomavicius, G. and Tuzhilin, A. Toward the Next Generation of Recommender Systems: A Survey of the State-of-the-Art and Possible Extensions. IEEE Transactions on Knowledge and Data Engineering, 2005, Vol 17, 734--749.
 
2
Das, A. Datar, M. Garg, A. and Rajaram, S. Google News Personalization: Online Collaborative Filtering. Proceedings of the 16th international conference on World Wide Web, Banff, Canada, 2007.
 
3
Hofmann, T. Latent Semantic Models for Collaborative Filtering. ACM Transactions on Information Systems (TOIS), 2004, Vol 22, 89--115.
 
4
Hsu, C. N. Chung, H. H. and Huang, H. S. Mining Skewed and Sparse Transaction Data for Personalized Shopping Recommendation. Machine Learning, 2004, Vol 57, 35--59.

Collaborative Colleagues:
Somnath Banerjee: colleagues
Krishnan Ramanathan: colleagues