ABSTRACT
To provide a more robust context for personalization, we aim to extract a continuum of general (long-term) to specific (short-term) interests of a user. Our proposed approach is to learn a user interest hierarchy (UIH) from a set of web pages visited by a user. We devise a divisive hierarchical clustering (DHC) algorithm to group words (topics) into a hierarchy in which more general interests are represented by larger sets of words. Each web page can then be assigned to nodes in the hierarchy for further processing in learning and predicting interests. This approach is analogous to building a subject taxonomy for a library catalog system and assigning books to the taxonomy. Our approach requires no user involvement and learns the UIH "implicitly." Furthermore, it allows the original objects, web pages, to be assigned to multiple topics (nodes in the hierarchy). In this paper, we focus on learning the UIH from a set of visited pages. We propose a few similarity functions and dynamic threshold-finding methods, and evaluate the resulting hierarchies according to their meaningfulness and shape.
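The idea of dividing a set of words into a hierarchy can be illustrated with a minimal sketch. It is not the paper's DHC algorithm: the abstract does not specify the similarity functions or the threshold-finding methods, so the sketch assumes Jaccard similarity over the pages in which two words co-occur, and uses the mean pairwise similarity within a cluster as a stand-in dynamic threshold. The names `jaccard`, `components`, and `build_hierarchy` are hypothetical.

```python
from itertools import combinations


def jaccard(word_a, word_b, pages):
    """Assumed similarity: overlap of the sets of pages containing each word."""
    in_a = {i for i, page in enumerate(pages) if word_a in page}
    in_b = {i for i, page in enumerate(pages) if word_b in page}
    union = in_a | in_b
    return len(in_a & in_b) / len(union) if union else 0.0


def components(words, edges):
    """Connected components of the graph whose vertices are words."""
    adjacency = {w: set() for w in words}
    for a, b in edges:
        adjacency[a].add(b)
        adjacency[b].add(a)
    seen, result = set(), []
    for start in sorted(words):
        if start in seen:
            continue
        stack, group = [start], set()
        while stack:
            w = stack.pop()
            if w in group:
                continue
            group.add(w)
            stack.extend(adjacency[w] - group)
        seen |= group
        result.append(group)
    return result


def build_hierarchy(words, pages, min_size=2):
    """Recursively split a word cluster; each node keeps its full word set."""
    node = {"words": sorted(words), "children": []}
    if len(words) <= min_size:
        return node
    sims = {(a, b): jaccard(a, b, pages)
            for a, b in combinations(sorted(words), 2)}
    # Assumed dynamic threshold: mean similarity inside this cluster.
    threshold = sum(sims.values()) / len(sims)
    strong = [pair for pair, s in sims.items() if s > threshold]
    for group in components(words, strong):
        # Keep only proper sub-clusters that are large enough to be a topic.
        if min_size <= len(group) < len(words):
            node["children"].append(build_hierarchy(group, pages, min_size))
    return node


# Toy data: each visited page is reduced to a set of words.
pages = [{"java", "python"}, {"java", "python"},
         {"guitar", "piano"}, {"guitar", "piano"}]
root = build_hierarchy({"java", "python", "guitar", "piano"}, pages)
```

In this toy run the root holds all four words (the most general interest) and has two children, one per group of co-occurring words. A page could then be attached to every node whose word set it overlaps, which is one way to let a page belong to multiple topics.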
Index Terms
Learning implicit user interest hierarchy for context in personalization