ABSTRACT
Online communities have become popular for publishing and searching content, as well as for finding and connecting to other users. User-generated content includes, for example, personal blogs, bookmarks, and digital photos. These items can be annotated and rated by different users, and these social tags and derived user-specific scores can be leveraged for searching relevant content and discovering subjectively interesting items. Moreover, the relationships among users can also be taken into consideration for ranking search results, the intuition being that you trust the recommendations of your close friends more than those of your casual acquaintances.
Queries for tag or keyword combinations that compute and rank the top-k results thus face a large variety of options that complicate query processing and pose efficiency challenges. This paper addresses these issues by developing an incremental top-k algorithm with two-dimensional expansions: social expansion considers the strength of relations among users, and semantic expansion considers the relatedness of different tags. The new algorithm builds on the principles of threshold algorithms and folds friends and related tags into the search space incrementally and on demand. An experimental evaluation on three real-world datasets, crawled from del.icio.us, Flickr, and LibraryThing, demonstrates the excellent performance of the method.
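To make the threshold-algorithm principle mentioned in the abstract concrete, the following is a minimal Python sketch, not the paper's algorithm. It assumes each (user, tag) pair has a list of items sorted by descending score, and that each list carries a non-negative weight standing in for friendship strength or tag relatedness. The function name `top_k`, the weights, and the sample data are hypothetical. All lists are opened up front here, so the incremental on-demand expansion described in the abstract is omitted.

```python
import heapq


def top_k(lists, k):
    """Threshold-algorithm sketch over weighted, score-sorted lists.

    `lists` is a sequence of (weight, postings) pairs. Each postings list
    holds (item, score) tuples sorted by descending score. Weights and
    scores are assumed to be non-negative.
    """
    # Dictionaries give random access to an item's score in every list.
    lookup = [dict(postings) for _, postings in lists]

    def full_score(item):
        # Weighted sum over all lists; an absent item contributes 0.
        return sum(w * lookup[i].get(item, 0.0)
                   for i, (w, _) in enumerate(lists))

    seen = set()
    heap = []  # min-heap of (score, item) holding at most k entries
    max_depth = max((len(p) for _, p in lists), default=0)

    for depth in range(max_depth):
        threshold = 0.0  # upper bound on the score of any unseen item
        for w, postings in lists:
            if depth >= len(postings):
                continue  # exhausted list: unseen items score 0 here
            item, score = postings[depth]
            threshold += w * score
            if item not in seen:
                seen.add(item)
                entry = (full_score(item), item)
                if len(heap) < k:
                    heapq.heappush(heap, entry)
                elif entry > heap[0]:
                    heapq.heapreplace(heap, entry)
        # Stop once no unseen item can beat the current k-th best score.
        if len(heap) == k and heap[0][0] >= threshold:
            break

    ranked = sorted(heap, key=lambda e: (-e[0], e[1]))
    return [(item, score) for score, item in ranked]


# Hypothetical example: the querying user's own list for the query tag,
# a friend's list for the same tag (weight = friendship strength), and
# the user's list for a related tag (weight = tag relatedness).
example_lists = [
    (1.0, [("a", 0.75), ("b", 0.5)]),
    (0.5, [("c", 0.75), ("a", 0.5)]),
    (0.5, [("b", 0.5), ("d", 0.25)]),
]
result = top_k(example_lists, 2)
```

An incremental variant would open the lower-weight lists only when their weight could still change the top-k result, which is the idea the abstract attributes to the social and semantic expansions.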
Index Terms
- Efficient top-k querying over social-tagging networks