ABSTRACT
We examine the creation of a tag cloud for exploring and understanding a set of objects (e.g., web pages, documents). In the first part of our work, we present a formal system model for reasoning about tag clouds. We then present metrics that capture the structural properties of a tag cloud, and we briefly present a set of tag selection algorithms that are used in current sites (e.g., del.icio.us, Flickr, Technorati) or that have been described in recent work. In order to evaluate the results of these algorithms, we devise a novel synthetic user model. This user model is specifically tailored for tag cloud evaluation and assumes an "ideal" user. We evaluate the algorithms under this user model, as well as the model itself, using two datasets: CourseRank (a Stanford social tool containing information about courses) and del.icio.us (a social bookmarking site). The results yield insights as to when and why certain selection schemes work best.
Supplemental Material
- Courserank. http://www.courserank.com.Google Scholar
- Search cloudlet. http://www.getcloudlet.com/.Google Scholar
- F. Bonchi, C. Castillo, D. Donato, and A. Gionis. Topical query decomposition. In KDD, pages 52---60, 2008. Google ScholarDigital Library
- K. Chakrabarti, S. Chaudhuri, and S.--w. Hwang. Automatic categorization of query results. In SIGMOD, pages 755--766, 2004. Google ScholarDigital Library
- F. Gelgi, H. Davulcu, and S. Vadrevu. Term ranking for clustering web search results. In WebDB, 2007.Google Scholar
- J. Good, C. Shergold, M. Gheorghiu, and J. Davies. Using tag clouds to facilitate search: An evaluation, 1997.Google Scholar
- M. A. Hearst and J. O. Pedersen. Reexamining the cluster hypothesis: Scatter/gather on retrieval results. In SIGIR, pages 76--84, 1996. Google ScholarDigital Library
- G. Koutrika, Z. M. Zadeh, and H. Garcia-Molina. Data clouds: summarizing keyword search results over structured data. In EDBT, pages 391--402, 2009. Google ScholarDigital Library
- B. Kuo, T. Hentrich, B. M. Good, and M. Wilkinson. Tag clouds for summarizing web search results. In WWW, pages 1203--1204, 2007. Google ScholarDigital Library
- I. Masowska. Phrase--based hierarchical clustering of web search results. In ECIR, pages 555--562, 2003. Google ScholarDigital Library
- K. Nigam, A. K. McCallum, S. Thrun, and T. Mitchell. Text classification from labeled and unlabeled documents using em. Mach. Learn., 39(2-3):103--134, 2000. Google ScholarDigital Library
- S. Osinski, J. Stefanowski, and D. Weiss. Lingo: Search results clustering algorithm based on singular value decomposition. In Intelligent Inf. Sys., pages 359--368, 2004.Google ScholarCross Ref
- A. W. Rivadeneira, D. M. Gruen, M. J. Muller, and D. R. Millen. Getting our head in the clouds: toward evaluation studies of tagclouds. In CHI, 2007. Google ScholarDigital Library
- M. Sydow, F. Bonchi, C. Castillo, and D. Donato. Optimising topical query decomposition. In WSCD, pages 43--47, 2009. Google ScholarDigital Library
- V. V. Vazirani. Approximation Algorithms. Springer, March 2004.Google Scholar
- Wikipedia. Tag cloud -- wikipedia, the free encyclopedia, 2010. {Online; accessed 27--February--2010}.Google Scholar
- O. Zamir and O. Etzioni. Grouper: A dynamic clustering interface to web search results. Computer Networks, 31(11--16):1361--1374, 1999. Google ScholarDigital Library
- H.-J. Zeng, Q.-C. He, Z. Chen, W.-Y. Ma, and J. Ma. Learning to cluster web search results. In SIGIR, pages 210--217, 2004. Google ScholarDigital Library
Index Terms
- On the selection of tags for tag clouds
Recommendations
Differential Tag Clouds: Highlighting Particular Features in Documents
WI-IAT '09: Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 03This paper introduces the concepts of “summary tag cloud” and “differential tag cloud”. A summary tag cloud is a unique tag cloud that summarizes the contents of all the documents in a set. A differential tag cloud is a tag cloud that highlights the ...
Tag clouds as social signallers
OZCHI '10: Proceedings of the 22nd Conference of the Computer-Human Interaction Special Interest Group of Australia on Computer-Human InteractionTag clouds are becoming increasingly popular visualisation and interaction techniques used on the web today. At the same time, tag clouds have been shown to have somewhat limited capabilities and usefulness. The generation of personalised tag clouds ...
Visual Search Strategies of Tag Clouds - Results from an Eyetracking Study
INTERACT '09: Proceedings of the 12th IFIP TC 13 International Conference on Human-Computer Interaction: Part IITag clouds have become a frequently used interaction technique in the web in the past couple of years. Research has shown the influence of variables such as tag size and location on the perception of tag clouds. However, several questions remain ...
Comments