skip to main content
10.1145/1995966.1995973acmconferencesArticle/Chapter ViewAbstractPublication PageshtConference Proceedingsconference-collections
research-article

Bridging link and query intent to enhance web search

Published: 06 June 2011 Publication History

Abstract

Understanding query intent is essential to generating appropriate rankings for users. Existing methods have provided customized rankings to answer queries with different intent. While previous methods have shown improvement over their non-discriminating counterparts, the web authors' intent when creating a hyperlink is seldom taken into consideration. To mitigate this gap, we categorize hyperlinks into two types that are reasonably comparable to query intent, i.e., links describing the target page's identity and links describing the target page's content. We argue that emphasis on one type of link when ranking documents can benefit the retrieval for that type of query. We start by presenting a link intent classification approach based on the link context representations that captures evidence from anchors, target pages, and their associated links, and then introduce our enhanced retrieval model that incorporates link intent into the estimation of anchor text importance. Comparative experiments on two large scale web corpora demonstrate the efficacy of our approaches.

References

[1]
E. Acar, D. M. Dunlavy, and T. G. Kolda. Link prediction on evolving data using matrix and tensor factorizations. In LDMTA'09: Proceeding of the ICDM'09 Workshop on Large Scale Data Mining Theory and Applications. IEEE Computer Society Press, December 2009. In press.
[2]
J. Allan, B. Carterette, B. Dachev, J. A. Aslam, V. Pavlu, and E. Kanoulas. Million query track 2007 overview. In TREC, 2007.
[3]
A. Broder. A taxonomy of web search. SIGIR Forum, 36(2):3--10, Fall 2002.
[4]
J. Cho, H. Garcia-Molina, T. Haveliwala, W. Lam, A. Paepcke, S. Raghavan, and G. Wesley. Stanford WebBase components and applications. ACM Trans.\ on Internet Technology, 6(2):153--186, 2006.
[5]
C. L. Clarke, N. Craswell, and I. Soboroff. Overview of the trec 2009 web track. Technical report, NIST, no date.
[6]
N. Craswell, D. Hawking, and S. Robertson. Effective site finding using link anchor information. In Proc. of the 24th Annual ACM SIGIR International Conference on Research and Development in Information Retrieval, pages 250--257, New Orleans, LA, Sept. 2001.
[7]
N. Dai, X. Qi, and B. D. Davison. Enhancing web search with entity intent. In Companion Proceedings of the 20th International World Wide Web Conference (WWW), pages 29--30, Mar. 2011.
[8]
Z. Dou, R. Song, J.-Y. Nie, and J.-R. Wen. Using anchor texts with their hyperlink structure for web search. In Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 227--234. ACM, 2009.
[9]
N. Eiron and K. S. McCurley. Analysis of anchor text for web search. In Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 459--460, New York, NY, USA, 2003. ACM.
[10]
A. Fujii. Modeling anchor text and classifying queries to enhance web document retrieval. In Proceeding of the 17th International Conference on World Wide Web, pages 337--346, New York, NY, USA, 2008. ACM.
[11]
J. Hirai, S. Raghavan, H. Garcia-Molina, and A. Paepcke. WebBase: A repository of web pages. Computer Networks, 33(1--6):277--293, May 2000. Proc. of the 9th Int'l WWW Conf.
[12]
B. J. Jansen, A. Spink, and T. Saracevic. Real life, real users, and real needs: A study and analysis of user queries on the web. Information Processing and Management, 36(2):207--227, 2000.
[13]
K. Jarvelin and J. Kekalainen. IR evaluation methods for retrieving highly relevant documents. In Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 41--48, July 2000.
[14]
I.-H. Kang and G. Kim. Query type classification for web document retrieval. In Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 64--71, New York, NY, 2003. ACM Press.
[15]
R. Kraft and J. Zien. Mining anchor text for query refinement. In Proceedings of the 13th International Conference on World Wide Web, pages 666--674, New York, NY, USA, 2004. ACM.
[16]
U. Lee, Z. Liu, and J. Cho. Automatic identification of user goals in web search. In Proceedings of the 14th World Wide Web Conference (WWW), pages 391--400, New York, NY, 2005. ACM Press.
[17]
W.-H. Lu, L.-F. Chien, and H.-J. Lee. Anchor text mining for translation of web queries: A transitive translation approach. ACM Trans. Inf. Syst., 22(2):242--269, 2004.
[18]
D. Metzler, J. Novak, H. Cui, and S. Reddy. Building enriched document representations using aggregated anchor text. In Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 219--226. ACM, 2009.
[19]
L. Nie, B. D. Davison, and X. Qi. Topical link analysis for web search. In Proc. of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 91--98, Aug. 2006.
[20]
X. Qi, L. Nie, and B. D. Davison. Measuring similarity to detect qualified links. In Proceedings of the Third International Workshop on Adversarial Information Retrieval on the Web (AIRWeb), pages 49--56, May 2007.
[21]
S. Robertson, H. Zaragoza, and M. Taylor. Simple BM25 extension to multiple weighted fields. In Proceedings of the 13th ACM Conference on Information and Knowledge Management, pages 42--49, New York, NY, USA, 2004. ACM.
[22]
S. E. Robertson. Overview of the OKAPI projects. Journal of Documentation, 53:3--7, 1997.
[23]
B. Taskar, M. fai Wong, P. Abbeel, and D. Koller. Link prediction in relational data. In Advances in Neural Information Processing Systems, 2003.
[24]
T. Westerveld, W. Kraaij, and D. Hiemstra. Retrieving web pages using content, links, urls and anchors. In Proceedings of the 10th Text Retrieval Conference (TREC), pages 663--672. NIST, 2001.
[25]
K. Yu and W. Chu. Gaussian process models for link analysis and transfer learning. In Advances in Neural Information Processing Systems 20, pages 1657--1664. MIT Press, 2008.

Cited By

View all
  • (2016)User Intent in Multimedia SearchACM Computing Surveys10.1145/295493049:2(1-37)Online publication date: 13-Aug-2016
  • (2015)Lexico Semantic Patterns for Customer Intentions Analysis of MicrobloggingProceedings of the 2015 11th International Conference on Semantics, Knowledge and Grids (SKG)10.1109/SKG.2015.40(222-226)Online publication date: 19-Aug-2015

Index Terms

  1. Bridging link and query intent to enhance web search

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    HT '11: Proceedings of the 22nd ACM conference on Hypertext and hypermedia
    June 2011
    348 pages
    ISBN:9781450302562
    DOI:10.1145/1995966
    • General Chair:
    • Paul De Bra,
    • Program Chair:
    • Kaj Grønbæk
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 06 June 2011

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Kronecker product
    2. anchor text
    3. link intent
    4. query intent
    5. term weighting

    Qualifiers

    • Research-article

    Conference

    HT '11
    Sponsor:
    HT '11: 22nd ACM Conference on Hypertext and Hypermedia
    June 6 - 9, 2011
    Eindhoven, The Netherlands

    Acceptance Rates

    Overall Acceptance Rate 378 of 1,158 submissions, 33%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 18 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2016)User Intent in Multimedia SearchACM Computing Surveys10.1145/295493049:2(1-37)Online publication date: 13-Aug-2016
    • (2015)Lexico Semantic Patterns for Customer Intentions Analysis of MicrobloggingProceedings of the 2015 11th International Conference on Semantics, Knowledge and Grids (SKG)10.1109/SKG.2015.40(222-226)Online publication date: 19-Aug-2015

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media