DOI: 10.1145/1458082.1458232
research-article

Modeling multi-step relevance propagation for expert finding

Published: 26 October 2008

ABSTRACT

An expert finding system allows a user to type a simple text query and retrieve names and contact information of individuals who possess the expertise expressed in the query. This paper proposes a novel approach to expert finding in large enterprises or intranets by modeling candidate experts (persons), web documents and various relations among them with so-called expertise graphs. In contrast to state-of-the-art approaches, which estimate personal expertise through one-step propagation of relevance probability from documents to the related candidates, our methods are based on the principle of multi-step relevance propagation in topic-specific expertise graphs. We model the process of expert finding by probabilistic random walks of three kinds: finite, infinite and absorbing. Experiments on TREC Enterprise Track data originating from two large organizations show that our methods using multi-step relevance propagation improve over the baseline method based on one-step propagation in almost all cases.
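To make the contrast between one-step and multi-step propagation concrete, here is a minimal sketch of a finite random walk on a toy expertise graph. It is not the authors' model: the abstract gives neither the graph construction nor the transition probabilities, so the three documents, two candidates, uniform transition probabilities, initial relevance values, and all function names below are assumptions made purely for illustration. Only the finite walk is shown; the infinite and absorbing variants are not covered.

```python
# Toy bipartite expertise graph (hypothetical): documents d1..d3, candidates c1..c2.
# Each node maps to its neighbours with an assumed uniform transition probability.
transitions = {
    "d1": {"c1": 1.0},
    "d2": {"c1": 0.5, "c2": 0.5},
    "d3": {"c2": 1.0},
    "c1": {"d1": 0.5, "d2": 0.5},
    "c2": {"d2": 0.5, "d3": 0.5},
}

# Assumed relevance probabilities of the documents for some query.
start = {"d1": 0.6, "d2": 0.3, "d3": 0.1}


def step(dist, transitions):
    """Move probability mass one step along the outgoing edges."""
    nxt = {}
    for node, mass in dist.items():
        for target, p in transitions[node].items():
            nxt[target] = nxt.get(target, 0.0) + mass * p
    return nxt


def finite_walk(start, transitions, steps):
    """Distribution over nodes after a fixed number of steps."""
    dist = dict(start)
    for _ in range(steps):
        dist = step(dist, transitions)
    return dist


# One step: relevance flows from documents directly to their candidates.
one_step = finite_walk(start, transitions, 1)

# Three steps: documents -> candidates -> documents -> candidates, so a
# candidate also receives mass through documents shared with other candidates.
three_step = finite_walk(start, transitions, 3)

for name, scores in (("one-step", one_step), ("three-step", three_step)):
    ranking = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
    print(name, ranking)
```

An odd number of steps is used so that the walk, which starts on documents, ends on candidates in this bipartite toy graph. In this example the extra steps shift some probability mass from c1 towards c2 through the shared document d2.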

      Published in

      CIKM '08: Proceedings of the 17th ACM conference on Information and knowledge management
      October 2008
      1562 pages
      ISBN: 9781595939913
      DOI: 10.1145/1458082

      Copyright © 2008 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Acceptance Rates

      Overall Acceptance Rate: 1,861 of 8,427 submissions, 22%
