skip to main content
10.1145/1183579.1183581acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
Article

Emerging semantic communities in peer web search

Published: 11 November 2006 Publication History

Abstract

Peer network systems are becoming an increasingly important development in Web search technology. Many studies show that peer search systems perform better when a query is sent to a group of peers semantically similar to the query. This suggests that semantic communities should form so that a query can quickly propagate to many appropriate peers. For the network to be functional, its dynamic communication topology must match the semantic clustering of peers. We introduce two criteria to evaluate a peer search network based on the concept of semantic locality: first, the "small-world" topology of the network; second, we use topical semantic similarity to monitor the quality of a peer's neighbors over time by looking at whether a peer chooses semantically appropriate neighbors to route its queries. We present several simulation experiments conducted with different peer search algorithms on our peer Web search system, 6S. The results suggest that 6S, despite its use of an unstructured overlay network; can effectively foster the spontaneous formation of semantic communities through local peer interactions alone.

References

[1]
R. Akavipat, L.-S. Wu, and F. Menczer. Small world peer networks in distributed Web search. In Alt. Track Papers and Posters Proc. 13th International World Wide Web Conference, pages 396--397, 2004.
[2]
E. Amitay, D. Carmel, R. Lempel, and A. Soffer. Scaling ir-system evaluation using term relevance sets. In SIGIR '04: Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval, pages 10--17, New York, NY, USA, 2004. ACM Press.
[3]
S. Androutsellis-Theotokis and D. Spinellis. A survey of peer-to-peer content distribution technologies. ACM Comput. Surv., 36(4):335--371, 2004.
[4]
R. Baeza-Yates and B. Ribeiro-Neto. Modern Information Retreival. ACM Press, New York, 1999.
[5]
M. Bawa, R. Bayardo Jr, S. Rajagoplan, and E. Shekita. Make it fresh, make it quick --- searching a network of personal webservers. In Proc. 12th International World Wide Web Conference, 2003.
[6]
H. Chu and M. Rosenthal. Search engines for the World Wide Web: A comparative study and evaluation methodology. In Annual Conference Proceedings (ASIS'96), pages 127--135, October 1996.
[7]
A. Clauset and C. Moore. How do networks become navigable? Technical report, arXiv.org:cond-mat/0309415, 2004.
[8]
A. Crespo and H. Garcia-Molina. Semantic overlay networks for P2P systems. Technical report, Computer Science Department, Stanford University, 2002.
[9]
S. Deerwester, S. Dumais, F. GW, T. Landauer, and R. Harshman. Indexing by Latent Semantic Analysis. Journal of the American Society for Information Science, 41:391--407, 1990.
[10]
M. Girvan and M. Newman. Community structure in social and biological networks. Proc. Natl. Acad. Sci. USA, 99:8271--8276, 2002.
[11]
S. Joseph. Neurogrid: Semantically routing queries in Peer-to-Peer networks. In Proc. Intl. Workshop on Peer-to-Peer Computing, 2002.
[12]
V. Kalogeraki, D. Gunopulos, and D. Zeinalipour-Yazti. A local search mechanism for peer-to-peer networks. In Proc. 11th Intl. Conf. on Information and Knowledge Management (CIKM), 2002.
[13]
M. Khambatti, K. Ryu, and P. Dasgupta. Efficient discovery of implicitly formed P2P communities. International Journal of Parallel and Distributed Systems and Networks, 5(4), 2002.
[14]
I. A. Klampanos, V. Poznański, J. M. Jose, and P. Dickman. A suite of testbeds for the realistic evaluation of peer-to-peer information retrieval systems. Lecture Notes in Computer Science, 3408:38--51, 2005.
[15]
D. Leake, A. Maguitman, and T. Reichherzer. Exploiting rich context: An incremental approach to context-based web search. In International and Interdisciplinary Conference on Modeling and Using Context, CONTEXT'05, pages 254--267, Paris, France, July 2005. Springer.
[16]
D. Lin. An information-theoretic definition of similarity. In Proceedings of the Fifteenth International Conference on Machine Learning, pages 296--304. Morgan Kaufmann Publishers Inc., 1998.
[17]
J. Lu and J. Callan. Content-based retrieval in hybrid peer-to-peer networks. In Proc. 12th Intl. Conf. on Information and Knowledge Management (CIKM'03), 2003.
[18]
J. Lu and J. Callan. Federated search of text digital libraries in hierarchical peer-to-peer networks. In Proc. 27th European Conference on Information Retrieval (ECIR), 2005.
[19]
A. G. Maguitman, F. Menczer, F. Erdinc, H. Roinestad, and A. Vespignani. Algorithmic computation and approximation of semantic similarity. World Wide Web, 2006. Published online at http://dx.doi.org/10.1007/s11280-006-8562-2.
[20]
A. G. Maguitman, F. Menczer, H. Roinestad, and A. Vespignani. Algorithmic detection of semantic similarity. In WWW '05: Proceedings of the 14th International Conference on World Wide Web, pages 107--116, New York, NY, USA, 2005. ACM Press.
[21]
A. G. Maguitman, F. Menczer, H. Roinestad, and A. Vespignani. Algorithmic detection of semantic similarity. In Proc. 14th International World Wide Web Conference, pages 107--116, 2005.
[22]
G. Pant, P. Srinivasan, and F. Menczer. Crawling the Web. In M. Levene and A. Poulovassilis, editors, Web Dynamics. Springer, 2004.
[23]
J. Pujol, R. Sangüesa, and J. Bermúdez. Porqpine: A distributed and collaborative search engine. In Proc. 12th Intl. World Wide Web Conference, 2003.
[24]
F. Radicchi, C. Castellano, F. Cecconi, V. Loreto, and D. Parisi. Defining and identifying communities in networks. Proc. Nat. Acad. Sci. USA, 101(9):2658--2663, 2004.
[25]
P. Resnik. Using information content to evaluate semantic similarity in a taxonomy. In IJCAI, pages 448--453, 1995.
[26]
T. Saracevic. Evaluation of evaluation in information retrieval. In Proceedings of SIGIR, pages 138--146, 1995.
[27]
K. Sripanidkulchai, B. Maggs, and H. Zhang. Efficient content location using interest-based locality in peer-to-peer systems. In Proc. INFOCOM Conference, 2004.
[28]
C. Suel, J.-W. Wu, J. Zhang, A. Delis, M. Kharrazi, X. Long, and K. Shanmugasundaram. ODISSEA: A Peer-to-Peer architecture for scalable Web search and information retrieval. In International Workshop on the Web and Databases (WebDB), 2003.
[29]
C. Tang, Z. Xu, and S. Dwarkadas. Peer-to-peer information retrieval using self-organizing semantic overlay networks. In Proc. ACM SIGCOMM '03, 2003.
[30]
D. Tsoumakos and N. Roussopoulos. Adaptive probabilistic search for peer-to-peer networks. In Proc. 3rd International Conference on Peer-to-Peer Computing (P2P), 2003.
[31]
S. Voulgaris, A. Kermarrec, L. Massoulié, and M. van Steen. Exploiting semantic proximity in peer-to-peer content searching. In Proc.10th Intl. Workshop on Future Trends in Distributed Computing Systems (FTDCS), 2004.
[32]
S. Waterhouse. JXTA Search: Distributed search for distributed networks. Technical report, Sun Microsystems Inc., 2001.
[33]
D. Watts and S. Strogatz. Collective dynamics of "small-world" networks. Nature, 393:440--442, 1998.
[34]
L. Wishard. Precision among Internet search engines: An earth sciences case study. Issues in Science and Technology Librarianship, Spring 1998.
[35]
L.-S. Wu, R. Akavipat, and F. Menczer. 6S: Distributing crawling and searching across Web peers. In Proceedings of the IASTED International Conference on Web technologies, Applications, and Services, Calgary, Canada, July 2005.

Cited By

View all
  • (2019)On Convergence of Controlled Snowball Sampling for Scientific Abstracts CollectionInformation and Communication Technologies in Education, Research, and Industrial Applications10.1007/978-3-030-13929-2_2(18-42)Online publication date: 14-Feb-2019
  • (2016)ArgP2P: An Argumentative Approach for Intelligent Query Routing in P2P NetworksTheory and Applications of Formal Argumentation10.1007/978-3-319-28460-6_12(194-210)Online publication date: 7-Jan-2016
  • (2013)A Fully Distributed Scheme for Discovery of Semantic RelationshipsIEEE Transactions on Services Computing10.1109/TSC.2012.166:4(457-469)Online publication date: 1-Oct-2013
  • Show More Cited By

Index Terms

  1. Emerging semantic communities in peer web search

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      P2PIR '06: Proceedings of the international workshop on Information retrieval in peer-to-peer networks
      November 2006
      66 pages
      ISBN:1595935274
      DOI:10.1145/1183579
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 11 November 2006

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. coverage
      2. global coherence
      3. peer search
      4. semantic locality
      5. small-world networks
      6. topical semantic similarity

      Qualifiers

      • Article

      Conference

      CIKM06
      Sponsor:
      CIKM06: Conference on Information and Knowledge Management
      November 11, 2006
      Virginia, Arlington, USA

      Upcoming Conference

      CIKM '25

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)1
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 07 Mar 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2019)On Convergence of Controlled Snowball Sampling for Scientific Abstracts CollectionInformation and Communication Technologies in Education, Research, and Industrial Applications10.1007/978-3-030-13929-2_2(18-42)Online publication date: 14-Feb-2019
      • (2016)ArgP2P: An Argumentative Approach for Intelligent Query Routing in P2P NetworksTheory and Applications of Formal Argumentation10.1007/978-3-319-28460-6_12(194-210)Online publication date: 7-Jan-2016
      • (2013)A Fully Distributed Scheme for Discovery of Semantic RelationshipsIEEE Transactions on Services Computing10.1109/TSC.2012.166:4(457-469)Online publication date: 1-Oct-2013
      • (2013)A study of relevance propagation in large topic ontologiesJournal of the American Society for Information Science and Technology10.1002/asi.2292564:11(2238-2255)Online publication date: 3-Sep-2013
      • (2012)Peer-to-Peer Information RetrievalACM Transactions on Information Systems10.1145/2180868.218087130:2(1-34)Online publication date: 1-May-2012
      • (2011)Automatic Discovery and Transfer of Task Hierarchies in Reinforcement LearningAI Magazine10.1609/aimag.v32i1.234232:1(35-50)Online publication date: 1-Mar-2011
      • (2011)The Case for Case‐Based Transfer LearningAI Magazine10.1609/aimag.v32i1.233132:1(54-69)Online publication date: 1-Mar-2011
      • (2011)Deep TransferAI Magazine10.1609/aimag.v32i1.233032:1(51-53)Online publication date: 1-Mar-2011
      • (2011)An Introduction to Intertask Transfer for Reinforcement LearningAI Magazine10.1609/aimag.v32i1.232932:1(15-34)Online publication date: 1-Mar-2011
      • (2011)CoFeed: privacy‐preserving Web search recommendation based on collaborative aggregation of interest feedbackSoftware: Practice and Experience10.1002/spe.112743:10(1165-1184)Online publication date: 6-Oct-2011
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media