ACM Home Page
Please provide us with feedback. Feedback
Characterizing the query behavior in peer-to-peer file sharing systems
Full text PdfPdf (526 KB)
Source Internet Measurement Conference archive
Proceedings of the 4th ACM SIGCOMM conference on Internet measurement table of contents
Taormina, Sicily, Italy
SESSION: Traffic characterization table of contents
Pages: 55 - 67  
Year of Publication: 2004
ISBN:1-58113-821-0
Authors
Alexander Klemm  University of Dortmund, Dortmund, Germany
Christoph Lindemann  University of Dortmund, Dortmund, Germany
Mary K. Vernon  University of Wisconsin - Madison, Madison, WI
Oliver P. Waldhorst  University of Dortmund, Dortmund, Germany
Sponsors
SIGCOMM: ACM Special Interest Group on Data Communication
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 9,   Downloads (12 Months): 108,   Citation Count: 7
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1028788.1028796
What is a DOI?

ABSTRACT

This paper characterizes the query behavior of peers in a peer-to-peer (P2P) file sharing system. In contrast to previous work, which provides various aggregate workload statistics, we characterize peer behavior in a form that can be used for constructing representative synthetic workloads for evaluating new P2P system designs. In particular, the analysis exposes heterogeneous behavior that occurs on different days, in different geographical regions (i. e., Asia, Europe, and North America) or during different periods of the day. The workload measures include the fraction of connected sessions that are passive (i. e., issue no queries), the duration of such sessions, and for each active session, the number of queries issued, time until first query, query interarrival time, time after last query, and distribution of query popularity. Moreover, the key correlations in these workload measures are captured in the form of conditional distributions, such that the correlations can be accurately reproduced in a synthetic workload. The characterization is based on trace data gathered in the Gnutella P2P system over a period of 40 days. To characterize system-independent user behavior, we eliminate queries that are specific to the Gnutella system software, such as re-queries that are automatically issued by some client implementations to improve system responsiveness.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
E. Adar and B. Hubermann, Free Riding on Gnutella, Technical Report, Xerox PARC, 2000.
 
2
R. Bhagwan, S. Savage, and G. Voelker, Understanding Availability, Proc. 2nd Int. Workshop on P2P Systems, Berkeley, CA, 2002.
3
 
4
J. Chu, K. Labonte, and B. Levine, Availability and Locality Measurements of Peer-to-Peer File Systems, Proc. SPIE ITCom: Scalability and Traffic Control in IP Networks, Boston. MA, 2002.
5
 
6
Collab. Net and Sun Microsystems, 2003. http://www.jxta.org.
 
7
Z. Ge, D. R. Figueiredo, S. Jaiswal, J. Kurose, and D. Towsley, Modeling Peer-Peer File Sharing Systems, Proc. IEEE Conference on Computer Communications (INFOCOM '03), San Francisco, CA, 2003.
 
8
Gnutella Developer Forum, Gnutella - A Protocol for a Revolution, 2003. http://rfc-gnutella.sourceforge.net.
9
 
10
MaxMind, LLC, Geotargeting IP Address, http://www.maxmind.com.
 
11
 
12
Mutella Hompage. http://mutella.sourceforge.net.
 
13
Napster Homepage. http://www.napster.com.
14
15
 
16
S. Saroiu, K. Gummadi, and S. Gribble, A Measurement Study of Peer-to-Peer File Sharing Systems, Proc. Multimedia Computing and Networking (MMCN '02), San Jose, CA, 2002.
17
 
18
Sharman Networks Ltd., http://www. kazaa. org.
19
 
20
K. Sripanidkulchai, The Popularity of Gnutella Queries and its Implications on Scalability, Featured on O'Reilly's www. openp2p. com website, February 2001.
 
21
StreamCast Networks, http://www. morpheus.com.
 
22

CITED BY  7
 
 
 

Collaborative Colleagues:
Alexander Klemm: colleagues
Christoph Lindemann: colleagues
Mary K. Vernon: colleagues
Oliver P. Waldhorst: colleagues