|
ABSTRACT
This paper characterizes the query behavior of peers in a peer-to-peer (P2P) file sharing system. In contrast to previous work, which provides various aggregate workload statistics, we characterize peer behavior in a form that can be used for constructing representative synthetic workloads for evaluating new P2P system designs. In particular, the analysis exposes heterogeneous behavior that occurs on different days, in different geographical regions (i. e., Asia, Europe, and North America) or during different periods of the day. The workload measures include the fraction of connected sessions that are passive (i. e., issue no queries), the duration of such sessions, and for each active session, the number of queries issued, time until first query, query interarrival time, time after last query, and distribution of query popularity. Moreover, the key correlations in these workload measures are captured in the form of conditional distributions, such that the correlations can be accurately reproduced in a synthetic workload. The characterization is based on trace data gathered in the Gnutella P2P system over a period of 40 days. To characterize system-independent user behavior, we eliminate queries that are specific to the Gnutella system software, such as re-queries that are automatically issued by some client implementations to improve system responsiveness.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
E. Adar and B. Hubermann, Free Riding on Gnutella, Technical Report, Xerox PARC, 2000.
|
| |
2
|
R. Bhagwan, S. Savage, and G. Voelker, Understanding Availability, Proc. 2nd Int. Workshop on P2P Systems, Berkeley, CA, 2002.
|
 |
3
|
Yatin Chawathe , Sylvia Ratnasamy , Lee Breslau , Nick Lanham , Scott Shenker, Making gnutella-like P2P systems scalable, Proceedings of the 2003 conference on Applications, technologies, architectures, and protocols for computer communications, August 25-29, 2003, Karlsruhe, Germany
[doi> 10.1145/863955.864000]
|
| |
4
|
J. Chu, K. Labonte, and B. Levine, Availability and Locality Measurements of Peer-to-Peer File Systems, Proc. SPIE ITCom: Scalability and Traffic Control in IP Networks, Boston. MA, 2002.
|
 |
5
|
Edith Cohen , Scott Shenker, Replication strategies in unstructured peer-to-peer networks, Proceedings of the 2002 conference on Applications, technologies, architectures, and protocols for computer communications, August 19-23, 2002, Pittsburgh, Pennsylvania, USA
|
| |
6
|
Collab. Net and Sun Microsystems, 2003. http://www.jxta.org.
|
| |
7
|
Z. Ge, D. R. Figueiredo, S. Jaiswal, J. Kurose, and D. Towsley, Modeling Peer-Peer File Sharing Systems, Proc. IEEE Conference on Computer Communications (INFOCOM '03), San Francisco, CA, 2003.
|
| |
8
|
Gnutella Developer Forum, Gnutella - A Protocol for a Revolution, 2003. http://rfc-gnutella.sourceforge.net.
|
 |
9
|
Krishna P. Gummadi , Richard J. Dunn , Stefan Saroiu , Steven D. Gribble , Henry M. Levy , John Zahorjan, Measurement, modeling, and analysis of a peer-to-peer file-sharing workload, Proceedings of the nineteenth ACM symposium on Operating systems principles, October 19-22, 2003, Bolton Landing, NY, USA
|
| |
10
|
MaxMind, LLC, Geotargeting IP Address, http://www.maxmind.com.
|
| |
11
|
|
| |
12
|
Mutella Hompage. http://mutella.sourceforge.net.
|
| |
13
|
Napster Homepage. http://www.napster.com.
|
 |
14
|
Sylvia Ratnasamy , Paul Francis , Mark Handley , Richard Karp , Scott Schenker, A scalable content-addressable network, Proceedings of the 2001 conference on Applications, technologies, architectures, and protocols for computer communications, p.161-172, August 2001, San Diego, California, United States
|
 |
15
|
Stefan Saroiu , Krishna P. Gummadi , Richard J. Dunn , Steven D. Gribble , Henry M. Levy, An analysis of internet content delivery systems, Proceedings of the 5th symposium on Operating systems design and implementation Due to copyright restrictions we are not able to make the PDFs for this conference available for downloading, December 09-11, 2002, Boston, Massachusetts
[doi> 10.1145/1060289.1060319]
|
| |
16
|
S. Saroiu, K. Gummadi, and S. Gribble, A Measurement Study of Peer-to-Peer File Sharing Systems, Proc. Multimedia Computing and Networking (MMCN '02), San Jose, CA, 2002.
|
 |
17
|
|
| |
18
|
Sharman Networks Ltd., http://www. kazaa. org.
|
 |
19
|
Ion Stoica , Robert Morris , David Karger , M. Frans Kaashoek , Hari Balakrishnan, Chord: A scalable peer-to-peer lookup service for internet applications, Proceedings of the 2001 conference on Applications, technologies, architectures, and protocols for computer communications, p.149-160, August 2001, San Diego, California, United States
|
| |
20
|
K. Sripanidkulchai, The Popularity of Gnutella Queries and its Implications on Scalability, Featured on O'Reilly's www. openp2p. com website, February 2001.
|
| |
21
|
StreamCast Networks, http://www. morpheus.com.
|
| |
22
|
|
CITED BY 7
|
|
|
|
|
|
|
|
Naimul Basher , Aniket Mahanti , Anirban Mahanti , Carey Williamson , Martin Arlitt, A comparative analysis of web and peer-to-peer traffic, Proceeding of the 17th international conference on World Wide Web, April 21-25, 2008, Beijing, China
|
|
|
|
|
|
|
|
|
|