ACM Home Page
Please provide us with feedback. Feedback
On the bias of traceroute sampling: or, power-law degree distributions in regular graphs
Full text PdfPdf (235 KB)
Source Annual ACM Symposium on Theory of Computing archive
Proceedings of the thirty-seventh annual ACM symposium on Theory of computing table of contents
Baltimore, MD, USA
SESSION: Session 14B table of contents
Pages: 694 - 703  
Year of Publication: 2005
ISBN:1-58113-960-8
Authors
Dimitris Achlioptas  Microsoft Corporation, Redmond, WA
Aaron Clauset  University of New Mexico, Albuquerque, NM
David Kempe  University of Southern California, Los Angeles, CA
Cristopher Moore  University of New Mexico, Albuquerque, NM
Sponsors
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 10,   Downloads (12 Months): 65,   Citation Count: 7
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1060590.1060693
What is a DOI?

ABSTRACT

Understanding the structure of the Internet graph is a crucial step for building accurate network models and designing efficient algorithms for Internet applications. Yet, obtaining its graph structure is a surprisingly difficult task, as edges cannot be explicitly queried. Instead, empirical studies rely on traceroutes to build what are essentially single-source, all-destinations, shortest-path trees. These trees only sample a fraction of the network's edges, and a recent paper by Lakhina et al. found empirically that the resuting sample is intrinsically biased. For instance, the observed degree distribution under traceroute sampling exhibits a power law even when the underlying degree distribution is Poisson.In this paper, we study the bias of traceroute sampling systematically, and, for a very general class of underlying degree distributions, calculate the likely observed distributions explicitly. To do this, we use a continuous-time realization of the process of exposing the BFS tree of a random graph with a given degree distribution, calculate the expected degree distribution of the tree, and show that it is sharply concentrated. As example applications of our machinery, we show how traceroute sampling finds power-law degree distributions in both δ-regular and Poisson-distributed random graphs. Thus, our work puts the observations of Lakhina et al. on a rigorous footing, and extends them to nearly arbitrary degree distributions.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
L. Amini, A. Shaikh, and H. Schulzrinne. Issues with inferring Internet topological attributes. In SPIE IT-Com, 2002.
2
 
3
B. Bollobás. Random graphs. Academic Press, 1985.
 
4
 
5
Q. Chen, H. Chang, R. Govindan, S. Jamin, S. J. Shenker, and W. Willinger. The origin of power laws in Internet topologies revisited. In Proc. 21st ACM SIGCOMM Conference, 2002.
 
6
A. Clauset and C. Moore. Accuracy and scaling phenomena in Internet mapping. Physical Review Letters, 94:018701, 2005.
 
7
P. Erdős and A. Rényi. On the evolution of random graphs. Publ. Math. Inst. Hung. Acad. Sci., 5:17--61, 1960.
 
8
9
 
10
R. Govindan and H. Tangmunarunkit. Heuristics for Internet map discovery. In Proc. 19th ACM SIGCOMM Conference, 2000.
 
11
W. Hoeffding. Probability inequalities for sums of bounded random variables. J. of the American Statistical Association, 58:13--30, 1963.
 
12
J. H. Kim. Poisson cloning model for random graphs. Preprint, 2004.
 
13
J. Kleinberg, S. Kumar, P. Raghavan, S. Rajagopalan, and A. Tomkins. The web as a graph: Measurements, models and methods. In Intl. Conf. on Combinatorics and Computing, 1999.
 
14
A. Lakhina, J. Byers, M. Crovella, and P. Xie. Sampling biases in IP topology measurements. In Proc. 22nd ACM SIGCOMM Conference, 2003.
 
15
J. Leguay, M. Latapy, T. Friedman, and K. Salamatian. Describing and simulating internet routes. Preprint cs.NI/0411051, 2004.
 
16
C. McDiarmid. Concentration. In M. Habib, C. McDiarmid, J. Ramirez-Alfonsin, and B. Reed, editors, Probabilistic Methods for Algorithmic Discrete Mathematics, pages 195--247. Springer, 1998.
 
17
M. Mihail, C. Papadimitriou, and A. Saberi. On certain connectivity properties of the Internet topology. In Proc. 35th ACM Symp. on Theory of Computing, 2003.
 
18
 
19
 
20
21
 
22
T. Petermann and P. De Los Rios. Exploration of scale-free networks: do we measure the real exponents? Euro. Phys. Journal B, 38:201, 2004.
 
23
 
24
J. Spanier and K.B. Oldham. The exponential integral ei(x) and related functions. In An Atlas of Functions, chapter 37, pages 351--360. Hemisphere, 1987.
 
25
 
26
N.C. Wormald. Models of random regular graphs. In J. Lamb and D. Preece, editors, London Math. Soc. Lecture Note Series, volume 276, pages 239--298. Cambridge University Press, 1999.


Collaborative Colleagues:
Dimitris Achlioptas: colleagues
Aaron Clauset: colleagues
David Kempe: colleagues
Cristopher Moore: colleagues