|
ABSTRACT
Monitoring and analyzing network traffic usage patterns is vital for managing IP Networks. An important problem is to provide network managers with information about changes in traffic, informing them about "what's new." Specifically, we focus on the challenge of finding significantly large differences in traffic: over time, between interfaces and between routers. We introduce the idea of a deltoid: an item that has a large difference, whether the difference is absolute, relative or variational.We present novel algorithms for finding the most significant deltoids in high-speed traffic data, and prove that they use small space, very small time per update, and are guaranteed to find significant deltoids with pre-specified accuracy. In experimental evaluation with real network traffic, our algorithms perform well and recover almost all deltoids. This is the first work to provide solutions capable of working over the data with one pass, at network traffic speeds.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
{1} Internet Traffic Achive {Online}. Available: http://ita.ee.lbl.gov/
|
| |
2
|
{2} Proc. Workshop on Management and Processing of Data Streams (MPDS), San Diego, CA, Jun. 2003.
|
 |
3
|
Noga Alon , Phillip B. Gibbons , Yossi Matias , Mario Szegedy, Tracking join and self-join sizes in limited storage, Proceedings of the eighteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, p.10-20, May 31-June 03, 1999, Philadelphia, Pennsylvania, United States
[doi> 10.1145/303976.303978]
|
 |
4
|
Noga Alon , Yossi Matias , Mario Szegedy, The space complexity of approximating the frequency moments, Proceedings of the twenty-eighth annual ACM symposium on Theory of computing, p.20-29, May 22-24, 1996, Philadelphia, Pennsylvania, United States
[doi> 10.1145/237814.237823]
|
 |
5
|
Brian Babcock , Shivnath Babu , Mayur Datar , Rajeev Motwani , Jennifer Widom, Models and issues in data stream systems, Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, June 03-05, 2002, Madison, Wisconsin
[doi> 10.1145/543613.543615]
|
| |
6
|
|
| |
7
|
{7} G. Cormode, M. Datar, P. Indyk, and S. Muthukrishnan, "Comparing data streams using Hamming norms," in Proc. 28th Int. Conf. Very Large Data Bases, 2002, pp. 335-345.
|
| |
8
|
{8} G. Cormode, F. Korn, S. Muthukrishnan, and D. Srivastava, "Finding hierarchical heavy hitters in data streams," in Proc. 29th Int. Conf. Very Large Databases, 2003, pp. 464-475.
|
| |
9
|
{9} G. Cormode and S. Muthukrishnan, "Estimating dominance norms of multiple data streams," in Proc. 11th Eur. Symp. Algorithms (ESA), Lecture Notes in Computer Science vol. 2838, 2003, pp. 148-160.
|
 |
10
|
|
| |
11
|
|
| |
12
|
|
| |
13
|
{13} D.-Z. Du and F. K. Hwang, Combinatorial Group Testing and its Applications . Singapore: World Scientific, 1993, vol. 3, Series on Applied Mathematics.
|
 |
14
|
Cristian Estan , Stefan Savage , George Varghese, Automatically inferring patterns of resource consumption in network traffic, Proceedings of the 2003 conference on Applications, technologies, architectures, and protocols for computer communications, August 25-29, 2003, Karlsruhe, Germany
[doi> 10.1145/863955.863972]
|
 |
15
|
|
| |
16
|
{16} C. Estan and G. Varghese, "Data streaming in computer networking," in Proc. Workshop on Management and Processing of Data Streams (MPDS), San Diego, CA, 2003. {Online.} http://www.research.att.com/conf/mpds2003/schedule/estan V.ps.
|
 |
17
|
|
| |
18
|
|
 |
19
|
|
 |
20
|
Anna C. Gilbert , Sudipto Guha , Piotr Indyk , Yannis Kotidis , S. Muthukrishnan , Martin J. Strauss, Fast, small-space algorithms for approximate histogram maintenance, Proceedings of the thiry-fourth annual ACM symposium on Theory of computing, May 19-21, 2002, Montreal, Quebec, Canada
[doi> 10.1145/509907.509966]
|
| |
21
|
{21} A. Gilbert, Y. Kotidis, S. Muthukrishnan, and M. Strauss, "QuickSAND: Quick Summary and Analysis of Network Data," DIMACS, Tech. Repo. 2001-43, 2001.
|
| |
22
|
{22} M. Henzinger, "Algorithmic challenges in search engines," Internet Math., vol. 1, no. 1, pp. 115-126, 2003.
|
| |
23
|
|
 |
24
|
|
 |
25
|
Balachander Krishnamurthy , Subhabrata Sen , Yin Zhang , Yan Chen, Sketch-based change detection: methods, evaluation, and applications, Proceedings of the 3rd ACM SIGCOMM conference on Internet measurement, October 27-29, 2003, Miami Beach, FL, USA
[doi> 10.1145/948205.948236]
|
| |
26
|
|
| |
27
|
{27} G. S. Manku and R. Motwani, "Approximate frequency counts over data streams," in Proc. 28th Int. Conf. Very Large Data Bases, 2002, pp. 346-357.
|
| |
28
|
|
| |
29
|
|
| |
30
|
|
| |
31
|
{31} G. Varghese, "Detecting packet patterns at high speeds," presented at the ACM SIGCOMM, Tutorial, Pittsburgh, PA, 2002.
|
 |
32
|
|
|