ACM Home Page
Please provide us with feedback. Feedback
What's new: finding significant differences in network data streams
Full text PdfPdf (611 KB)
Source IEEE/ACM Transactions on Networking (TON) archive
Volume 13 ,  Issue 6  (December 2005) table of contents
Pages: 1219 - 1232  
Year of Publication: 2005
ISSN:1063-6692
Authors
Graham Cormode  Bell Laboratories, Murray Hill, NJ and Center for Discrete Mathematics and Computer Science (DIMACS), Rutgers University, Piscataway, NJ
S. Muthukrishnan  Division of Computer and Information Systems, Rutgers University, Piscataway, NJ
Publisher
IEEE Press  Piscataway, NJ, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 59,   Citation Count: 2
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: 10.1109/TNET.2005.860096

ABSTRACT

Monitoring and analyzing network traffic usage patterns is vital for managing IP Networks. An important problem is to provide network managers with information about changes in traffic, informing them about "what's new." Specifically, we focus on the challenge of finding significantly large differences in traffic: over time, between interfaces and between routers. We introduce the idea of a deltoid: an item that has a large difference, whether the difference is absolute, relative or variational.We present novel algorithms for finding the most significant deltoids in high-speed traffic data, and prove that they use small space, very small time per update, and are guaranteed to find significant deltoids with pre-specified accuracy. In experimental evaluation with real network traffic, our algorithms perform well and recover almost all deltoids. This is the first work to provide solutions capable of working over the data with one pass, at network traffic speeds.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
{1} Internet Traffic Achive {Online}. Available: http://ita.ee.lbl.gov/
 
2
{2} Proc. Workshop on Management and Processing of Data Streams (MPDS), San Diego, CA, Jun. 2003.
3
4
5
 
6
 
7
{7} G. Cormode, M. Datar, P. Indyk, and S. Muthukrishnan, "Comparing data streams using Hamming norms," in Proc. 28th Int. Conf. Very Large Data Bases, 2002, pp. 335-345.
 
8
{8} G. Cormode, F. Korn, S. Muthukrishnan, and D. Srivastava, "Finding hierarchical heavy hitters in data streams," in Proc. 29th Int. Conf. Very Large Databases, 2003, pp. 464-475.
 
9
{9} G. Cormode and S. Muthukrishnan, "Estimating dominance norms of multiple data streams," in Proc. 11th Eur. Symp. Algorithms (ESA), Lecture Notes in Computer Science vol. 2838, 2003, pp. 148-160.
10
 
11
 
12
 
13
{13} D.-Z. Du and F. K. Hwang, Combinatorial Group Testing and its Applications . Singapore: World Scientific, 1993, vol. 3, Series on Applied Mathematics.
14
15
 
16
{16} C. Estan and G. Varghese, "Data streaming in computer networking," in Proc. Workshop on Management and Processing of Data Streams (MPDS), San Diego, CA, 2003. {Online.} http://www.research.att.com/conf/mpds2003/schedule/estan V.ps.
17
 
18
19
20
 
21
{21} A. Gilbert, Y. Kotidis, S. Muthukrishnan, and M. Strauss, "QuickSAND: Quick Summary and Analysis of Network Data," DIMACS, Tech. Repo. 2001-43, 2001.
 
22
{22} M. Henzinger, "Algorithmic challenges in search engines," Internet Math., vol. 1, no. 1, pp. 115-126, 2003.
 
23
24
25
 
26
 
27
{27} G. S. Manku and R. Motwani, "Approximate frequency counts over data streams," in Proc. 28th Int. Conf. Very Large Data Bases, 2002, pp. 346-357.
 
28
 
29
 
30
 
31
{31} G. Varghese, "Detecting packet patterns at high speeds," presented at the ACM SIGCOMM, Tutorial, Pittsburgh, PA, 2002.
32


Collaborative Colleagues:
Graham Cormode: colleagues
S. Muthukrishnan: colleagues