ACM Home Page
Please provide us with feedback. Feedback
Detection and prediction of distance-based outliers
Full text PdfPdf (11.41 MB)
Source Symposium on Applied Computing archive
Proceedings of the 2005 ACM symposium on Applied computing table of contents
Santa Fe, New Mexico
SESSION: Data mining (DM) table of contents
Pages: 537 - 542  
Year of Publication: 2005
ISBN:1-58113-964-0
Authors
Fabrizio Angiulli  ICAR-CNR, Via Pietro Bucci, Rende (CS), Italy
Stefano Basta  ICAR-CNR, Via Pietro Bucci, Rende (CS), Italy
Clara Pizzuti  ICAR-CNR, Via Pietro Bucci, Rende (CS), Italy
Sponsor
SIGAPP: ACM Special Interest Group on Applied Computing
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 10,   Downloads (12 Months): 69,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1066677.1066797
What is a DOI?

ABSTRACT

In this paper we present an unsupervised distance-based outlier detection method designed to learn a model over the objects contained in a data set. The learned model, called solving set, is a small subset of the data set that is used to classify new unseen objects as outliers or not. We provide an algorithm that computes a solving set with sub-quadratic time requirements, and we give experimental evidence that the computed solving set is small and that the false positive rate, i.e. the fraction of new objects misclassified as outliers using the solving set instead of the overall data set, is negligible.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
A. Arning, C. Aggarwal, and P. Raghavan. A linear method for deviation detection in large databases. In Proc. Int. Conf. on Knowledge Discovery and Data Mining(KDD'96), pages 164--169, 1996.
 
3
V. Barnett and T. Lewis. Outliers in Statistical Data. John Wiley & Sons, 1994.
4
5
 
6
E. Eskin, A. Arnold, M. Prerau, L. Portnoy, and S. Stolfo. A geometric framework for unsupervised anomaly detection. In Applications of Data Mining in Computer Security, Kluwer, 2002.
 
7
 
8
 
9
10
 
11
 
12
 
13
A. Lazarevic, L. Ertoz, V. Kumar, A. Ozgur, and J. Srivastava. A comparative study of anomaly detection schemes in network intrusion detection. In Proc. SIAM Int. Conf. on Data Mining, 2003.
 
14
W. Lee, S. J. Stolfo, and K. W. Mok. Mining audit data to build intrusion detection models. In Proc. Int. Conf on Knowledge Discovery and Data Mining (KDD-98), pages 66--72, 1998.
15
16

Collaborative Colleagues:
Fabrizio Angiulli: colleagues
Stefano Basta: colleagues
Clara Pizzuti: colleagues