skip to main content
10.1145/2983323.2983344acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

A Self-Learning and Online Algorithm for Time Series Anomaly Detection, with Application in CPU Manufacturing

Published:24 October 2016Publication History

ABSTRACT

The problem of anomaly detection in time series has received a lot of attention in the past two decades. However, existing techniques cannot locate where the anomalies are within anomalous time series, or they require users to provide the length of potential anomalies. To address these limitations, we propose a self-learning online anomaly detection algorithm that automatically identifies anomalous time series, as well as the exact locations where the anomalies occur in the detected time series. We evaluate our approach on several real datasets, including two CPU manufacturing data from Intel. We demonstrate that our approach can successfully detect the correct anomalies without requiring any prior knowledge about the data.

References

  1. C. C. Aggarwal and S. Y. Philip. On clustering massive text and categorical data streams. Knowledge and information systems, 24(2):171--196, 2010.Google ScholarGoogle Scholar
  2. N. Begum, L. Ulanova, J. Wang, and E. Keogh. Accelerating dynamic time warping clustering with a novel admissible pruning strategy. In KDD, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. S. Budalakoti, A. N. Srivastava, R. Akella, and E. Turkov. Anomaly detection in large sets of high-dimensional symbol sequences. Tech. Rep, 2006.Google ScholarGoogle Scholar
  4. V. Chandola, D. Cheboli, and V. Kumar. Detecting anomalies in a time series database. Tech. Rep., 2009.Google ScholarGoogle Scholar
  5. M. Gupta, J. Gao, C. Aggarwal, and J. Han. Outlier detection for temporal data. Synthesis Lectures on Data Mining and Knowledge Discovery, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. D. M. Hawkins. Identification of outliers, volume 11. Springer, 1980.Google ScholarGoogle ScholarCross RefCross Ref
  7. Z. He, X. Xu, and S. Deng. Discovering cluster-based local outliers. Pattern Recognition Letters, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. R. J. Hyndman, E. Wang, and N. Laptev. Large-scale unusual time series detection. In Proceedings of International Conference on Data Mining series, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. H. Izakian and W. Pedrycz. Anomaly detection and characterization in spatial time series data: A cluster-centric approach. IEEE.T.Fuzzy Syst., 2014.Google ScholarGoogle Scholar
  10. P. Jaccard. The distribution of the flora in the alpine zone. New phytologist, 11(2):37--50, 1912.Google ScholarGoogle ScholarCross RefCross Ref
  11. E. Keogh, K. Chakrabarti, M. Pazzani, and S. Mehrotra. Dimensionality reduction for fast similarity search in large time series databases. Knowledge and information Systems, 2001.Google ScholarGoogle Scholar
  12. E. Keogh and J. Lin. Clustering of time-series subsequences is meaningless: implications for previous and future research. Knowl. and Inf. Syst., 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. E. Keogh, J. Lin, and A. Fu. Hot sax: Efficiently finding the most unusual time series subsequence. In ICDM, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. E. Keogh, J. Lin, S.-H. Lee, and H. Van Herle. Finding the most unusual time series subsequence: algorithms and applications. Knowl. and Inf. Syst., 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. N. Laptev, S. Amizadeh, and I. Flint. Generic and scalable framework for automated time-series anomaly detection. In KDD, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Y. Li, J. Lin, and T. Oates. Visualizing variable-length time series motifs. In SDM, pages 895--906, 2012.Google ScholarGoogle ScholarCross RefCross Ref
  17. J. Lin, E. Keogh, L. Wei, and S. Lonardi. Experiencing SAX: a novel symbolic representation of time series. Data Mining and knowledge discovery, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. C. G. Nevill-Manning and I. H. Witten. Identifying hierarchical structure in sequences: A linear-time algorithm. J. Artif. Intell. Res.(JAIR), 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. A. Pires and C. Santos-Pereira. Using clustering and robust estimators to detect outliers in multivariate data. In the Int'l. Conf. on Robust Stats., 2005.Google ScholarGoogle Scholar
  20. F. Pukelsheim. The three sigma rule. The American Statistician, 48(2):88--91, 1994.Google ScholarGoogle Scholar
  21. P. Senin, J. Lin, X. Wang, T. Oates, S. Gandhi, A. P. Boedihardjo, C. Chen, and S. Frankenstein. Time series anomaly discovery with grammar-based compression. In EDBT, pages 481--492, 2015.Google ScholarGoogle Scholar
  22. P. Senin, J. Lin, X. Wang, T. Oates, S. Gandhi, A. P. Boedihardjo, C. Chen, S. Frankenstein, and M. Lerner. Grammarviz 2.0: a tool for grammar-based pattern discovery in time series. In ECML/PKDD, pages 468--472. Springer, 2014.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. K. Sequeira and M. Zaki. Admit: anomaly-based data mining for intrusions. In KDD, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. H. Sun, Y. Bao, F. Zhao, G. Yu, and D. Wang. Cd-trees: An efficient index structure for outlier detection. In WAIM. Springer, 2004.Google ScholarGoogle Scholar
  25. H. Wang, M. Tang, Y.-S. Park, and C. E. Priebe. Locality statistics for anomaly detection in time series of graphs. Sig. Pro., IEEE Trans. on, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. X. Wang, Y. Gao, J. Lin, H. Rangwala, and R. Mittu. A machine learning approach to false alarm detection for critical arrhythmia alarms. In ICMLA, 2015.Google ScholarGoogle ScholarCross RefCross Ref
  27. L. Wei, E. Keogh, and X. Xi. Saxually explicit images: finding unusual shapes. In ICDM, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Y. Xie, J. Huang, and R. Willett. Change-point detection for high-dimensional time series with missing data. J. Sel. Top. Signal Process., 2013.Google ScholarGoogle ScholarCross RefCross Ref
  29. Y. Zhang, N. Meratnia, and P. Havinga. Outlier detection techniques for wireless sensor networks: A survey. Com. Surveys & Tutorials, IEEE, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. A Self-Learning and Online Algorithm for Time Series Anomaly Detection, with Application in CPU Manufacturing

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          CIKM '16: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management
          October 2016
          2566 pages
          ISBN:9781450340731
          DOI:10.1145/2983323

          Copyright © 2016 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 24 October 2016

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          CIKM '16 Paper Acceptance Rate160of701submissions,23%Overall Acceptance Rate1,861of8,427submissions,22%

          Upcoming Conference

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader