skip to main content
10.1145/2632048.2632061acmconferencesArticle/Chapter ViewAbstractPublication PagesubicompConference Proceedingsconference-collections
research-article

Mining multivariate time series with mixed sampling rates

Published:13 September 2014Publication History

ABSTRACT

Fitting sensors to humans and physical structures is becoming more and more common. These developments provide many opportunities for ubiquitous computing, as well as challenges for analyzing the resulting sensor data. From these challenges, an underappreciated problem arises: modeling multivariate time series with mixed sampling rates. Although mentioned in several application papers using sensor systems, this problem has been left almost unexplored, often hidden in a preprocessing step or solved manually as a one-pass procedure (feature extraction/construction). This leaves an opportunity to formalize and develop methods that address mixed sampling rates in an automatic fashion.

We approach the problem of dealing with multiple sampling rates from an aggregation perspective. We propose Accordion, a new embedded method that constructs and selects aggregate features iteratively, in a memory-conscious fashion. Our algorithms work on both classification and regression problems. We describe three experiments on real-world time series datasets, with satisfying results.

Skip Supplemental Material Section

Supplemental Material

p413-cachucho.mov

mov

159.2 MB

References

  1. Angelini, E., Henry, J., and Marcellino, M. Interpolation and backdating with a large information set. Journal of Economic Dynamics and Control 30, 12 (Dec. 2006), 2693--2724.Google ScholarGoogle ScholarCross RefCross Ref
  2. Armesto, M. Forecasting with mixed frequencies. Federal Reserve Bank of St. Louis (2010), 521--536.Google ScholarGoogle Scholar
  3. Bao, L., and Intille, S. Activity recognition from user-annotated acceleration data. Pervasive Computing (2004), 1--17.Google ScholarGoogle Scholar
  4. Brush, A., Krumm, J., Scott, J., and Saponas, T. Recognizing Activities from Mobile Sensor Data: Challenges and Opportunities. In Proceedings Ubicomp' 11 (2011).Google ScholarGoogle Scholar
  5. Figo, D., Diniz, P. C., Ferreira, D. R., and Cardoso, J. a. M. P. Preprocessing techniques for context recognition from accelerometer data. Personal and Ubiquitous Computing 14, 7 (Mar. 2010), 645--662. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Friedman, J., Hastie, T., and Tibshirani, R. Regularization paths for generalized linear models via coordinate descent. Journal of Statistical Software (2010).Google ScholarGoogle Scholar
  7. Ghysels, E., Santa-Clara, P., and Valkanov, R. Predicting volatility: getting the most out of return data sampled at different frequencies. Journal of Econometrics (2006).Google ScholarGoogle Scholar
  8. Gujarati, D. Basic econometrics, fourth ed. McGraw Hill, 2003.Google ScholarGoogle Scholar
  9. Hastie, T., Tibshirany, R., and Friedman, J. The Elements of Statistical Learning Data Mining, Inference, and Prediction, second ed. Springer, 2009.Google ScholarGoogle Scholar
  10. Hobson, J. A. Rem sleep and dreaming: towards a theory of protoconsciousness. Nature Reviews Neuroscience 10, 11 (2009), 803--813.Google ScholarGoogle ScholarCross RefCross Ref
  11. Huynh, T., and Schiele, B. Analyzing features for activity recognition. In Smart objects and ambient intelligence: innovative context-aware services: usages and technologies, no. 10 (2005). Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. John, G., Kohavi, R., and Pfleger, K. Irrelevant features and the subset selection problem. In Machine Learning (1994), 121--129.Google ScholarGoogle Scholar
  13. Keogh, E., and Kasetty, S. On the need for time series data mining benchmarks: a survey and empirical demonstration. Data Mining and Knowledge Discovery (2002), 349--371. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Kuzin, V., Marcellino, M., and Schumacher, C. MIDAS vs. mixed-frequency VAR: Nowcasting GDP in the euro area. International Journal of Forecasting 27, 2 (Apr. 2011), 529--542.Google ScholarGoogle ScholarCross RefCross Ref
  15. Kwapisz, J., Weiss, G., and Moore, S. Activity recognition using cell phone accelerometers. ACM SIGKDD Explorations Newsletter (2011). Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Lin, J., Keogh, E., Lonardi, S., and Chiu, B. A symbolic representation of time series, with implications for streaming algorithms. Proceedings of the 8th ACM SIGMOD workshop on Research issues in data mining and knowledge discovery - DMKD '03 (2003), 2. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Makridakis, S., Wheelwright, S. C., and Hyndman, R. J. Forecasting methods and applications. John Wiley & Sons, 1998.Google ScholarGoogle Scholar
  18. Miluzzo, E., Lane, N., and Fodor, K. Sensing meets mobile social networks: the design, implementation and evaluation of the CenceMe application. In Proceedings of Embedded network sensor systems (2008), 337--350. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Park, J.-g., Patel, A., Curtis, D., Teller, S., and Ledlie, J. Online pose classification and walking speed estimation using handheld devices. Proceedings of the 2012 ACM Conference on Ubiquitous Computing - UbiComp '12 (2012), 10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Quinlan, J. Induction of decision trees. Machine learning (1986), 81--106. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Stodden, V. Model selection when the number of variables exceeds the number of observations. PhD thesis, 2006.Google ScholarGoogle Scholar
  22. Tibshirani, R. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society, Series B 58 (1994).Google ScholarGoogle Scholar
  23. Vespier, U., Knobbe, A., Nijssen, S., and Vanschoren, J. MDL-Based Analysis of Time Series at Multiple Time-Scales. In Proceedings of ECML PKDD '12 (2012). Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Mining multivariate time series with mixed sampling rates

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          UbiComp '14: Proceedings of the 2014 ACM International Joint Conference on Pervasive and Ubiquitous Computing
          September 2014
          973 pages
          ISBN:9781450329682
          DOI:10.1145/2632048

          Copyright © 2014 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 13 September 2014

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          Overall Acceptance Rate764of2,912submissions,26%

          Upcoming Conference

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader