skip to main content
10.1145/1401890.1401911acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Semi-supervised learning with data calibration for long-term time series forecasting

Published: 24 August 2008 Publication History

Abstract

Many time series prediction methods have focused on single step or short term prediction problems due to the inherent difficulty in controlling the propagation of errors from one prediction step to the next step. Yet, there is a broad range of applications such as climate impact assessments and urban growth planning that require long term forecasting capabilities for strategic decision making. Training an accurate model that produces reliable long term predictions would require an extensive amount of historical data, which are either unavailable or expensive to acquire. For some of these domains, there are alternative ways to generate potential scenarios for the future using computer-driven simulation models, such as global climate and traffic demand models. However, the data generated by these models are currently utilized in a supervised learning setting, where a predictive model trained on past observations is used to estimate the future values. In this paper, we present a semi-supervised learning framework for long-term time series forecasting based on Hidden Markov Model Regression. A covariance alignment method is also developed to deal with the issue of inconsistencies between historical and model simulation data. We evaluated our approach on data sets from a variety of domains, including climate modeling. Our experimental results demonstrate the efficacy of the approach compared to other supervised learning methods for long-term time series forecasting.

References

[1]
http://www.ccsn.ca/, canadian climate change scenarios network, environment canada.
[2]
P. M. Baggenstos. A modified Baum-Welch algorithm for hidden markov models with multiple observation spaces. IEEE Trans. on Speech Audio Processing, pages 411--416, 2001.
[3]
A. Blum and S. Chawla. Learning from labeled and unlabeled data using graph mincuts. In Proc. of the 18th Int'l Conf. on Machine Learning, pages 19--26, 2001.
[4]
A. Blum and T. Mitchell. Combining labeled and unlabeled data with co-training. In Proc. of the Workshop on Computational Learning Theory, pages 92--100, 1998.
[5]
Y.-A. L. Borgne, S. Santini, and G. Bontempi. Adaptive model selection for time series prediction in wireless sensor networks. Signal Process, 87(12):3010--3020, 2007.
[6]
U. Brefeld, T. Gärtner, T. Scheffer, and S. Wrobel. Efficient co-regularised least squares regression. In Proc. of the 23rd Int'l Conf. on Machine learning, pages 137--144, 2006.
[7]
G. Celeux and J. Durand. Selecting hidden markov model state number with cross-validated likelihood. In Computational Statistics, 2007.
[8]
S. Charles, B. Bates, I. Smith, and J. Hughes. Statistical downscaling of daily precipitation from observed and modelled atmospheric fields. In Hydrological Processes, pages 1373--1394, 2004.
[9]
H. Cheng, P.-N. Tan, J. Gao, and J. Scripps. Multistep-ahead time series prediction. In Proc. of the Pacific-Asia Conf on Knowledge Discovery and Data Mining, pages 765--774, 2006.
[10]
I. Cohen, N. Sebe, F. G. Cozman, M. C. Cirelo, and T. S. Huang. Semi-supervised learning of classifiers: Theory and algorithms for bayesian network classifiers and applications to human-computer interaction. IEEE Trans. on Pattern Analysis and Machine Intelligence, 26(12):1553--1566, Dec 2004.
[11]
C. Cortes and M. Mohri. On transductive regression. In Advances in Neural Information Processing Systems, 2006.
[12]
F. Cozman and I. Cohen. Unlabeled data can degrade classification performance of generative classifiers. In Proc. of the 15th Int'l Florida Artificial Intelligence Society Conference, pages 327--331, 2002.
[13]
F. Cozman, I. Cohen, and M. Cirelo. Semi-supervised learning of mixture models. In Proc of the 20th Int'l Conf. on Machine Learning, 2003.
[14]
W. Enke and A. Spekat. Downscaling climate model outputs into local and regional weather elements by classification and regression. In Climate Research 8, pages 195--207, 1997.
[15]
K. Fujinaga, M. Nakai, H. Shimodaira, and S. Sagayama. Multiple-regression hidden markov model. In Proc. of IEEE Int'l Conf. on Acoustics, Speech, and Signal Processing, 2001.
[16]
C. Giles, S. Lawrence, and A. Tsoi. Noisy time series prediction using a recurrent neural network and grammatical inference. Machine Learning, 44(1-2), pages 161--183, 2001.
[17]
T. Hastie and C. Loader. Local regression: Automatic kernel carpentry. In Statistical Science, pages 120--143, 1993.
[18]
W. Hong, P. Pai, S. Yang, and R. Theng. Highway traffic forecasting by support vector regression model with tabu search algorithms. In Proc. of Int'l Joint Conf. on Neural Networks, pages 1617--1621, 2006.
[19]
T. Joachims. Transductive inference for text classification using support vector machines. In Proc. of the 16th Int'l Conf. on Machine Learning, pages 200--209, Bled, SL, 1999.
[20]
B. Kedem and K. Fokianos. Regression models for time series analysis. Wiley-Interscience ISBN: 0-471-36355, 2002.
[21]
E. Keogh and T. Folias. Uc riverside time series data mining archive. http://www.cs.ucr.edu/ eamonn/TSDMA/index.html.
[22]
C. Leggetter and P. Woodland. Maximum likelihood linear regression for speaker adaptation of continuous density hidden markov models. In Computer Speech and Language, pages 171--185(15). Academic Press, 1995.
[23]
A. Ober-Sundermeier and H. Zackor. Prediction of congestion due to road works on freeways. In Proc. of IEEE Intelligent Transportation Systems, pages 240--244, 2001.
[24]
A. Smola and B. Scholkopf. A tutorial on support vector regression. In Statistics and Computing, pages 199--222(24). Spring, 2004.
[25]
Q. Tian, J. Yu, Q. Xue, and N. Sebe. A new analysis of the value of unlabeled data in semi-supervised learning for image retrieval. In Proc. of IEEE Int'l Conf. on Multimedia and Expo., pages 1019--1022, 2004.
[26]
M. Wang, X.-S. Hua, Y. Song, L.-R. Dai, and H.-J. Zhang. Semi-supervised kernel regression. In Proc. of the 6th Int'l Conf. on Data Mining, pages 1130--1135, Washington, DC, USA, 2006.
[27]
R. Wilby, S. Charles, E. Zorita, B. Timbal, P. Whetton, and L. Mearns. Guidelines for use of climate scenarios developed from statistical downscaling methods. Available from the DDC of IPCC TGCIA, 2004.
[28]
C.-C. C. Wong, M.-C. Chan, and C.-C. Lam. Financial time series forecasting by neural network using conjugate gradient learning algorithm and multiple linear regression weight initialization. Technical Report 61, Society for Computational Economics, Jul 2000.
[29]
D. Zhou, O. Bousquet, T. Lal, J. Weston, and B. Schölkopf. Learning with local and global consistency. In Advances in Neural Information Processing Systems 16, 2003.
[30]
Z. Zhou and M. Li. Semi-supervised regression with co-training. In Proc. of Int'l Joint Conf. on Artificial Intelligence, 2005.
[31]
X. Zhu. Semi-supervised learning literature survey. In Technical Report,Computer Sciences, University of Wisconsin-Madison, 2005.
[32]
X. Zhu, Z. Ghahramani, and J. Lafferty. Semi-supervised learning using gaussian fields and harmonic functions. In Proc. of the 20th Int'l Conf. on Machine Learning, volume 20, 2003.
[33]
X. Zhu and A. Goldberg. Kernel regression with order preferences. In Association for the Advancement of Artificial Intelligence, page 681, 2007.

Cited By

View all
  • (2022)Time Series Prediction via Similarity Search: Exploring Invariances, Distance Measures and Ensemble FunctionsIEEE Access10.1109/ACCESS.2022.319284910(78022-78043)Online publication date: 2022
  • (2018)Multi-step Time Series Forecasting of Electric Load Using Machine Learning ModelsArtificial Intelligence and Soft Computing10.1007/978-3-319-91253-0_15(148-159)Online publication date: 11-May-2018
  • (2016)QUEST: Eliminating Online Supervised Learning for Efficient Classification AlgorithmsSensors10.3390/s1610162916:10(1629)Online publication date: 1-Oct-2016
  • Show More Cited By

Index Terms

  1. Semi-supervised learning with data calibration for long-term time series forecasting

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    KDD '08: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
    August 2008
    1116 pages
    ISBN:9781605581934
    DOI:10.1145/1401890
    • General Chair:
    • Ying Li,
    • Program Chairs:
    • Bing Liu,
    • Sunita Sarawagi
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 24 August 2008

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. semi-supervised learning
    2. time series prediction

    Qualifiers

    • Research-article

    Conference

    KDD08

    Acceptance Rates

    KDD '08 Paper Acceptance Rate 118 of 593 submissions, 20%;
    Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

    Upcoming Conference

    KDD '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)17
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 07 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)Time Series Prediction via Similarity Search: Exploring Invariances, Distance Measures and Ensemble FunctionsIEEE Access10.1109/ACCESS.2022.319284910(78022-78043)Online publication date: 2022
    • (2018)Multi-step Time Series Forecasting of Electric Load Using Machine Learning ModelsArtificial Intelligence and Soft Computing10.1007/978-3-319-91253-0_15(148-159)Online publication date: 11-May-2018
    • (2016)QUEST: Eliminating Online Supervised Learning for Efficient Classification AlgorithmsSensors10.3390/s1610162916:10(1629)Online publication date: 1-Oct-2016
    • (2016)Latent Space Model for Road Networks to Predict Time-Varying TrafficProceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining10.1145/2939672.2939860(1525-1534)Online publication date: 13-Aug-2016
    • (2014)Multi-transferStatistical Analysis and Data Mining10.1002/sam.112267:4(282-293)Online publication date: 1-Aug-2014
    • (2014)Wedge sampling for computing clustering coefficients and triangle counts on large graphsStatistical Analysis and Data Mining10.1002/sam.112247:4(294-307)Online publication date: 1-Aug-2014
    • (2014)Exploring and inferring user-user pseudo-friendship for sentiment analysis with heterogeneous networksStatistical Analysis and Data Mining10.1002/sam.112237:4(308-321)Online publication date: 1-Aug-2014
    • (2014)Contour regressionStatistical Analysis and Data Mining10.1002/sam.112227:4(272-281)Online publication date: 1-Aug-2014
    • (2014)Dual-tree fast exact max-kernel searchStatistical Analysis and Data Mining10.1002/sam.112187:4(229-253)Online publication date: 1-Aug-2014
    • (2014)On handling negative transfer and imbalanced distributions in multiple source transfer learningStatistical Analysis and Data Mining10.1002/sam.112177:4(254-271)Online publication date: 1-Aug-2014
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media