Article

Active learning for Hidden Markov Models: objective functions and algorithms

Authors:

Brigham Anderson,

Andrew MooreAuthors Info & Claims

ICML '05: Proceedings of the 22nd international conference on Machine learning

Pages 9 - 16

https://doi.org/10.1145/1102351.1102353

Published: 07 August 2005 Publication History

Abstract

Hidden Markov Models (HMMs) model sequential data in many fields such as text/speech processing and biosignal analysis. Active learning algorithms learn faster and/or better by closing the data-gathering loop, i.e., they choose the examples most informative with respect to their learning objectives. We introduce a framework and objective functions for active learning in three fundamental HMM problems: model learning, state estimation, and path estimation. In addition, we describe a new set of algorithms for efficiently finding optimal greedy queries using these objective functions. The algorithms are fast, i.e., linear in the number of time steps to select the optimal query and we present empirical results showing that these algorithms can significantly reduce the need for labelled training data.

References

[1]

Boyen, X., & Koller, D. (1998). Tractable inference for complex stochastic processes. Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (pp. 33--42).

Digital Library

[2]

Cohn, D. A., Atlas, L., & Ladner, R. E. (1994). Improving generalization with active learning. Machine Learning, 15, 201--221.

[3]

Cohn, D. A., Ghahramani, Z., & Jordan, M. I. (1995). Active learning with statistical models. Advances in Neural Information Processing Systems (pp. 705--712). The MIT Press.

[4]

Durbin, R., Eddy, S. R., Krogh, A., & Mitchison, G. (2000). Biological sequence analysis: probabilistic models of proteins and nucleic acids. Cambridge Univ. Press, Durbin.

[5]

Freund, Y., Seung, H. S., Shamir, E., & Tishby, N. (1997). Selective sampling using the query by committee algorithm. Machine Learning, 28, 133--168.

Digital Library

[6]

Krause, A., & Guestrin, C. (2005). Optimal nonmyopic value of information in graphical models (Technical Report). Carnegie Mellon University.

[7]

Krishnamurthy, V. (2002). Algorithms for optimal scheduling and management of hidden markov model sensors. IEEE Transactions on Signal Processing, 50, 1382--1397.

Digital Library

[8]

Lewis, D. D., & Catlett, J. (1994). Heterogeneous uncertainty sampling for supervised learning. Proceedings of ICML-94, 11th International Conference on Machine Learning (pp. 148--156). New Brunswick, US: Morgan Kaufmann Publishers, San Francisco, US.

[9]

Mackay, D. (1992). Information-Based Objective Functions for Active Data Selection. Neural Computation, 4, 589--603.

Digital Library

[10]

MacKay, D. (1997). Ensemble learning for hidden markov models (Technical Report). University of Cambridge.

[11]

Minka, T. (2001). A family of algorithms for approximate bayesian inference. Doctoral dissertation, MIT.

Digital Library

[12]

Rabiner, L. R. (1990). A tutorial on hidden markov models and selected applications in speech recognition. In A. Waibel and K.-F. Lee (Eds.), Readings in speech recognition, 267--296. San Mateo, CA: Kaufmann.

Digital Library

[13]

Rezek, I., & Roberts, S. J. (2002). Ensemble hidden markov models for biosignal analysis.

[14]

Roy, N., & McCallum, A. (2001). Toward optimal active learning through sampling estimation of error reduction. Proc. 18th International Conf. on Machine Learning (pp. 441--448). Morgan Kaufmann, San Francisco, CA.

Digital Library

[15]

Scheffer, T., Decomain, C., & Wrobel, S. (2001). Active hidden Markov models for information extraction. Lecture Notes in Computer Science, 2189, 309+.

Digital Library

[16]

Seung, H. S., Opper, M., & Sompolinsky, H. (1992). Query by committee. Computational Learning Theory (pp. 287--294).

Digital Library

[17]

Steck, H., & Jaakkola, T. (2002). Unsupervised active learning in large domains. Proceedings of the 18th Annual Conference on Uncertainty in Artificial Intelligence (UAI-02) (pp. 469--476). San Francisco, CA: Morgan Kaufmann Publishers.

Digital Library

[18]

Tong, S., & Koller, D. (2000). Active learning for parameter estimation in bayesian networks. NIPS (pp. 647--653).

[19]

Tong, S., & Koller, D. (2001). Active learning for structure in bayesian networks. IJCAI (pp. 863--869).

Digital Library

[20]

Tur, G., Schapire, R., & Hakkani-Tur, D. (2003). Active learning for spoken language understanding.

Cited By

Jha AAshwood ZPillow J(2024)Active Learning for Discrete Latent Variable ModelsNeural Computation10.1162/neco_a_0164636:3(437-474)Online publication date: 16-Feb-2024
https://doi.org/10.1162/neco_a_01646
Kim YDán GZhu Q(2024)Human-in-the-Loop Cyber Intrusion Detection Using Active LearningIEEE Transactions on Information Forensics and Security10.1109/TIFS.2024.343464719(8658-8672)Online publication date: 2024
https://doi.org/10.1109/TIFS.2024.3434647
Kim YDan G(2022)An Active Learning Approach to Dynamic Alert Prioritization for Real-time Situational Awareness2022 IEEE Conference on Communications and Network Security (CNS)10.1109/CNS56114.2022.9947246(154-162)Online publication date: 3-Oct-2022
https://doi.org/10.1109/CNS56114.2022.9947246
Show More Cited By

Active learning for Hidden Markov Models: objective functions and algorithms
1. Mathematics of computing
  1. Probability and statistics
    1. Probabilistic representations
    2. Stochastic processes
2. Theory of computation
  1. Theory and algorithms for application domains
    1. Machine learning theory

Recommendations

Coding with partially hidden Markov models
DCC '95: Proceedings of the Conference on Data Compression

Partially hidden Markov models (PHMM) are introduced. They are a variation of the hidden Markov models (HMM) combining the power of explicit conditioning on past observations and the power of using hidden states. (P)HMM may be combined with arithmetic ...
Joint semi-supervised learning of Hidden Conditional Random Fields and Hidden Markov Models

Although semi-supervised learning has generated great interest for designing classifiers on static patterns, there has been comparatively fewer works on semi-supervised learning for structured outputs and in particular for sequences. We investigate semi-...
Learning nonsingular phylogenies and hidden Markov models
STOC '05: Proceedings of the thirty-seventh annual ACM symposium on Theory of computing

In this paper, we study the problem of learning phylogenies and hidden Markov models. We call a Markov model nonsingular if all transition matrices have determinants bounded away from 0 (and 1). We highlight the role of the nonsingularity condition for ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

ICML '05: Proceedings of the 22nd international conference on Machine learning

August 2005

1113 pages

ISBN:1595931805

DOI:10.1145/1102351

General Chair:
Saso Dzeroski
Jozef Stefan Institute, Slovenia
,
Program Chairs:
Luc De Raedt,
Stefan Wrobel

Copyright © 2005 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 August 2005

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Qualifiers

Article

Acceptance Rates

Overall Acceptance Rate 140 of 548 submissions, 26%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

30
Total Citations
View Citations
550
Total Downloads

Downloads (Last 12 months)12
Downloads (Last 6 weeks)3

Reflects downloads up to 07 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

Jha AAshwood ZPillow J(2024)Active Learning for Discrete Latent Variable ModelsNeural Computation10.1162/neco_a_0164636:3(437-474)Online publication date: 16-Feb-2024
https://doi.org/10.1162/neco_a_01646
Kim YDán GZhu Q(2024)Human-in-the-Loop Cyber Intrusion Detection Using Active LearningIEEE Transactions on Information Forensics and Security10.1109/TIFS.2024.343464719(8658-8672)Online publication date: 2024
https://doi.org/10.1109/TIFS.2024.3434647
Kim YDan G(2022)An Active Learning Approach to Dynamic Alert Prioritization for Real-time Situational Awareness2022 IEEE Conference on Communications and Network Security (CNS)10.1109/CNS56114.2022.9947246(154-162)Online publication date: 3-Oct-2022
https://doi.org/10.1109/CNS56114.2022.9947246
Petric FKovacic Z(2019)Design and Validation of MOMDP Models for Child–Robot Interaction Within Tasks of Robot-Assisted ASD Diagnostic ProtocolInternational Journal of Social Robotics10.1007/s12369-019-00577-0Online publication date: 23-Jul-2019
https://doi.org/10.1007/s12369-019-00577-0
Inoue MShirai MMiura T(2017)Sequence Classification Based on Active LearningSoftware Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing10.1007/978-3-319-62048-0_1(1-15)Online publication date: 24-Jun-2017
https://doi.org/10.1007/978-3-319-62048-0_1
Allahverdyan AGalstyan A(2015)Active Inference for Binary Symmetric Hidden Markov ModelsJournal of Statistical Physics10.1007/s10955-015-1321-y161:2(452-466)Online publication date: 9-Aug-2015
https://doi.org/10.1007/s10955-015-1321-y
Chen PLin H(2013)Active Learning for Multiclass Cost-Sensitive Classification Using Probabilistic ModelsProceedings of the 2013 Conference on Technologies and Applications of Artificial Intelligence10.1109/TAAI.2013.17(13-18)Online publication date: 6-Dec-2013
https://dl.acm.org/doi/10.1109/TAAI.2013.17
Chen YNielsen T(2012)Active Learning of Markov Decision Processes for System VerificationProceedings of the 2012 11th International Conference on Machine Learning and Applications - Volume 0210.1109/ICMLA.2012.158(289-294)Online publication date: 12-Dec-2012
https://dl.acm.org/doi/10.1109/ICMLA.2012.158
Strout JDöhler MBernal DMevel L(2012)Changes in the Statistics of Ambient Excitations in the Performance of Two Damage Detection SchemesTopics on the Dynamics of Civil Structures, Volume 110.1007/978-1-4614-2413-0_31(309-316)Online publication date: 6-Mar-2012
https://doi.org/10.1007/978-1-4614-2413-0_31
Alemdar Hvan Kasteren TErsoy C(2011)Activity recognition with Hidden Markov models using active learning2011 IEEE 19th Signal Processing and Communications Applications Conference (SIU)10.1109/SIU.2011.5929862(1161-1164)Online publication date: Apr-2011
https://doi.org/10.1109/SIU.2011.5929862
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents