ACM Home Page
Please provide us with feedback. Feedback
Active learning for Hidden Markov Models: objective functions and algorithms
Full text PdfPdf (754 KB)
Source ACM International Conference Proceeding Series; Vol. 119 archive
Proceedings of the 22nd international conference on Machine learning table of contents
Bonn, Germany
Pages: 9 - 16  
Year of Publication: 2005
ISBN:1-59593-180-5
Authors
Brigham Anderson  Carnegie Mellon University, Pittsburgh, PA
Andrew Moore  Carnegie Mellon University, Pittsburgh, PA
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 12,   Downloads (12 Months): 68,   Citation Count: 1
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1102351.1102353
What is a DOI?

ABSTRACT

Hidden Markov Models (HMMs) model sequential data in many fields such as text/speech processing and biosignal analysis. Active learning algorithms learn faster and/or better by closing the data-gathering loop, i.e., they choose the examples most informative with respect to their learning objectives. We introduce a framework and objective functions for active learning in three fundamental HMM problems: model learning, state estimation, and path estimation. In addition, we describe a new set of algorithms for efficiently finding optimal greedy queries using these objective functions. The algorithms are fast, i.e., linear in the number of time steps to select the optimal query and we present empirical results showing that these algorithms can significantly reduce the need for labelled training data.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Boyen, X., & Koller, D. (1998). Tractable inference for complex stochastic processes. Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (pp. 33--42).
 
2
 
3
Cohn, D. A., Ghahramani, Z., & Jordan, M. I. (1995). Active learning with statistical models. Advances in Neural Information Processing Systems (pp. 705--712). The MIT Press.
 
4
Durbin, R., Eddy, S. R., Krogh, A., & Mitchison, G. (2000). Biological sequence analysis: probabilistic models of proteins and nucleic acids. Cambridge Univ. Press, Durbin.
 
5
 
6
Krause, A., & Guestrin, C. (2005). Optimal nonmyopic value of information in graphical models (Technical Report). Carnegie Mellon University.
 
7
Krishnamurthy, V. (2002). Algorithms for optimal scheduling and management of hidden markov model sensors. IEEE Transactions on Signal Processing, 50, 1382--1397.
 
8
Lewis, D. D., & Catlett, J. (1994). Heterogeneous uncertainty sampling for supervised learning. Proceedings of ICML-94, 11th International Conference on Machine Learning (pp. 148--156). New Brunswick, US: Morgan Kaufmann Publishers, San Francisco, US.
 
9
 
10
MacKay, D. (1997). Ensemble learning for hidden markov models (Technical Report). University of Cambridge.
 
11
 
12
 
13
Rezek, I., & Roberts, S. J. (2002). Ensemble hidden markov models for biosignal analysis.
 
14
 
15
16
 
17
Steck, H., & Jaakkola, T. (2002). Unsupervised active learning in large domains. Proceedings of the 18th Annual Conference on Uncertainty in Artificial Intelligence (UAI-02) (pp. 469--476). San Francisco, CA: Morgan Kaufmann Publishers.
 
18
Tong, S., & Koller, D. (2000). Active learning for parameter estimation in bayesian networks. NIPS (pp. 647--653).
 
19
Tong, S., & Koller, D. (2001). Active learning for structure in bayesian networks. IJCAI (pp. 863--869).
 
20
Tur, G., Schapire, R., & Hakkani-Tur, D. (2003). Active learning for spoken language understanding.

Collaborative Colleagues:
Brigham Anderson: colleagues
Andrew Moore: colleagues