ACM Home Page
Please provide us with feedback. Feedback
Confidence estimation for NLP applications
Full text PdfPdf (1.17 MB)
Source ACM Transactions on Speech and Language Processing (TSLP) archive
Volume 3 ,  Issue 3  (October 2006) table of contents
Pages: 1 - 29  
Year of Publication: 2006
ISSN:1550-4875
Authors
Simona Gandrabur  RALI, Université de Montréal, Montréal, Québec, Canada
George Foster  National Research Council, Gatineau, Quebec, Canada
Guy Lapalme  RALI, Université de Montréal, Montréal, Québec, Canada
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 11,   Downloads (12 Months): 140,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1177055.1177057
What is a DOI?

ABSTRACT

Confidence measures are a practical solution for improving the usefulness of Natural Language Processing applications. Confidence estimation is a generic machine learning approach for deriving confidence measures. We give an overview of the application of confidence estimation in various fields of Natural Language Processing, and present experimental results for speech recognition, spoken language understanding, and statistical machine translation.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
Blatz, J., Fitzgerald, E., Foster, G., Gandrabur, S., Goutte, C., Kulesza, A., Sanchis, A., and Ueffing, N. 2004a. Confidence estimation for machine translation. In Proceedings of COLING 2004 (Geneva). 315--321.
 
3
Blatz, J., Fitzgerald, E., Foster, G., Gandrabur, S., Goutte, C., Kulesza, A., Sanchis, A., and Ueffing, N. 2004b. Confidence estimation for machine translation. Tech. Rep., Final report JHU /CLSP 2003 Summer Workshop, Baltimore, MD.
 
4
Caminero, J., De La Torre, D., Villarubia, L., Martin, C., and Hernandez, L. 1996. On-line garbage modeling with discriminant analysis for utterance verification. In Proceedings of the 4th ICSLP 1996. Vol. 4. Philadelphia, PA, 2115--2118.
 
5
Carpenter, P., Jin, C., Wilson, D., Zhang, R., Bohus, D., and Rudnicky, A. 2001. Is this conversation on track? In Eurospeech 2001 (Aalborg, Denmark). 2121--2124.
 
6
 
7
 
8
Collins, M., Roark, B., and Saraclar, M. 2005. Discriminative syntactic language modeling for speech recognition. In Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL'05). Association for Computational Linguistics, Ann Arbor, MI. 507--514.
 
9
Collobert, R., Bengio, S., and Mariéthoz, J. 2002. Torch: A modular machine learning software library. Tech. Rep. IDIAP-RR 02-46, IDIAP.
 
10
Culotta, A. and McCallum, A. 2004. Confidence estimation for information extraction. In Proceedings of HLT-NAACL.
 
11
Culotta, A. and McCallum, A. 2005. Reducing labeling effort for structured prediction tasks. In Proceedings of AAAI.
 
12
Delany, S. J., Cunningham, P., and Doyle, D. 2005. Generating estimates of classification confidence for a case-based spam filter. Tech. Rep. TCD-CS-2005-20, Trinity College Dublin Computer Science Department.
 
13
 
14
 
15
 
16
 
17
18
 
19
Guillevic, D., Gandrabur, S., and Normandin, Y. 2002. Robust semantic confidence scoring. In Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP) 2002 (Denver, CO).
 
20
Hasegawa-Johnson, M., Baker, J., Borys, S., Chen, K., Coogan, E., Greenberg, S., Juneja, A., Kirchhoff, K., Livescu, K., Mohan, S., Muller, J., Sonmez, K., and Wang, T. 2005. Landmark-based speech recognition: Report of the 2004 Johns Hopkins Summer Workshop. In ICASSP.
 
21
Hazen, T., Burianek, T., Polifroni, J., and Seneff, S. 2000. Recognition confidence scoring for use in speech understanding systems. In Proc. Automatic Speech Recognition Workshop (Paris, France). 213--220.
 
22
Hazen, T., Seneff, S., and Polifroni, J. 2002. Recognition confidence scoring and its use in speech understanding systems. Compu. Speech Lang., 49--67.
 
23
 
24
Kemp, T. and Schaaf, T. 1997. Estimating confidence using word lattices. In Eurospeech. 827--830.
 
25
 
26
Ma, C., Randolph, M., and Drish, J. 2001. A support vector machines-based rejection technique for speech recognition. In ICASSP 2001.
 
27
Macklovitch, E. 2006. TransType2: The last word. In Proceedings of LREC 2006. Genoa, Italy.
 
28
Maison, B. and Gopinath, R. 2001. Robust confidence annotation and rejection for continuous speech recognition. In ICASSP 2001.
 
29
Mangu, L., Brill, E., and Stolcke, A. 2000. Finding consensus in speech recognition: Word error minimization and other applications of confusion networks. Compu. Speech Lang. 14, 4, 373--400.
 
30
Manmatha, R. and Sever, H. 2002. A formal approach to score normalization for meta-search. In Proceedings of HLT 2002, 2nd International Conference on Human Language Technology Research, M. Marcus, Ed. Morgan Kaufmann, San Francisco, CA. 98--103.
 
31
Moreno, P., Logan, B., and Raj, B. 2001. A boosting approach for confidence scoring. In Proceedings of Eurospeech 2001 (Aalborg, Denmark). 2109--2112.
 
32
Nepveu, L., Lapalme, G., Langlais, P., and Foster, G. 2004. Adaptive language and translation models for interactive machine translation. In EMLNP'04 (Barcelona, Spain). 190--197.
 
33
 
34
NIST. 2001. Automatic evaluation of machine translation quality using n-gram co-occurence statistics. www.nist.gov/speech/tests/mt/doc/ngram-study.pdf.
 
35
 
36
Papineni, K., Roukos, S., Ward, T., and Zhu, W. 2001. Bleu: A method for automatic evaluation of machine translation. Technical Report RC22176 (W0109-022), IBM Research Division, Thomas J. Watson Research Center (2001).
 
37
Quirk, C. B. 2004. Training a sentence-level machine translation confidence measure. In Proceedings of LREC (Lisbon, Portual). 825--828.
 
38
San-Secundo, R., Pellom, B., Hacioglu, K., Ward, W., and Pardo, J. 2001. Confidence measures for spoken dialogue systems. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-2001) (Salt Lake City, UT).
 
39
Sanchis, A., Juan, A., and Vidal, E. 2003. Improving utterance verification using a smoothed naive Bayes model. In ICASSP 2003, (Hong Kong). 29--35.
 
40
Schapire, R. 2001. The boosting approach to machine learning: An overview. In Proceedings of the MSRI Workshop on Nonlinear Estimation and Classification (Berkeley, CA, Mar.).
 
41
Shen, L., Sarkar, A., and Och, F. J. 2004. Discriminative reranking for machine translation. In HLT-NAACL. 177--184.
 
42
Siu, M. and Gish, H. 1999. Evaluation of word confidence for speech recognition systems. Comput. Speech Lang. 13, 4, 299--318.
 
43
Stolcke, A., Koenig, Y., and Weintraub, M. 1997. Explicit word error minimization in N-best list rescoring. In Proceedings of the 5th European Conference on Speech Communication and Technology. Vol. 1. (Rhodes, Greece). 163--166.
 
44
 
45
Ueffing, N., Macherey, K., and Ney, H. 2003. Confidence measures for statistical machine translation. In Proceedings of the 6th Conference of the Association for Machine Translation in the Americas (New-Orleans, LA). Springer-Verlag, New York, 394--401.
 
46
Ueffing, N. and Ney, H. 2005. Word-level confidence estimation for machine translation using phrase-based translation models. In Proceedings of HLT/EMNLP. Vancouver, BC, Canada.
 
47
 
48
Wessel, F., Schluter, R., Macherey, K., and Ney, H. 2001. Confidence measures for large vocabulary continuous speech recognition. IEEE Trans. Speech Audio Process. 9, 3, 288--298.
 
49
Xu, J., Licuanan, A., May, J., Miller, S., and Weischedel, R. 2002. Trec2002qa at bbn: Answer selection and confidence estimation. In Proceedings of the 10th Text REtrieval Conference (TREC 2002).
 
50
Zhang, R. and Rudnicky, A. 2001. Word level confidence annotation using combinations of features. In Eurospeech. Aalborg, Denmark, 2105--2108.
 
51
Zou, K. H. 2004. Receiver operating characteristic (ROC) literature research. http://splweb.bwh.harvard.edu:8000/pages/ppl/zou/roc.html.

Collaborative Colleagues:
Simona Gandrabur: colleagues
George Foster: colleagues
Guy Lapalme: colleagues