skip to main content
research-article

Weighted Vote-Based Classifier Ensemble for Named Entity Recognition: A Genetic Algorithm-Based Approach

Published:01 June 2011Publication History
Skip Abstract Section

Abstract

In this article, we report the search capability of Genetic Algorithm (GA) to construct a weighted vote-based classifier ensemble for Named Entity Recognition (NER). Our underlying assumption is that the reliability of predictions of each classifier differs among the various named entity (NE) classes. Thus, it is necessary to quantify the amount of voting of a particular classifier for a particular output class. Here, an attempt is made to determine the appropriate weights of voting for each class in each classifier using GA. The proposed technique is evaluated for four leading Indian languages, namely Bengali, Hindi, Telugu, and Oriya, which are all resource-poor in nature. Evaluation results yield the recall, precision and F-measure values of 92.08%, 92.22%, and 92.15%, respectively for Bengali; 96.07%, 88.63%, and 92.20%, respectively for Hindi; 78.82%, 91.26%, and 84.59%, respectively for Telugu; and 88.56%, 89.98%, and 89.26%, respectively for Oriya. Finally, we evaluate our proposed approach with the benchmark dataset of CoNLL-2003 shared task that yields the overall recall, precision, and F-measure values of 88.72%, 88.64%, and 88.68%, respectively. Results also show that the vote based classifier ensemble identified by the GA-based approach outperforms all the individual classifiers, three conventional baseline ensembles, and some other existing ensemble techniques. In a part of the article, we formulate the problem of feature selection in any classifier under the single objective optimization framework and show that our proposed classifier ensemble attains superior performance to it.

References

  1. Alfonseca, E. and Manandhar, S. 1999. An unsupervised method for general named entity recognition and automated concept discovery. In Proceedings of the 16th National Conference on Artificial Intelligence and the Eleventh Conference on Innovative Applications of Artificial Intelligence (AAAI’99/IAAI’99). 474--479.Google ScholarGoogle Scholar
  2. Anderson, T. W. and Scolve, S. 1978. Introduction to the Statistical Analysis of Data. Houghton Mifflin.Google ScholarGoogle Scholar
  3. Aone, C., Halverson, L., Hampton, T., and Ramos-Santacruz, M. 1998. SRA: Description of the IE2 system used for MUC-7. In Proceedings of the Message Understanding Conference (MUC’98).Google ScholarGoogle Scholar
  4. Babych, B. and Hartley, A. 2003. Improving machine translation quality with automatic named entity recognition. In Proceedings of the Conference on the European Chapter of the Association for Computational Linguistics Workshop on Machine Translation and Other Language Technology Tools (EACL’03). 1--8. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Bennet, S. W., Aone, C., and Lovell, C. 1997. Learning to tag multilingual texts through observation. In Proceedings of Empirical Methods of Natural Language Processing (EMNLP’97). 109--116.Google ScholarGoogle Scholar
  6. Bikel, D. M., Schwartz, R. L., and Weischedel, R. M. 1999. An algorithm that learns what’s in a name. Mach. Learn. 34, 1-3, 211--231. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Borthwick, A. 1999. Maximum entropy approach to named entity recognition. Ph.D. thesis, New York University. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Borthwick, A., Sterling, J., Agichtein, E., and Grishman, R. 1998. NYU: Description of the MENE named entity system as used in MUC-7. In Proceedings of the Machine Understanding Conference (MUC’98).Google ScholarGoogle Scholar
  9. Breiman, L. 1996. Bagging predictors. Mach. Learn. 24, 2, 123--140. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Carrears, X., Marquez, L., and Padro, L. 2002. Named entity recognition using AdaBoost. In Proceedings of the Conference on Natural Language Learning (CoNLL’02). 167--170.Google ScholarGoogle Scholar
  11. Cherkauer, K. 1996. Human expert-level performance on a scientific image analysis task by a system using combined artificial neural networks. In Working Notes of the AAAI Workshop on Integrating Multiple Learned Models (AAAI’96). 15--21.Google ScholarGoogle Scholar
  12. Chieu, H. L. and Ng, H. T. 2003. Named entity recognition with a maximum entropy approach. In Proceedings of the Conference on Natural Language Learning (CoNLL’03). 160--163. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Collins, M. and Singer, Y. 1999. Unsupervised models for named entity classification. In Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora (EMNLP’99).Google ScholarGoogle Scholar
  14. Darroch, J. and Ratcliff, D. 1972. Generalized iterative scaling for log-linear models. Ann. Math. Statist. 43. 1470--1480.Google ScholarGoogle Scholar
  15. Dietterich, T. G. 2002. Ensemble methods in machine learning. In Proceedings of the 1st International Workshop in Multiple Classifiers Systems. J. Kittler and F. Roli Eds., Springer. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Dietterich, T. G. and Bakiri, G. 1995. Solving multiclass learning problems via error correcting output codes. J. Artific. Intell. Res. 2, 263--286. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Ekbal, A. and Bandyopadhyay, S. 2007. Lexical pattern learning from corpus data for named entity recognition. In Proceedings of the 5th International Conference on Natural Language Processing (ICON’07). 123--128.Google ScholarGoogle Scholar
  18. Ekbal, A. and Bandyopadhyay, S. 2008a. Bengali named entity recognition using support vector machine. In Proceedings of the Workshop on Named Entity Recognition for South and South East Asian Languages, 3rd International Joint Conference on Natural Languge Processing (NER-IJCNLP’08). 51--58.Google ScholarGoogle Scholar
  19. Ekbal, A. and Bandyopadhyay, S. 2008b. Web-based Bengali news corpus for lexicon development and POS tagging. POLIBITS, 37, 20--29.Google ScholarGoogle ScholarCross RefCross Ref
  20. Ekbal, A. and Bandyopadhyay, S. 2008c. A Web-based Bengali news corpus for named entity recognition. Lang. Resour. Eval. 42, 2, 173--182.Google ScholarGoogle ScholarCross RefCross Ref
  21. Ekbal, A. and Bandyopadhyay, S. 2009. Voted NER system using appropriate unlabeled data. In Proceedings of the 2009 Named Entities Workshop: Shared Task on Transliteration (NEWS’09). 202--210. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Ekbal, A., Naskar, S., and Bandyopadhyay, S. 2007. Named entity recognition and transliteration in Bengali. Lingvisticae Investigationes J. 30, 1 (Named Entities: Recognition, Classification and Use Special Issue), 95--114.Google ScholarGoogle Scholar
  23. Ekbal, A., Haque, R., and Bandyopadhyay, S. 2008. Named entity recognition in Bengali: A conditional random field approach. In Proceedings of the 3rd International Joint Conference on Natural Language Processing (IJCNLP’08). 589--594.Google ScholarGoogle Scholar
  24. Ekbal, A. and Saha, S. 2010. Weighted vote-based classifier ensemble selection using genetic algorithm for named entity recognition. In Proceedings of the Conference on Natural Languages in Databases (NLDB’10). 256--267. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Etzioni, O., Cafarrella, M., Downey, D., Popescu, A. M., Shaked, T., Soderland, S., Weld, D. S., and Yates, A. 2005. Unsupervised named entity extraction from the Web: An experimental study. Artific. Intell. 165. 91--134. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Florian, R., Ittycheriah, A., Jing, H., and Zhang, T. 2003. Named entity recognition through classifier combination. In Proceedings of the 7th Conference on Natural Language Learning (HLT-NAACL’03). Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Freund, Y. and Schapire, R. 1995a. A decision-theoretic generalization of online learning and an application to boosting. In Proceedings of the 2nd European Conference on Computational Learning Theory (ECCL’95). 23--37. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Freund, Y. and Schapire, R. E. 1995b. A decision-theoretic generalization of online learning and an application to boosting. In Proceedings of the 2nd European Conference on Computational Learning Theory (ECCL’95). 23--37. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Goldberg, D. E. 1989. Genetic Algorithms in Search, Optimization, and Machine Learning. Addison-Wesley, New York. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Holland, J. H. 1975. Adaptation in Natural and Artificial Systems. The University of Michigan Press: AnnArbor.Google ScholarGoogle Scholar
  31. Humphreys, K., Gaizauskas, R., Azzam, S., Huyck, C., Mitchell, B., Cunnigham, H., and Wilks, Y. 1998. University of Sheffield: Description of the LaSIE-II System as Used for MUC-7. In Proceedings of the Message Understanding Conference (MUC’98).Google ScholarGoogle Scholar
  32. Joachims, T. 1999. Making Large Scale SVM Learning Practical. MIT Press: Cambridge, MA, 169--184.Google ScholarGoogle Scholar
  33. Klein, D., Smarr, H. N., and Manning, D. 2003. Named entity recognition with character-level models. In Proceedings of the Conference on Natural Language Learning (CoNLL’03). 188--191. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Kolen, J. F. and Pollack, J. B. 1991. Back propagation is sensitive to initial conditions. Adv. Neural Inf. Proc. Syst. 860--867. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Lafferty, J. D., McCallum, A., and Pereira, F. C. N. 2001. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of the International Conference on Machine Learning (ICML’01). 282--289. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Li, W. and McCallum, A. 2004. Rapid development of Hindi named entity recognition using conditional random fields and feature induction. ACM Trans. on Asian Lang. Inform. Process. 2, 3, 290--294. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Lin, D. and Wu, X. 2009. Phrase clustering for discriminative learning. In Proceedings of 47th Annual Meeting of the Association for Computational Learning (ACL’09). 1030--1038. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Mandl, T. and Womser-Hacker, C. 2005. The effect of named entities on effectiveness in cross-language information retrieval evaluation. In Proceedings of the ACM Symposium on Applied Computing (SAC’05). 1059--1064. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. McCallum, A. and Li, W. 2003. Early results for named entity recognition with conditional random fields, feature induction, and Web-enhanced lexicons. In Proceedings of the Conference on Natural Language Learning (CoNLL’03). 188--191. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Mikheev, A., Grover, C., and Moens, M. 1998. Description of the LTG system used for MUC-7. In Proceedings of the Message Understanding Conference (MUC’98).Google ScholarGoogle Scholar
  41. Mikheev, A., Grover, C., and Moens, M. 1999. Named Entity Recognition without Gazeteers. In Proceedings of the Conference on the European Chapter of the Association for Computational Linguistics (EACL’03). 1--8. Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. Miller, S., Crystal, M., Fox, H., Ramshaw, L., Schawartz, R., Stone, R., Weischedel, R., and the Annotation Group. 1998. BBN: Description of the SIFT System as Used for MUC-7. In Proceedings of the Message Understanding Conference (MUC’98).Google ScholarGoogle Scholar
  43. Nobata, C., Sekine, S., Isahara, H., and Grishman, R. 2002. Summarization system integrated with named entity tagging and IE pattern discovery. In Proceedings of 3rd International Conference on Language Resources and Evaluation (LREC’02).Google ScholarGoogle Scholar
  44. Pasca, M., Lin, D., Bigham, J., Lifchits, A., and Jain, A. 2006. Organizing and searching the World Wide Web of facts - Step one: The one-million fact extraction challenge. In Proceedings of National Conference on Artificial Intelligence (AAAI’06). Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. Patel, A., Ramakrishnan, G., and Bhattacharya, P. 2009. Relational learning assisted construction of rule base for Indian language NER. In Proceedings of the 7th International Conference on Natural Language Processing (ICON’09).Google ScholarGoogle Scholar
  46. Pizzato, L. A., Molla, D., and Paris, C. 2006. Pseudo relevance feedback using named entities for question answering. In Proceedings of the Australian Language Technology Workshop (ALTW’06). 89--90.Google ScholarGoogle Scholar
  47. Riloff, E. and Jones, R. 1999. Learning dictionaries for information extraction by multi-level bootstrapping. In Proceedings of the 16th National Conference on Artificial Intelligence (AAAI’99). 474--479. Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. Saha, S., Sarkar, S., and Mitra, P. 2008. A hybrid feature set based maximum entropy Hindi named entity recognition. In Proceedings of the 3rd International Joint Conference in Natural Langauge Processing (IJCNLP’08). 343--350.Google ScholarGoogle Scholar
  49. Sekine, S. 1998. Description of the Japanese NE system used for MET-2. In Proceedings of the Message Understanding Conference (MUC’98).Google ScholarGoogle Scholar
  50. Seung, H. S., Opper, M., and Sompolinsky, H. 1992. Query by committee. In Proceedings of the ACM Workshop on Computational Learning Theory (CLT’92). Google ScholarGoogle ScholarDigital LibraryDigital Library
  51. Sha, F. and Pereira, F. 2003. Shallow parsing with conditional random fields. In Proceedings of the North American Chapter of the Association for Computational Linguistics (NAACL’03). 134--141. Google ScholarGoogle ScholarDigital LibraryDigital Library
  52. Shinyama, Y. and Sekine, S. 2004. Named entity discovery using comparable news articles. In Proceedings of the International Conference on Computational Linguistics (COLING’04). 848--855. Google ScholarGoogle ScholarDigital LibraryDigital Library
  53. Shishtla, P. M., Pingali, P., and Varma, V. 2008. A character n-gram based approach for improved recall in Indian language NER. In Proceedings of the Workshop on Named Entity Recognition for South and South East Asian Languages (IJCNLP’08). 101--108.Google ScholarGoogle Scholar
  54. Srihari, R., Niu, C., and Li, W. 2002. A hybrid approach for named entity and sub-type tagging. In Proceedings of 6th Conference on Applied Natural Language Processing (ANLP’02). 247--254. Google ScholarGoogle ScholarDigital LibraryDigital Library
  55. Srikanth, P. and Murthy, K. N. 2008. Named entity recognition for Telugu. In Proceedings of the Workshop on Named Entity Recognition for South and South East Asian Languages (IJCNLP’08). 41--50.Google ScholarGoogle Scholar
  56. Srinivas, M. and Patnaik, L. M. 1994. Adaptive probabilities of crossover and mutation in genetic algorithms. IEEE Trans. Syst. Man Cybern. 24, 4, 656--667.Google ScholarGoogle ScholarCross RefCross Ref
  57. Suzuki, J. and Isozaki, H. 2008. Semi-supervised sequential labeling and segmentation using gigaword scale unlabeled data. In Proceedings of the Human Language Technology Conference (ACL/HLT’08). 665--673.Google ScholarGoogle Scholar
  58. Taira, H. and Haruno, M. 1999. Feature selection in SVM text categorization. In Proceedings of National Conference on Artificial Intelligence (AAAI’99). Google ScholarGoogle ScholarDigital LibraryDigital Library
  59. Tjong Kim Sang, E. F. and De Meulder, F. 2003. Introduction to the shared task: Language independent named entity recognition. In Proceedings of the 7th Conference on Natural Language Learning (HLT-NAACL’03). 142--147. Google ScholarGoogle ScholarDigital LibraryDigital Library
  60. Vapnik, V. N. 1995. The Nature of Statistical Learning Theory. Springer-Verlag Berlin, Germany. Google ScholarGoogle ScholarDigital LibraryDigital Library
  61. Vijayakrishna, R. and Sobha, L. 2008. Domain focused named entity recognizer for Tamil using conditional random fields. In Proceedings of the Workshop on Named Entity Recognition for South and South East Asian Languages (IJCNLP’08). 93--100.Google ScholarGoogle Scholar
  62. Wolpert, D. 1992. Stacked generalization. Neural Netw. 5, 241--259. Google ScholarGoogle ScholarDigital LibraryDigital Library
  63. Wu, D., Ngai, G., and Carput, M. 2003. A stacked, voted, stacked model for named entity recognition. In Proceedings of the Conference on Natural Language Learning (CoNLL’03). Google ScholarGoogle ScholarDigital LibraryDigital Library
  64. Yangarber, R., Lin, W., and Grishman, R. 2002. Unsupervised learning of generalized names. In Proceedings of the 19th International Conference on Computational Linguistics (COLING’02). 1--7. Google ScholarGoogle ScholarDigital LibraryDigital Library
  65. Yu, X. 2007. Chinese named entity recognition with cascaded hybrid model. In Proceedings of Human Language Technology Conference/North American Chapter of the Association for Computational Linguistics (NAACL-HLT’07). 197--200. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Weighted Vote-Based Classifier Ensemble for Named Entity Recognition: A Genetic Algorithm-Based Approach

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Asian Language Information Processing
      ACM Transactions on Asian Language Information Processing  Volume 10, Issue 2
      June 2011
      111 pages
      ISSN:1530-0226
      EISSN:1558-3430
      DOI:10.1145/1967293
      Issue’s Table of Contents

      Copyright © 2011 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 1 June 2011
      • Revised: 1 January 2011
      • Accepted: 1 January 2011
      • Received: 1 May 2010
      Published in talip Volume 10, Issue 2

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader