Abstract
In this article, we report the search capability of Genetic Algorithm (GA) to construct a weighted vote-based classifier ensemble for Named Entity Recognition (NER). Our underlying assumption is that the reliability of predictions of each classifier differs among the various named entity (NE) classes. Thus, it is necessary to quantify the amount of voting of a particular classifier for a particular output class. Here, an attempt is made to determine the appropriate weights of voting for each class in each classifier using GA. The proposed technique is evaluated for four leading Indian languages, namely Bengali, Hindi, Telugu, and Oriya, which are all resource-poor in nature. Evaluation results yield the recall, precision and F-measure values of 92.08%, 92.22%, and 92.15%, respectively for Bengali; 96.07%, 88.63%, and 92.20%, respectively for Hindi; 78.82%, 91.26%, and 84.59%, respectively for Telugu; and 88.56%, 89.98%, and 89.26%, respectively for Oriya. Finally, we evaluate our proposed approach with the benchmark dataset of CoNLL-2003 shared task that yields the overall recall, precision, and F-measure values of 88.72%, 88.64%, and 88.68%, respectively. Results also show that the vote based classifier ensemble identified by the GA-based approach outperforms all the individual classifiers, three conventional baseline ensembles, and some other existing ensemble techniques. In a part of the article, we formulate the problem of feature selection in any classifier under the single objective optimization framework and show that our proposed classifier ensemble attains superior performance to it.
- Alfonseca, E. and Manandhar, S. 1999. An unsupervised method for general named entity recognition and automated concept discovery. In Proceedings of the 16th National Conference on Artificial Intelligence and the Eleventh Conference on Innovative Applications of Artificial Intelligence (AAAI’99/IAAI’99). 474--479.Google Scholar
- Anderson, T. W. and Scolve, S. 1978. Introduction to the Statistical Analysis of Data. Houghton Mifflin.Google Scholar
- Aone, C., Halverson, L., Hampton, T., and Ramos-Santacruz, M. 1998. SRA: Description of the IE2 system used for MUC-7. In Proceedings of the Message Understanding Conference (MUC’98).Google Scholar
- Babych, B. and Hartley, A. 2003. Improving machine translation quality with automatic named entity recognition. In Proceedings of the Conference on the European Chapter of the Association for Computational Linguistics Workshop on Machine Translation and Other Language Technology Tools (EACL’03). 1--8. Google ScholarDigital Library
- Bennet, S. W., Aone, C., and Lovell, C. 1997. Learning to tag multilingual texts through observation. In Proceedings of Empirical Methods of Natural Language Processing (EMNLP’97). 109--116.Google Scholar
- Bikel, D. M., Schwartz, R. L., and Weischedel, R. M. 1999. An algorithm that learns what’s in a name. Mach. Learn. 34, 1-3, 211--231. Google ScholarDigital Library
- Borthwick, A. 1999. Maximum entropy approach to named entity recognition. Ph.D. thesis, New York University. Google ScholarDigital Library
- Borthwick, A., Sterling, J., Agichtein, E., and Grishman, R. 1998. NYU: Description of the MENE named entity system as used in MUC-7. In Proceedings of the Machine Understanding Conference (MUC’98).Google Scholar
- Breiman, L. 1996. Bagging predictors. Mach. Learn. 24, 2, 123--140. Google ScholarDigital Library
- Carrears, X., Marquez, L., and Padro, L. 2002. Named entity recognition using AdaBoost. In Proceedings of the Conference on Natural Language Learning (CoNLL’02). 167--170.Google Scholar
- Cherkauer, K. 1996. Human expert-level performance on a scientific image analysis task by a system using combined artificial neural networks. In Working Notes of the AAAI Workshop on Integrating Multiple Learned Models (AAAI’96). 15--21.Google Scholar
- Chieu, H. L. and Ng, H. T. 2003. Named entity recognition with a maximum entropy approach. In Proceedings of the Conference on Natural Language Learning (CoNLL’03). 160--163. Google ScholarDigital Library
- Collins, M. and Singer, Y. 1999. Unsupervised models for named entity classification. In Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora (EMNLP’99).Google Scholar
- Darroch, J. and Ratcliff, D. 1972. Generalized iterative scaling for log-linear models. Ann. Math. Statist. 43. 1470--1480.Google Scholar
- Dietterich, T. G. 2002. Ensemble methods in machine learning. In Proceedings of the 1st International Workshop in Multiple Classifiers Systems. J. Kittler and F. Roli Eds., Springer. Google ScholarDigital Library
- Dietterich, T. G. and Bakiri, G. 1995. Solving multiclass learning problems via error correcting output codes. J. Artific. Intell. Res. 2, 263--286. Google ScholarDigital Library
- Ekbal, A. and Bandyopadhyay, S. 2007. Lexical pattern learning from corpus data for named entity recognition. In Proceedings of the 5th International Conference on Natural Language Processing (ICON’07). 123--128.Google Scholar
- Ekbal, A. and Bandyopadhyay, S. 2008a. Bengali named entity recognition using support vector machine. In Proceedings of the Workshop on Named Entity Recognition for South and South East Asian Languages, 3rd International Joint Conference on Natural Languge Processing (NER-IJCNLP’08). 51--58.Google Scholar
- Ekbal, A. and Bandyopadhyay, S. 2008b. Web-based Bengali news corpus for lexicon development and POS tagging. POLIBITS, 37, 20--29.Google ScholarCross Ref
- Ekbal, A. and Bandyopadhyay, S. 2008c. A Web-based Bengali news corpus for named entity recognition. Lang. Resour. Eval. 42, 2, 173--182.Google ScholarCross Ref
- Ekbal, A. and Bandyopadhyay, S. 2009. Voted NER system using appropriate unlabeled data. In Proceedings of the 2009 Named Entities Workshop: Shared Task on Transliteration (NEWS’09). 202--210. Google ScholarDigital Library
- Ekbal, A., Naskar, S., and Bandyopadhyay, S. 2007. Named entity recognition and transliteration in Bengali. Lingvisticae Investigationes J. 30, 1 (Named Entities: Recognition, Classification and Use Special Issue), 95--114.Google Scholar
- Ekbal, A., Haque, R., and Bandyopadhyay, S. 2008. Named entity recognition in Bengali: A conditional random field approach. In Proceedings of the 3rd International Joint Conference on Natural Language Processing (IJCNLP’08). 589--594.Google Scholar
- Ekbal, A. and Saha, S. 2010. Weighted vote-based classifier ensemble selection using genetic algorithm for named entity recognition. In Proceedings of the Conference on Natural Languages in Databases (NLDB’10). 256--267. Google ScholarDigital Library
- Etzioni, O., Cafarrella, M., Downey, D., Popescu, A. M., Shaked, T., Soderland, S., Weld, D. S., and Yates, A. 2005. Unsupervised named entity extraction from the Web: An experimental study. Artific. Intell. 165. 91--134. Google ScholarDigital Library
- Florian, R., Ittycheriah, A., Jing, H., and Zhang, T. 2003. Named entity recognition through classifier combination. In Proceedings of the 7th Conference on Natural Language Learning (HLT-NAACL’03). Google ScholarDigital Library
- Freund, Y. and Schapire, R. 1995a. A decision-theoretic generalization of online learning and an application to boosting. In Proceedings of the 2nd European Conference on Computational Learning Theory (ECCL’95). 23--37. Google ScholarDigital Library
- Freund, Y. and Schapire, R. E. 1995b. A decision-theoretic generalization of online learning and an application to boosting. In Proceedings of the 2nd European Conference on Computational Learning Theory (ECCL’95). 23--37. Google ScholarDigital Library
- Goldberg, D. E. 1989. Genetic Algorithms in Search, Optimization, and Machine Learning. Addison-Wesley, New York. Google ScholarDigital Library
- Holland, J. H. 1975. Adaptation in Natural and Artificial Systems. The University of Michigan Press: AnnArbor.Google Scholar
- Humphreys, K., Gaizauskas, R., Azzam, S., Huyck, C., Mitchell, B., Cunnigham, H., and Wilks, Y. 1998. University of Sheffield: Description of the LaSIE-II System as Used for MUC-7. In Proceedings of the Message Understanding Conference (MUC’98).Google Scholar
- Joachims, T. 1999. Making Large Scale SVM Learning Practical. MIT Press: Cambridge, MA, 169--184.Google Scholar
- Klein, D., Smarr, H. N., and Manning, D. 2003. Named entity recognition with character-level models. In Proceedings of the Conference on Natural Language Learning (CoNLL’03). 188--191. Google ScholarDigital Library
- Kolen, J. F. and Pollack, J. B. 1991. Back propagation is sensitive to initial conditions. Adv. Neural Inf. Proc. Syst. 860--867. Google ScholarDigital Library
- Lafferty, J. D., McCallum, A., and Pereira, F. C. N. 2001. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of the International Conference on Machine Learning (ICML’01). 282--289. Google ScholarDigital Library
- Li, W. and McCallum, A. 2004. Rapid development of Hindi named entity recognition using conditional random fields and feature induction. ACM Trans. on Asian Lang. Inform. Process. 2, 3, 290--294. Google ScholarDigital Library
- Lin, D. and Wu, X. 2009. Phrase clustering for discriminative learning. In Proceedings of 47th Annual Meeting of the Association for Computational Learning (ACL’09). 1030--1038. Google ScholarDigital Library
- Mandl, T. and Womser-Hacker, C. 2005. The effect of named entities on effectiveness in cross-language information retrieval evaluation. In Proceedings of the ACM Symposium on Applied Computing (SAC’05). 1059--1064. Google ScholarDigital Library
- McCallum, A. and Li, W. 2003. Early results for named entity recognition with conditional random fields, feature induction, and Web-enhanced lexicons. In Proceedings of the Conference on Natural Language Learning (CoNLL’03). 188--191. Google ScholarDigital Library
- Mikheev, A., Grover, C., and Moens, M. 1998. Description of the LTG system used for MUC-7. In Proceedings of the Message Understanding Conference (MUC’98).Google Scholar
- Mikheev, A., Grover, C., and Moens, M. 1999. Named Entity Recognition without Gazeteers. In Proceedings of the Conference on the European Chapter of the Association for Computational Linguistics (EACL’03). 1--8. Google ScholarDigital Library
- Miller, S., Crystal, M., Fox, H., Ramshaw, L., Schawartz, R., Stone, R., Weischedel, R., and the Annotation Group. 1998. BBN: Description of the SIFT System as Used for MUC-7. In Proceedings of the Message Understanding Conference (MUC’98).Google Scholar
- Nobata, C., Sekine, S., Isahara, H., and Grishman, R. 2002. Summarization system integrated with named entity tagging and IE pattern discovery. In Proceedings of 3rd International Conference on Language Resources and Evaluation (LREC’02).Google Scholar
- Pasca, M., Lin, D., Bigham, J., Lifchits, A., and Jain, A. 2006. Organizing and searching the World Wide Web of facts - Step one: The one-million fact extraction challenge. In Proceedings of National Conference on Artificial Intelligence (AAAI’06). Google ScholarDigital Library
- Patel, A., Ramakrishnan, G., and Bhattacharya, P. 2009. Relational learning assisted construction of rule base for Indian language NER. In Proceedings of the 7th International Conference on Natural Language Processing (ICON’09).Google Scholar
- Pizzato, L. A., Molla, D., and Paris, C. 2006. Pseudo relevance feedback using named entities for question answering. In Proceedings of the Australian Language Technology Workshop (ALTW’06). 89--90.Google Scholar
- Riloff, E. and Jones, R. 1999. Learning dictionaries for information extraction by multi-level bootstrapping. In Proceedings of the 16th National Conference on Artificial Intelligence (AAAI’99). 474--479. Google ScholarDigital Library
- Saha, S., Sarkar, S., and Mitra, P. 2008. A hybrid feature set based maximum entropy Hindi named entity recognition. In Proceedings of the 3rd International Joint Conference in Natural Langauge Processing (IJCNLP’08). 343--350.Google Scholar
- Sekine, S. 1998. Description of the Japanese NE system used for MET-2. In Proceedings of the Message Understanding Conference (MUC’98).Google Scholar
- Seung, H. S., Opper, M., and Sompolinsky, H. 1992. Query by committee. In Proceedings of the ACM Workshop on Computational Learning Theory (CLT’92). Google ScholarDigital Library
- Sha, F. and Pereira, F. 2003. Shallow parsing with conditional random fields. In Proceedings of the North American Chapter of the Association for Computational Linguistics (NAACL’03). 134--141. Google ScholarDigital Library
- Shinyama, Y. and Sekine, S. 2004. Named entity discovery using comparable news articles. In Proceedings of the International Conference on Computational Linguistics (COLING’04). 848--855. Google ScholarDigital Library
- Shishtla, P. M., Pingali, P., and Varma, V. 2008. A character n-gram based approach for improved recall in Indian language NER. In Proceedings of the Workshop on Named Entity Recognition for South and South East Asian Languages (IJCNLP’08). 101--108.Google Scholar
- Srihari, R., Niu, C., and Li, W. 2002. A hybrid approach for named entity and sub-type tagging. In Proceedings of 6th Conference on Applied Natural Language Processing (ANLP’02). 247--254. Google ScholarDigital Library
- Srikanth, P. and Murthy, K. N. 2008. Named entity recognition for Telugu. In Proceedings of the Workshop on Named Entity Recognition for South and South East Asian Languages (IJCNLP’08). 41--50.Google Scholar
- Srinivas, M. and Patnaik, L. M. 1994. Adaptive probabilities of crossover and mutation in genetic algorithms. IEEE Trans. Syst. Man Cybern. 24, 4, 656--667.Google ScholarCross Ref
- Suzuki, J. and Isozaki, H. 2008. Semi-supervised sequential labeling and segmentation using gigaword scale unlabeled data. In Proceedings of the Human Language Technology Conference (ACL/HLT’08). 665--673.Google Scholar
- Taira, H. and Haruno, M. 1999. Feature selection in SVM text categorization. In Proceedings of National Conference on Artificial Intelligence (AAAI’99). Google ScholarDigital Library
- Tjong Kim Sang, E. F. and De Meulder, F. 2003. Introduction to the shared task: Language independent named entity recognition. In Proceedings of the 7th Conference on Natural Language Learning (HLT-NAACL’03). 142--147. Google ScholarDigital Library
- Vapnik, V. N. 1995. The Nature of Statistical Learning Theory. Springer-Verlag Berlin, Germany. Google ScholarDigital Library
- Vijayakrishna, R. and Sobha, L. 2008. Domain focused named entity recognizer for Tamil using conditional random fields. In Proceedings of the Workshop on Named Entity Recognition for South and South East Asian Languages (IJCNLP’08). 93--100.Google Scholar
- Wolpert, D. 1992. Stacked generalization. Neural Netw. 5, 241--259. Google ScholarDigital Library
- Wu, D., Ngai, G., and Carput, M. 2003. A stacked, voted, stacked model for named entity recognition. In Proceedings of the Conference on Natural Language Learning (CoNLL’03). Google ScholarDigital Library
- Yangarber, R., Lin, W., and Grishman, R. 2002. Unsupervised learning of generalized names. In Proceedings of the 19th International Conference on Computational Linguistics (COLING’02). 1--7. Google ScholarDigital Library
- Yu, X. 2007. Chinese named entity recognition with cascaded hybrid model. In Proceedings of Human Language Technology Conference/North American Chapter of the Association for Computational Linguistics (NAACL-HLT’07). 197--200. Google ScholarDigital Library
Index Terms
- Weighted Vote-Based Classifier Ensemble for Named Entity Recognition: A Genetic Algorithm-Based Approach
Recommendations
Multiobjective optimization for classifier ensemble and feature selection: an application to named entity recognition
In this paper, the concept of finding an appropriate classifier ensemble for named entity recognition is posed as a multiobjective optimization (MOO) problem. Our underlying assumption is that instead of searching for the best-fitting feature set for a ...
Classifier Ensemble Selection Using Genetic Algorithm for Named Entity Recognition
In this paper, we propose a classifier ensemble technique based on genetic algorithm (GA) for named entity recognition (NER). We assume that the classifiers based on different feature representations can be effectively combined together using GA to ...
Building Locally Discriminative Classifier Ensemble Through Classifier Fusion Among Nearest Neighbors
PCM 2016: 17th Pacific-Rim Conference on Advances in Multimedia Information Processing - Volume 9916Many studies on ensemble learning that combines multiple classifiers have shown that, it is an effective technique to improve accuracy and stability of a single classifier. In this paper, we propose a novel discriminative classifier fusion method, which ...
Comments