skip to main content
research-article

Using a Hybrid Convolution Tree Kernel for Semantic Role Labeling

Published: 01 November 2008 Publication History

Abstract

As a kind of Shallow Semantic Parsing, Semantic Role Labeling (SRL) is gaining more attention as it benefits a wide range of natural language processing applications. Given a sentence, the task of SRL is to recognize semantic arguments (roles) for each predicate (target verb or noun). Feature-based methods have achieved much success in SRL and are regarded as the state-of-the-art methods for SRL. However, these methods are less effective in modeling structured features. As an extension of feature-based methods, kernel-based methods are able to capture structured features more efficiently in a much higher dimension. Application of kernel methods to SRL has been achieved by selecting the tree portion of a predicate and one of its arguments as feature space, which is named as predicate-argument feature (PAF) kernel. The PAF kernel captures the syntactic tree structure features using convolution tree kernel, however, it does not distinguish between the path structure and the constituent structure. In this article, a hybrid convolution tree kernel is proposed to model different linguistic objects. The hybrid convolution tree kernel consists of two individual convolution tree kernels. They are a Path kernel, which captures predicate-argument link features, and a Constituent Structure kernel, which captures the syntactic structure features of arguments. Evaluations on the data sets of the CoNLL-2005 SRL shared task and the Chinese PropBank (CPB) show that our proposed hybrid convolution tree kernel statistically significantly outperforms the previous tree kernels. Moreover, in order to maximize the system performance, we present a composite kernel through combining our hybrid convolution tree kernel method with a feature-based method extended by the polynomial kernel. The experimental results show that the composite kernel achieves better performance than each of the individual methods and outperforms the best reported system on the CoNLL-2005 corpus when only one syntactic parser is used and on the CPB corpus when automated syntactic parse results and correct syntactic parse results are used respectively.

References

[1]
Baker, C. F., Fillmore, C. J., and Lowe, J. B. 1998. The Berkeley FrameNet project. In Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics (COLING-ACL’98), 86--90.
[2]
Berger, A. L., Della Pietra, S. A., and Della Pietra, V. J. 1996. A maximum entropy approach to natural language processing. Comput. Linguist. 22, 1, 39--71.
[3]
Carreras, X. and Màrquez, L. 2004. Introduction to the CoNLL-2004 shared task: Semantic role labeling. In Proceedings of the 8th Conference on Natural Language Learning (CoNLL’04). 89--97.
[4]
Carreras, X. and Màrquez, L. 2005. Introduction to the CoNLL-2005 shared task: Semantic role labeling. In Proceedings of the 9th Conference on Natural Language Learning (CoNLL’05). 152--164.
[5]
Charniak, E. 2000. A maximum-entropy-inspired parser. In Proceedings of the 1st Conference of the North American Chapter of the Association for Computational Linguistics (NAACL’00).
[6]
Che, W., Zhang, M., Liu, T., and Li, S. 2006. A hybrid convolution tree kernel for semantic role labeling. In Proceedings of the 44th Annual Meeting of the Association for Computational Linguistics and 21st International Conference on Computational Linguistics (COLING-ACL’06). Sydney, Australia.
[7]
Chieu, H. L. and Ng, H. T. 2003. Named entity recognition with a maximum entropy approach. In Proceedings of the 7th Conference on Natural Language Learning (CoNLL’03). 160--163.
[8]
Collins, M. 1999. Head-driven statistical models for natural language parsing. Ph.D. thesis, Pennsylvania University.
[9]
Collins, M. and Duffy, N. 2001. Convolution kernels for natural language. In Proceedings of the 15th Annual Conference on Neutral Information Processing Systems (NIPS’01).
[10]
Cristianini, N. and Shawe-Taylor, J. 2000. An Introduction to Support Vector Machines. Cambridge University Press, New York.
[11]
Culotta, A. and Sorensen, J. 2004. Dependency tree kernels for relation extraction. In Proceedings of the 42th Annual Meeting of the Association for Computer Linguistics (ACL’04). 423--429.
[12]
Gildea, D. and Jurafsky, D. 2002. Automatic labeling of semantic roles. Comput. Linguist. 28, 3, 245--288.
[13]
Gildea, D. and Palmer, M. 2002. The necessity of parsing for predicate argument recognition. In Proceedings of the 40th Anniversary Meeting of the Association for Computer Linguistics (ACL’02). 239--246.
[14]
Gimenez, J. and Màrquez, L. 2003. Fast and accurate part-of-speech tagging: The svm approach revisited. In Proceedings of the International Conference on Recent Advances in Natural Language (RANLP’03).
[15]
Haussler, D. 1999. Convolution kernels on discrete structures. Tech. Rep. UCSC-CRL-99-10. July.
[16]
Jiang, Z. P., Li, J., and Ng, H. T. 2005. Semantic argument classification exploiting argument interdependence. In Proceedings of the 19th International Joint Conference on Artificial Intelligence (IJCAI’05).
[17]
Joachims, T. 2002. Learning to Classify Text Using Support Vector Machines: Methods, Theory and Algorithms. Kluwer Academic Publishers, Norwell, MA.
[18]
Joachims, T., Cristianini, N., and Shawe-Taylor, J. 2001. Composite kernels for hypertext categorization. In Proceedings of the 18th International Conference on Machine Learning (ICML’01). 250--257.
[19]
Liu, T., Che, W., Li, S., Hu, Y., and Liu, H. 2005. Semantic role labeling system using maximum entropy classifier. In Proceedings of the 9th Conference on Natural Language Learning (CoNLL’05). 189--192.
[20]
Lodhi, H., Saunders, C., Shawe-Taylor, J., Cristianini, N., and Watkins, C. 2002. Text classification using string kernels. J. Mach. Learn. Res. 2, 419--444.
[21]
Marcus, M. P., Marcinkiewicz, M. A., and Santorini, B. 1993. Building a large annotated corpus of English: the penn treebank. Comput. Linguist. 19, 2, 313--330.
[22]
Moschitti, A. 2004. A study on convolution kernels for shallow statistic parsing. In Proceedings of the 42nd Annual Meeting of the Association for Computer Linguistics (ACL’04). 335--342.
[23]
Moschitti, A., Pighin, D., and Basili, R. 2006. Tree kernel engineering in semantic role labeling systems. In Proceedings of the Workshop on Learning Structured Information for Natural Language Applications, 11th International Conference on European Association for Computational Linguistics (EACL’06). Trento, Italy, 49--56.
[24]
Moschitti, A., Pighin, D., and Basili, R. To appear. Tree kernels for semantic role labeling. Comput. Linguist. (forthcoming).
[25]
Moschitti, A., Quarteroni, S., Basili, R., and Manandhar, S. 2006. Exploiting syntactic and shallow semantic kernels for question answer classification. In Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics (ACL’07). Prague, Czech Republic, 776--783.
[26]
Narayanan, S. and Harbabagiu, S. 2004. Question answering based on semantic structures. In Proceedings of the 20th International Conference on Computer Linguistics (COLING’04).
[27]
Nielsen, R. D. and Pradhan, S. 2004. Mixing weak learners in semantic parsing. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP’04).
[28]
Palmer, M., Gildea, D., and Kingsbury, P. 2005. The proposition bank: An annotated corpus of semantic roles. Comput. Linguist. 31, 1, 71--106.
[29]
Ponzetto, S. P. and Strube, M. 2006. Exploiting semantic role labeling, wordnet and wikipedia for coreference resolution. In Proceedings of the Loint Human Language Technology Conference/Annula Meeting of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL’06). New York.
[30]
Pradhan, S., Hacioglu, K., Krugler, V., Ward, W., Martin, J. H., and Jurafsky, D. 2005. Support vector learning for semantic argument classification. Mach. Learn.
[31]
Pradhan, S., Ward, W., Hacioglu, K., Martin, J., and Jurafsky, D. 2005. Semantic role labeling using different syntactic views. In Proceedings of the 43rd Annual Meeting of the Association for Computer Linguistics (ACL’05). 581--588.
[32]
Punyakanok, V., Roth, D., and Tau Yih, W. 2005. The necessity of syntactic parsing for semantic role labeling. In Proceedings of 19th International Joint Conference on Artificial Intelligence (IJCAI’05). 1117--1123.
[33]
Punyakanok, V., Roth, D., Yih, W.-T., and Zimak, D. 2004. Semantic role labeling via integer linear programming inference. In Proceedings of the 20th International Conference on Computer Linguistics (COLING’04). 1346--1352.
[34]
Rifkin, R. and Klautau, A. 2004. In defense of one-vs-all classification. J. Mach. Learn. Res. 5, 101--141.
[35]
Schapire, R. E. and Singer, Y. 1999. Improved boosting algorithms using confidence-rated predictions. Mach. Learn. 37, 3, 297--336.
[36]
Shawe-Taylor, J. and Cristianini, N. 2004. Kernel Methods for Pattern Analysis. Cambridge University Press, New York.
[37]
Shen, D. and Lapata, M. 2007. Using semantic roles to improve question answering. In Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL’07). 12--21.
[38]
Sun, H. and Jurafsky, D. 2004. Shallow semantic parsing of chinese. In Proceedings of the Joint Human Language Technology Conference/Annual Meeting of the North American Chapter of the Association for Computational Linguistics (HLT/NAACL’04).
[39]
Surdeanu, M., Harabagiu, S., Williams, J., and Aarseth, P. 2003. Using predicate-argument structures for information extraction. In Proceedings of the 41st Annual Meeting of the Association for Computer Linguistics (ACL’03).
[40]
Surdeanu, M. and Turmo, J. 2005. Semantic role labeling using complete syntactic analysis. In Proceedings of the 9th Conference on Natural Language Learning (CoNLL’05). Ann Arbor, Michigan.
[41]
Vapnik, V. N. 1998. Statistical learning theory. Wiley.
[42]
Watkins, C. 1999. Dynamic alignment kernels. Tech. Rep. CSD-TR-98-11. January.
[43]
Xue, N. and Kulick, S. 2003. Automatic predicate argument structure analysis of the penn Chinese treebank. In Proceedings of the 10th Machine Translation Summit (MT Summit X).
[44]
Xue, N. and Palmer, M. 2004. Calibrating features for semantic role labeling. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP’04).
[45]
Xue, N. and Palmer, M. 2005. Automatic semantic role labeling for Chinese verbs. In Proceedings of the 19th International Joint Conference on Artificial Intelligence (IJCAI’05).
[46]
Xue, N., Xia, F., Dong Chiou, F., and Palmer, M. 2005. The Penn Chinese Treebank: Phrase structure annotation of a large corpus. Nat. Lang. Eng. 11, 2, 207--238.
[47]
Zelenko, D., Aone, C., and Richardella, A. 2003. Kernel methods for relation extraction. J. Mach. Learn. Res. 3, 1083--1106.
[48]
Zhang, M., Zhang, J., and Su, J. 2006a. Exploring syntactic features for relation extraction using a convolution tree kernel. In Proceedings of the Joint Human Language Technology Conference/Annual Meeting of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL’06). New York City.
[49]
Zhang, M., Zhang, J., Su, J., and Zhou, G. 2006b. A composite kernel to extract relations between entities with both flat and structured features. In Proceedings of the 44th Annual Meeting of the Association for Computational Linguistics and 21st International Conference on Computational Linguistics (COLING/ACL’06). Sydney, Australia.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Asian Language Information Processing
ACM Transactions on Asian Language Information Processing  Volume 7, Issue 4
November 2008
81 pages
ISSN:1530-0226
EISSN:1558-3430
DOI:10.1145/1450295
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 November 2008
Published in TALIP Volume 7, Issue 4

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Semantic role labeling
  2. hybrid convolution tree kernel

Qualifiers

  • Research-article
  • Research
  • Refereed

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)4
  • Downloads (Last 6 weeks)0
Reflects downloads up to 22 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2022)Predicate-attention neural model for Chinese semantic role labelingComputers and Electrical Engineering10.1016/j.compeleceng.2022.10774199(107741)Online publication date: Apr-2022
  • (2022)Chinese Argument Identification Based on BertChinese Lexical Semantics10.1007/978-3-031-28956-9_27(337-350)Online publication date: 14-May-2022
  • (2021)Neural Unsupervised Semantic Role LabelingACM Transactions on Asian and Low-Resource Language Information Processing10.1145/346161320:6(1-16)Online publication date: 30-Nov-2021
  • (2021)Adaptive Convolution for Semantic Role LabelingIEEE/ACM Transactions on Audio, Speech, and Language Processing10.1109/TASLP.2020.304866529(782-791)Online publication date: 2021
  • (2021)Syntax-Tree Similarity for Test-Case Derivability in Software Requirements2021 IEEE International Conference on Software Testing, Verification and Validation Workshops (ICSTW)10.1109/ICSTW52544.2021.00037(162-172)Online publication date: Apr-2021
  • (2018)Sentence Patterns of “ (You)” in Semantic Dependence GraphsChinese Lexical Semantics10.1007/978-3-319-73573-3_3(27-40)Online publication date: 20-Jan-2018
  • (2015)Translating Today into TomorrowComputational Linguistics10.1162/COLI_a_0024041:4(709-718)Online publication date: Dec-2015
  • (2012)A Multidimensional Sequence Approach to Measuring Tree SimilarityIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2010.23924:2(197-208)Online publication date: 1-Feb-2012
  • (2010)Measuring tree similarity for natural language processing based information retrievalProceedings of the Natural language processing and information systems, and 15th international conference on Applications of natural language to information systems10.5555/1894525.1894528(13-23)Online publication date: 23-Jun-2010
  • (2010)A minimum error weighting combination strategy for Chinese semantic role labelingProceedings of the 23rd International Conference on Computational Linguistics10.5555/1873781.1873934(1362-1370)Online publication date: 23-Aug-2010
  • Show More Cited By

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media