ABSTRACT
Data-driven syntactic parsers are usually trained, tested and developed on web-news data. Little has been done to evaluate them on literary genres of different ages, which are still low-resource varieties in terms of syntactic annotation. In this paper, I will describe methodology and results concerning first experiments in testing two different kinds of dependency parsers on aesthetic writings by Schiller and F. Schlegel. First, I trained and tested the parsers on de-ud1.2, a treebank collecting German web-news texts. Second, I manually annotated excerpts by the two authors with syntactic metadata. Third, I tested the parsers on these excerpts, after training them on de-ud1.2
- Gildea, D. 2001. Corpus variation and parser performance. In Proceedings of the 2001 Conference on Empirical Methods in Natural Language Processing (pp. 167--202).Google Scholar
- Hammond, A., Brooke, J., & Hirst, G. 2013. A tale of two cultures: Bringing literary analysis and computational linguistics together. In Proceedings of the 2nd Workshop on Computational Literature for Literature (CLFL'13), Atlanta, (2013, June).Google Scholar
- Plank, Barbara, 2011, Domain adaptation for parsing. University Library Groniongen][Host]Google Scholar
- Freschi, M. 1998, Storia della Civiltà Letteraria Tedesca, Volume Secondo: Ottocento e Novecento, Utet, Milano.Google Scholar
- Mazza, D. 2013, La Lingua Tedesca: Storia e Testi, Collana Antologie, Carocci Editore, RomaGoogle Scholar
- Griffero, T. 2012, Storia dell'Estetica Moderna, Edizioni Nuova Cultura, Cross Roads, Annoı, n.1, Roma.Google Scholar
- Tesnière, L. 1959. Eléments de syntaxe structurale. Librairie C. Klincksieck.Google Scholar
- Nivre, J. 2005. Dependency grammar and dependency parsing. MSI report, 5133(1959), 1--32.Google Scholar
- Straka, M., Hajic, J., & Straková, J. 2016. Udpipe: Trainable pipeline for processing conll-u files performing tokenization, morphological analysis, pos tagging and parsing. In Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), Portorož, Slovenia (pp. 4290--4297).Google Scholar
- Nivre, J., Hall, J., Nilsson, J., Chanev, A., Eryigit, G., Kübler, S., ... & Marsi, E. 2007. MaltParser: A language-independent system for data-driven dependency parsing. Natural Language Engineering, 13(02), 95--135.Google ScholarCross Ref
- Bohnet, B. 2010. Very high accuracy and fast dependency parsing is not a contradiction. In: Proceedings of the 23rd International Conference on Computational Linguistics. Association for Computational Linguistics, 2010, p. 89--97.Google ScholarDigital Library
- Buchholz, S., & Marsi, E. 2006. CoNLL-X shared task on multilingual dependency parsing. In Proceedings of the Tenth Conference on Computational Natural Language Learning (2006, June) (pp. 149--164). Association for Computational Linguistics. Google ScholarCross Ref
- Joost van Rijsbergen, C. 1975, Information retrieval. Butterworths, London (UK).Google Scholar
- Nivre, J et al. 2015, Universal Dependencies 1.2.Google Scholar
- De Marneffe, M. C., Dozat, T., Silveira, N., Haverinen, K., Ginter, F., Nivre, J., & Manning, C. D. 2014. Universal Stanford dependencies: A cross-linguistic typology. In LREC (2014, May) (Vol. 14, pp. 4585--92).Google Scholar
- Petrov, S., Das, D., & McDonald, R. 2011. A universal part-of-speech tagset. arXiv preprint arXiv:1104.2086.Google Scholar
- Hajič, J., Ciaramita, M., Johansson, R., Kawahara, D., Martí, M. A., Màrquez, L., ... & Straňák, P. 2009, June. The CoNLL-2009 shared task: Syntactic and semantic dependencies in multiple languages. In Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task (pp. 1--18). Association for Computational Linguistics. Google ScholarCross Ref
- Nilsson, J., & Nivre, J. 2008. MaltEval: an Evaluation and Visualization Tool for Dependency Parsing. In LREC. (2008, May).Google Scholar
- Ballesteros, M., Nivre, J. 2012. MaltOptimizer: A System for MaltParser Optimization. In: LREC. 2012. p. 2757--2Google Scholar
- Björkelund, A., Bohnet, B., Hafdell, L., and Nugues, P. 2010. A high-performance syntactic and semantic dependency parser. In Coling 2010: Demonstration Volume, pages 33--36, Beijing, August 23-27 2010.Google Scholar
Index Terms
- Dependency Parsing on Late-18th-Century German Aesthetic Writings: A Preliminary Inquiry into Schiller and F. Schlegel
Recommendations
I3rab: A New Arabic Dependency Treebank Based on Arabic Grammatical Theory
Treebanks are valuable linguistic resources that include the syntactic structure of a language sentence in addition to part-of-speech tags and morphological features. They are mainly utilized in modeling statistical parsers. Although the statistical ...
Undirected Dependency Parsing
Dependency parsers, which are widely used in natural language processing tasks, employ a representation of syntax in which the structure of sentences is expressed in the form of directed links dependencies between their words. In this article, we ...
Improving Telugu Dependency Parsing using Combinatory Categorial Grammar Supertags
We show that Combinatory Categorial Grammar (CCG) supertags can improve Telugu dependency parsing. In this process, we first extract a CCG lexicon from the dependency treebank. Using both the CCG lexicon and the dependency treebank, we create a CCG ...
Comments