ABSTRACT
The semantic annotation of textual content is important for the success of many information integration systems. In this paper, we present a method for generating semantic annotations about transfer in football news. Semantic web technologies are applied to extend the knowledge base of KIM platform so that named entities in this specific domain could be recognized. We study and define language models to recognize the semantic relation for football transfers. At the same time, we propose a pronoun recognition method using extraction rules to improve the relation recognition process. The experiment showed promising results on the data set built from Sky Sports news [27]. The precisions achieved in both cases, with and without integration of the pronoun recognition method, are both over 80%. In particular, the latter helps increase the recall value to around 10%.
- Abacha, A. B., Zweigenbaum, P.: "Automatic extraction of semantic relations between medical entities: a rule based approach", Fourth International Symposium on Semantic Mining in Biomedicine (SMBM) Hinxton, UK. 25-26 October 2010.Google Scholar
- Bannour, S., Audibert, L., Soldano, H. Ontology-based semantic annotation: an automatic hybrid rule-based method. In Proceedings of the BioNLP Shared Task 2013 Workshop, pages 139--143, Sofia, Bulgaria, August 2013. Association for Computational Linguistics.Google Scholar
- Berners-Lee, T., Hendler J., Lassila, O. 2001. "The Semantic Web", Scientific American Magazine, May 17, 2001.Google ScholarCross Ref
- Buitelaar, P., Cimiano, P., Weber, N: Ontology Learning and Population in SmartWeb. Philips Symposium on Intelligent Algorithms (SOIA), Netherlands, 2006.Google Scholar
- Chen, C. M., Chen, L.H.: A Novel Approach for Semantic Event Extraction from Sports Webcast Text. Multimedia Tools and Applications, Vol.71, pp. 1937--1952. (SCI) (NSC 100-2221-E-009-140-MY2). Google ScholarDigital Library
- Cimino, J., Barnett, G.: Automatic knowledge acquisition from MEDLINE. Methods of Information in Medicine; 32(2) 1993, 120--130.Google Scholar
- Cimiano, P., S. Handschuh, and S. Staab:Towards the Self-Annotating Web. Proceedings of the 13th International World Wide Web Conference (WWW 2004), 2004. Google ScholarDigital Library
- Dill, S., Gibson, N., Gruhl, D., Guha, R., Jhingran, A., Kanungo, T., Rajagopalan, S., Tomkins, A., Tomlin, J.A. and Zien, J.Y., SemTag and Seeker: Bootstrapping the semantic web via automated semantic annotation, in Twelfth International World Wide Web Conference, (Budapest, Hungary, 2003), 178--186. Google ScholarDigital Library
- Embarek, M. Ferret, O.: Learning patterns for building resources about semantic relations in the medical domain. In: proceedings of the Language Resources and Evaluation Conference 2008 (May 2008).Google Scholar
- GATE: https://gate.ac.uk/.Google Scholar
- Gruber, T. R.: A translation approach to portable ontology specifications. Knowledge Acquisition, 5(2):199--220, 1993. Google ScholarDigital Library
- Handschuh, S., Stabb, S., Maedche, A.:CREAM - Creating relational metadata with a component based, ontology-driven annotation framework. In Proceedings of K-Cap 2001, pages 76-- 83. ACM Press. Google ScholarDigital Library
- Harrington, B., Clark, S.: Asknet: Creating and Evaluating Large Scale Integrated Semantic Networks. Int. J. Semantic Computing 2(3): 343--364 (2008).Google Scholar
- JAPE: http://gate.ac.uk/sale/tao/splitch8.html.Google Scholar
- Dung T, Kameyama W.A.: Proposal of ontology-based health care information extraction system: VnHIES. IEEE International Conference on Research, Innovation and Vision for the Future (RIVF 2007); 2007 Hanoi, Vietnam;Google ScholarCross Ref
- Kobilarov, G., Scott, T., Raimond, Y., Oliver, S., Sizemore, C., Smethurst, M., Bizer, C., and Lee, R: Media meets Semantic Web -- How the BBC uses DBpedia and Linked Data to make Connections. ESWC 2009 Heraklion Proceedings of the 6th European Semantic Web Conference on The Semantic Web: Research and Applications, SpringerVerlag Berlin, Heidelberg, Heraklion, Greece, 31 May -- 4 June 2009, pp. 723--737. Google ScholarDigital Library
- Kogut, P., Holmes, W. AeroDAML: applying information extraction to generate DAML annotations from web pages. In First International Conference on Knowledge Capture (K-CAP 2001), Workshop on Knowledge Markup and Semantic Annotation, Victoria (2001).Google Scholar
- Lee, C., Khoo, C., Na, J.:Automatic identification of treatment relations for medical Ontology learning: An exploratory study. Proceedings of the Eighth International ISKO Conference 2004, 245--250.Google Scholar
- Liang, T., Wu, D.S: Automatic Pronominal Anaphora Resolution in English Texts. International journal of Computaional Linguistics and Chinese Language Processing. (2004) 1--20.Google Scholar
- Markovski, A., Jovanovik, M., Trajanov, D.: Web Extensions for Semantic Data Creation. In 9th International Conference for Informatics and Information Technology, 2012.Google Scholar
- Mendes, Pablo, Jakob, M., Bizer, C. (2012): DBpedia:A Multilingual Cross-domain Knowledge Base. Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12). 1813--1817.Google Scholar
- Muthu lakshmi, S., Uma, G. V.: Semantic Web based e-Learning System for Sports Domain. International Journal of Computer Applications (0975 - 8887) Volume 8-No.14, October 2010.Google Scholar
- Nguyen, D. P. T., Matsuo, Y., Ishizuka, M.: Exploiting Syntactic and Semantic Information for Relation Extraction from Wikipedia. IJCAI Workshop on Text-Mining & Link-Analysis (TextLink 2007), 2007.Google Scholar
- Popov B., Kiryakov A., Kirilov A., Manov D., Ognyanoff D., Goranov M.:KIM - Semantic Annotation Platform. 2nd International Semantic Web Conference (ISWC2003), Florida, USA, 2003, pp. 834--849.Google Scholar
- Qiu, L., Kan, M.Y., Chua, T.S.: A Public Reference Implementation of the RAP Anaphora Resolution Algorithm. In: proceedings of the Language Resources and Evaluation Conference 2004 (LREC 2004), Lisbon, Portugal, pp. 291--294 (2004).Google Scholar
- Rayfield, Jem (2012): Sports Refresh: Dynamic Semantic Publishing. In: http://www.bbc.co.uk/blogs/bbcinternet/2012/04/sports_dynamic_semantic.html, visited April 20, 2012.Google Scholar
- Sky Sports:http://www1.skysports.com/transfer-centre/.Google Scholar
- Slimani, T.: Semantic Annotation: The Mainstay of Semantic Web. International Journal of Computer Applications Technology and Research, Volume 2, Issue 6, 763--770, 2013.Google Scholar
- Sun, Le., Han, X.: A Feature-Enriched Tree Kernel for Relation Extraction. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL 2014), pages 61--67, Baltimore, Maryland, USA, June 23-25 2014.Google Scholar
- Tuan-Dung, C., Quang-Minh, N., Hoang-Cong, N., Hagino, T.: Towards efficient sport data integration through semantic annotation. Proceeding of The Fourth International Conference on Knowledge and Systems Engineering KSE 2012, pp. 99--106, ISBN 978-1-4673-2171-6, Da Nang Viet Nam, August, 2012. Google ScholarDigital Library
- Tymoshenko, K., Giuliano, C.:FBK-IRST: Semantic Relation Extraction using Cyc.In Proceedings of the 5th International Workshop on Semantic Evaluation (SemEval-2010), pp. 214--217, Uppsala, Sweden, 2010. Google ScholarDigital Library
- Zemanta api:http://developer.zemanta.com/.Google Scholar
Index Terms
- Automatic creation of semantic data about football transfer in sport news
Recommendations
The semantic annotated documents: from HTML to the semantic web
CEA'07: Proceedings of the 2007 annual Conference on International Conference on Computer Engineering and ApplicationsThe current circumstance of the Semantic Web is that there is not much of a Semantic Web due to the lack of annotated web pages. There is such a lack because annotating web pages currently does not provide much practical benefit. In this work an ...
Towards Efficient Sport Data Integration through Semantic Annotation
KSE '12: Proceedings of the 2012 Fourth International Conference on Knowledge and Systems EngineeringIn news genre, the sport domain is one of great interest to audiences on many occasions. The explosion of Internet these days leads to many obstacles for users in searching information due to enormous amount of data collected from multiple media ...
Semantic annotation of geodata based on linked-open data
MEDES '15: Proceedings of the 7th International Conference on Management of computational and collective intElligence in Digital EcoSystemsThere are several research works using web semantic to improve integration and retrieval of geospatial data provided by spatial data infrastructures (SDI) and geoportals. However, an important open issue that deserves more research efforts is how to ...
Comments