ABSTRACT
Data integration aims to facilitate the exploitation of heterogeneous data by providing the user with a unified view of data residing in different sources. Currently, ontologies are commonly used to represent this unified view in terms of a global target schema due to their flexibility and expressiveness. However, most approaches still assume a predefined target schema and focus on generating the mappings between this schema and the sources.
In this paper, we propose a solution that supports data integration tasks by employing semi-automatic ontology construction to create a target schema on the fly. To that end, we revisit existing ontology extraction, matching and merging techniques and integrate them into a single end-to-end system. Moreover, we extend the used techniques with the automatic generation of mappings between the extracted ontologies and the underlying data sources. Finally, to demonstrate the usefulness of our solution, we integrate it with an independent data integration system.
- A. Algergawy, S. Massmann, and E. Rahm. A clustering-based approach for large-scale ontology matching. In Proceedings of ADBIS, pages 415--428. Springer, 2011. Google ScholarDigital Library
- H. Bohring and S. Auer. Mapping XML to OWL ontologies. In Proceedings of 13. Leipziger Informatik-Tage (LIT 2005), pages 147--156, 2005.Google Scholar
- S. Castano, V. De Antonellis, S. De Capitani di Vimercati, and M. Melchiori. Semi-automated extraction of ontological knowledge from XML data sources. In Proceedings of DEXA, pages 852--860. IEEE, 2002. Google ScholarDigital Library
- F. Cerbah. Learning highly structured semantic repositories from relational databases: The RDBToOnto tool. In In Proceedings of ESWC, pages 777--781. Springer, 2008. Google ScholarDigital Library
- I. F. Cruz, F. Palandri Antonelli, and C. Stroe. AgreementMaker: Efficient matching for large real-world schemas and ontologies. VLDB, pages 1586--1589, 2009. Google ScholarDigital Library
- M. Dadjoo and E. Kheirkhah. An approach for transforming of relational databases to OWL ontology. International Journal of Web & Semantic Technology, 6(1), 2015.Google ScholarCross Ref
- J. David, J. Euzenat, F. Scharffe, and C. T. Dos Santos. The Alignment API 4.0. Semantic Web Journal, 2:3--10, 2011. Google ScholarDigital Library
- G. De Giacomo, D. Lembo, M. Lenzerini, and R. Rosati. On reconciling data exchange, data integration, and peer data management. In Proceedings of ACM PODS, pages 133--142. ACM, 2007. Google ScholarDigital Library
- C. P. De Laborda and S. Conrad. Relational. OWL - a data and schema representation format based on OWL. In Proceedings of CRPIT, pages 89--96. Australian Computer Society, 2005. Google ScholarDigital Library
- S. Dessloch, M. A. Hernández, R. Wisnesky, A. Radwan, and J. Zhou. Orchid: Integrating schema mapping and etl. In Proceedings of ICDE, pages 1307--1316. IEEE, 2008. Google ScholarDigital Library
- A. Doan, J. Madhavan, P. Domingos, and A. Halevy. Learning to map between ontologies on the semantic web. In WWW2002. ACM, 2002. Google ScholarDigital Library
- R. Fagin, L. M. Haas, M. Hernández, R. J. Miller, L. Popa, and Y. Velegrakis. Clio: Schema mapping creation and data exchange. pages 198--236, 2009.Google Scholar
- R. Fagin, P. G. Kolaitis, R. J. Miller, and L. Popa. Data exchange: Semantics and query answering. Theoretical Computer Science, 336:89--124, 2005. Google ScholarDigital Library
- M. Georgiu and A. Groza. Ontology enrichment using semantic wikis and design patterns. Studia Universitatis Babes-Bolyai, Informatica, 56(2), 2011.Google Scholar
- M. Granitzer, V. Sabol, K. W. Onn, D. Lukose, and K. Tochtermann. Ontology alignment - a survey with focus on visually supported semi-automatic techniques. Future Internet, 2:238--258, 2010.Google ScholarCross Ref
- W. Hu, N. Jian, Y. Qu, and Y. Wang. GMO: A graph matching for ontologies. In Proceedings of K-Cap 2005 Workshop on Integrating Ontologies, pages 43--50, 2005.Google Scholar
- W. Hu, Y. Qu, and G. Cheng. Matching large ontologies: A divide-and-conquer approach. Data and Knowledge Engineering, 67:140--160, 2008. Google ScholarDigital Library
- Y. R. Jean-Mary, E. P. Shironoshita, and M. R. Kabuka. Ontology matching with semantic verification. Journal of Web Semantics, 7:235--251, 2009. Google ScholarDigital Library
- P. Jovanovic, O. Romero, A. Simitsis, A. Abelló, H. Candón, and S. Nadal. Quarry: Digging up the gems of your data treasury. In Proceedings of EDBT, pages 549--552. OpenProceedings, 2015.Google Scholar
- P. Lambrix and H. Tan. SAMBO - a system for aligning and merging biomedical ontologies. Journal of Web Semantics, 4:196--206, 2006. Google ScholarDigital Library
- M. Lenzerini. Data integration: A theoretical perspective. In Proceedings of ACM PODS, pages 233--246. ACM, 2002. Google ScholarDigital Library
- J. Li, J. Tang, Y. Li, and Q. Luo. RiMOM - a dynamic multistrategy ontology alignment framework. IEEE TKDE, 21:1218--1232, 2009. Google ScholarDigital Library
- L. Lubyte and S. Tessaris. Automatic extraction of ontologies wrapping relational data sources. In Proceedings of DEXA, volume 5690, pages 128--142. Springer, 2009. Google ScholarDigital Library
- W. Mallede, F. Marir, and V. Vassilev. Algorithms for mapping RDB schema to RDF for facilitating access to deep web. In Proceedings of WEB. IARIA XPS Press, 2013.Google Scholar
- G. Petasis, V. Karkaletsis, G. Paliouras, A. Krithara, and E. Zavitsanos. Ontology population and enrichment: State of the art. Knowledge-Driven Multimedia Information Extraction and Ontology Evolution, pages 134--166, 2011. Google ScholarCross Ref
- J. Sequeda. On the semantics of R2RML and its relationship with the direct mapping. In Proceedings of the International Semantic Web Conference, pages 193--196. CEUR-WS, 2013.Google Scholar
- P. Shvaiko and E. Ome Euzenat. Ontology matching: State of the art and future challenges. IEEE TKDE, 2011.Google Scholar
- G. Stoilos, G. Stamou, and S. Kollias. A string metric for ontology alignment. In Proceedings of ISWC, pages 624--637. Springer, 2005. Google ScholarDigital Library
- F. M. Suchanek, S. Abiteboul, and P. Senellart. PARIS: Probabilistic alignment of relations, instances, and schema. VLDB, 5:157--168, 2011. Google ScholarDigital Library
- S. Wang, Y. Zeng, and N. Zhong. Ontology extraction and integration from semi-structured data. In Proceedings of AMT, pages 39--48. Springer, 2011. Google ScholarDigital Library
- H. Zhang, W. Hu, and Y. Qu. Constructing virtual documents for ontology matching using MapReduce. In Proceedings of JIST, pages 48--63. Springer-Verlag, 2012. Google ScholarDigital Library
Index Terms
- Supporting Data Integration Tasks with Semi-Automatic Ontology Construction
Recommendations
A Framework for Ontology-Based Data Integration
ICICSE '08: Proceedings of the 2008 International Conference on Internet Computing in Science and EngineeringThis paper presents an ontology-based data integration framework that is capable of deriving an ontology from a collection of XML schemas in a semi-automatic manner and integrating heterogeneous XML sources at the semantic level. The ontology in our ...
An ontology data matching method for web information integration
iiWAS '08: Proceedings of the 10th International Conference on Information Integration and Web-based Applications & ServicesThe emerging Semantic Web relies on the development of ontologies and the deployment of data annotated by ontologies. For a certain domain with a suitable ontology developed, its ontology annotated data (or simply ontology data) from different sources ...
Appropriate global ontology construction: a domain-reference-ontology based approach
iiWAS '11: Proceedings of the 13th International Conference on Information Integration and Web-based Applications and ServicesData integration involves combining data residing in different sources and providing users with a unified view of these data through what is called a "global schema". We address here the problem of automatic construction of this global schema in the ...
Comments