ABSTRACT
The use of semantic technologies is gaining significant traction in science communication, with a wide array of applications in disciplines including the life sciences, computer science, and the social sciences. Languages like RDF, OWL, and other formalisms based on formal logic are applied to make scientific knowledge accessible not only to human readers but also to automated systems. These approaches have mostly focused on the structure of scientific publications themselves, on the scientific methods and equipment used, or on the structure of the datasets used. The core claims or hypotheses of scientific work have only been covered in a shallow manner, such as by linking mentioned entities to established identifiers. In this research, we therefore want to find out whether we can use existing semantic formalisms to fully express the content of high-level scientific claims using formal semantics in a systematic way. Analyzing the main claims from a sample of scientific articles from all disciplines, we find that their semantics are more complex than what a straightforward application of formalisms like RDF or OWL accounts for, but we managed to elicit a clear semantic pattern, which we call the "super-pattern". We show here how the instantiation of the five slots of this super-pattern leads to a strictly defined statement in higher-order logic. We successfully applied this super-pattern to an enlarged sample of scientific claims. We show that knowledge representation experts, when instructed to independently instantiate the super-pattern with given scientific claims, show a high degree of consistency and convergence given the complexity of the task and the subject. These results therefore open the door, in the longer run, to allowing researchers to express their high-level scientific findings in a manner that can be automatically interpreted. This in turn will allow for automated consistency checking, question answering, aggregation, and much more.
Index Terms
- Expressing High-Level Scientific Claims with Formal Semantics