skip to main content
10.1145/1183550.1183566acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
Article

An architecture for creating collaborative semantically capable scientific data sharing infrastructures

Published: 10 November 2006 Publication History

Abstract

Increasingly, scientists are seeking to collaborate and share data among themselves. Such sharing is can be readily done by publishing data on the World-Wide Web. Meaningful querying and searching on such data depends upon the availability of accurate and adequate metadata that describes the data and the sources of the data. In this paper, we outline the architecture of an implemented cyber-infrastructure for chemistry that provides tools for users to upload datasets and their metadata to a database. Our proposal combines a two level metadata system with a centralized database repository and analysis tools to create an effective and capable data sharing infrastructure. Our infrastructure is extensible in that it can handle data in different formats and allows different analytic tools to be plugged in.

References

[1]
Bairoch, A., Apweiler, R. The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Research, vol. 28, no. 1, pp. 45--48, 2000.
[2]
Bateman, A., Coin L., et al. The Pfam protein families database. Nucleic Acids Research, vol. 30, no. 1, pp. 276--280, 2002.
[3]
Bouganim, L. et al. The Ecobase Project: Database and Web Technologies for Environmental Information Systems. ACM SIGMOD Record, vol. 30, no. 3, pp. 70--75, 2001.
[4]
Buneman, P., Khanna, S., Tajima, K., Tan, W.C. Archiving scientific data. ACM Transactions on Database Systems, vol. 29, no. 1, pp. 2--42, 2004.
[5]
Chervenak, A., Foster, I., Kesselman, C., Salisbury, C., Tuecke, S., The Data Grid: Towards an architecture for the distributed management and analysis of large scientific datasets. Journal of Network and Computer Applications, vol. 23, No. 3, pp. 187--200, July 2000.
[6]
Deutsch, A., Fernadez, M., Suciu, D. Storing Semistructured data with STORED. In Proceedings of the ACM SIGMOD international conference on Management of data. 1999.
[7]
Dongilli, P., Franconi, E., Tessaris, S. Semantics driven support for query formulation. In Proceedings of the International Workshop on Description Logics (DL), vol. 104, Whistler, BC, Canada, June 2004.
[8]
Dublin Core Qualifiers. Dublin Core Metadata Initiative, 2000.
[9]
Hamosh, A., Scott, A. F., Amberger, J., Bocchini, C., Valle, D., McKusick, V. A., Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders. Nucleic Acids Research, vol. 30, no. 1, pp. 52--55. 2002.
[10]
OAI -- Protocol for Metadata Harvesting. Open Archives Initiative. http://www.openarchives.org/. 2001.
[11]
Shosani, A., Bernado, L. M., Nordberg, H., Rotem, D., Sim, A. Storage management for high energy physics applications. In Computing in High Energy Physics (CHEP). 1998.

Cited By

View all
  • (2020)Supporting visual analytics in decision support systemProceedings of the 19th Brazilian Symposium on Human Factors in Computing Systems10.1145/3424953.3426483(1-10)Online publication date: 26-Oct-2020
  • (2013)Data sharing in the sciencesAnnual Review of Information Science and Technology10.1002/aris.2011.144045011345:1(247-294)Online publication date: 2-Jan-2013
  • (2011)Data sharing in the sciencesAnnual Review of Information Science and Technology10.5555/2766865.276687845:1(247-294)Online publication date: 1-Jan-2011
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
WIDM '06: Proceedings of the 8th annual ACM international workshop on Web information and data management
November 2006
102 pages
ISBN:1595935258
DOI:10.1145/1183550
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 10 November 2006

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. architecture for cyber-infrastructures
  2. inter-operation
  3. research dataset integration
  4. scientific databases

Qualifiers

  • Article

Conference

CIKM06
Sponsor:
CIKM06: Conference on Information and Knowledge Management
November 10, 2006
Virginia, Arlington, USA

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 19 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2020)Supporting visual analytics in decision support systemProceedings of the 19th Brazilian Symposium on Human Factors in Computing Systems10.1145/3424953.3426483(1-10)Online publication date: 26-Oct-2020
  • (2013)Data sharing in the sciencesAnnual Review of Information Science and Technology10.1002/aris.2011.144045011345:1(247-294)Online publication date: 2-Jan-2013
  • (2011)Data sharing in the sciencesAnnual Review of Information Science and Technology10.5555/2766865.276687845:1(247-294)Online publication date: 1-Jan-2011
  • (2011)Data sharing in networked environmentsProceedings of the 5th WSEAS international conference on Communications and information technology10.5555/2028497.2028537(207-213)Online publication date: 14-Jul-2011
  • (2010)Web Syndication Approaches for Sharing Primary Data in "Small Science" DomainsData Science Journal10.2481/dsj.009-0129(42-53)Online publication date: 2010
  • (2010)Keyword search across databases and documentsProceedings of the 2nd International Workshop on Keyword Search on Structured Data10.1145/1868366.1868368(1-6)Online publication date: 6-Jun-2010
  • (2009)An investigation in applying image retrieval techniques to X-ray engineering picturesProceedings of the 8th WSEAS international conference on Artificial intelligence, knowledge engineering and data bases10.5555/1553921.1553941(73-78)Online publication date: 21-Feb-2009
  • (2007)Measuring referential integrity in distributed databasesProceedings of the ACM first workshop on CyberInfrastructure: information management in eScience10.1145/1317353.1317367(61-66)Online publication date: 9-Nov-2007
  • (2007)Metadata management for federated databasesProceedings of the ACM first workshop on CyberInfrastructure: information management in eScience10.1145/1317353.1317361(31-38)Online publication date: 9-Nov-2007
  • (2007)ChemXSeerProceedings of the ACM first workshop on CyberInfrastructure: information management in eScience10.1145/1317353.1317356(7-10)Online publication date: 9-Nov-2007

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media