ABSTRACT
We present a software architecture that federates data from multiple heterogeneous health informatics data sources owned by multiple organizations. The architecture builds upon state-of-the-art open-source Java and XML frameworks in innovative ways. It consists of (a) a federated query engine, which manages federated queries and result set aggregation; and (b) data source facades, which translate the physical data models into a common model on the fly. System modules are connected via reusable Apache Camel integration routes and deployed to an OSGi enterprise service bus. We demonstrate the system using mock implementations; integration with live data sources is in progress. The system can be extended and integrated with existing SOA healthcare and HL7 frameworks such as i2b2 and caGrid.
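The split between the query engine and the data source facades can be illustrated with a minimal sketch. It is not the authors' implementation: it uses plain Java and in-memory rows rather than Apache Camel routes or an OSGi bus, and every name in it (`Patient`, `DataSourceFacade`, `facade`, `federatedQuery`, the two mock row formats) is a hypothetical chosen for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.Function;
import java.util.stream.Collectors;

public class FederatedQuerySketch {

    /** Common-model record shared by all sources (hypothetical shape). */
    static final class Patient {
        final String source;
        final String id;
        final int age;

        Patient(String source, String id, int age) {
            this.source = source;
            this.id = id;
            this.age = age;
        }

        @Override
        public String toString() {
            return source + ":" + id + "(" + age + ")";
        }
    }

    /** Facade: hides one physical data model behind the common model. */
    interface DataSourceFacade {
        List<Patient> query(int minAge);
    }

    /** Builds a facade over in-memory rows; translate maps one physical row to a Patient. */
    static <R> DataSourceFacade facade(List<R> rows, Function<R, Patient> translate) {
        return minAge -> rows.stream()
                .map(translate)                  // translation happens at query time
                .filter(p -> p.age >= minAge)
                .collect(Collectors.toList());
    }

    /** Engine: sends the same query to every facade and aggregates the result sets. */
    static List<Patient> federatedQuery(List<DataSourceFacade> facades, int minAge) {
        List<Patient> merged = new ArrayList<>();
        for (DataSourceFacade f : facades) {
            merged.addAll(f.query(minAge));
        }
        return merged;
    }

    /** Two mock sources with different physical models, queried for patients aged 18+. */
    static List<Patient> demo() {
        // Mock source A stores "id,ageInYears" strings.
        List<String> rowsA = Arrays.asList("a1,34", "a2,17");
        // Mock source B stores {numericId, ageInMonths} arrays.
        List<int[]> rowsB = Arrays.asList(new int[] {7, 600}, new int[] {8, 120});

        DataSourceFacade a = facade(rowsA, r -> {
            String[] parts = r.split(",");
            return new Patient("A", parts[0], Integer.parseInt(parts[1]));
        });
        DataSourceFacade b = facade(rowsB,
                r -> new Patient("B", String.valueOf(r[0]), r[1] / 12));

        return federatedQuery(Arrays.asList(a, b), 18);
    }

    public static void main(String[] args) {
        System.out.println(demo());
    }
}
```

Because the engine depends only on the facade interface, supporting another source in this sketch means writing one more translation function; the aggregation code stays unchanged.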
Index Terms
- Federated querying architecture for clinical & translational health IT