Experiences of Transforming a Complex Nephrologic Care and Research Database into i2b2 Using the IDRT Tools

Publikation: Beitrag in FachzeitschriftForschungsartikelBeigetragenBegutachtung

Beitragende

  • Christian Maier - , Friedrich-Alexander-Universität Erlangen-Nürnberg (Erstautor:in)
  • Jan Christoph - , Friedrich-Alexander-Universität Erlangen-Nürnberg (Autor:in)
  • Danilo Schmidt - , Charité – Universitätsmedizin Berlin (Autor:in)
  • Thomas Ganslandt - , Friedrich-Alexander-Universität Erlangen-Nürnberg (Autor:in)
  • Hans-Ulrich Prokosch - , Friedrich-Alexander-Universität Erlangen-Nürnberg (Autor:in)
  • Stefan Kraus - , Friedrich-Alexander-Universität Erlangen-Nürnberg (Autor:in)
  • Martin Sedlmayr - , Friedrich-Alexander-Universität Erlangen-Nürnberg (Letztautor:in)

Abstract

The secondary use of data from electronic medical records has become an important factor to determine and to identify various causes of disease. For this reason, applications like informatics for integrating biology and the bedside (i2b2) offer a GUI-based front end to select patient cohorts. To make use of those tools, however, clinical data need to be extracted from the Electronic Health Record (EHR) system and integrated into the data schema of i2b2. We used TBase, a documentation system for nephrologic transplantations, as a source system and applied the Integrated Data Repository Toolkit (IDRT) for the Extract, Transform, and Load (ETL) process to load the data into i2b2. Since i2b2 uses an entity-attribute-value (EAV) schema, which is a fundamentally different way of modeling data in comparison to a standard relational schema in TBase, we evaluated if (a) the data relationship of the source system entities can still be represented in the i2b2 schema and if (b) the IDRT is a suitable solution for loading the data of a comprehensive data schema like TBase into i2b2. For that reason, we identified entities in the TBase data schema which were relevant for answering questions on cohort identification. By doing so, we found out that the entities had different structures that needed to be handled differently for the ETL process. Furthermore, the use of IDRT revealed shortcomings with regard to large input data and specific data structures that are part of most modern EHR systems. However, this project also showed that our way of modeling the TBase data in i2b2 has been proven to be successful in terms of answering the most common questions of clinicians on cohort identification.

Details

OriginalspracheEnglisch
Aufsatznummer5640685
FachzeitschriftJournal of healthcare engineering
Jahrgang2019
Ausgabenummer1
PublikationsstatusVeröffentlicht - 17 Jan. 2019
Peer-Review-StatusJa
Extern publiziertJa

Externe IDs

Scopus 85060830028

Schlagworte