Data Curation Tool

This is a standalone desktop application which helps its users transform existing health datasets into HL7 FHIR. The tool acts as an ETL tool, but specialized to transform data into the widely-used and well-established health data standard, called HL7 FHIR. It can read data from Excel/CSV files and/or PostgreSQL databases. Previously, I presented a rough architecture of the Data Curation Tool in this scientific journal paper.

The tool communicates with a valid FHIR endpoint, helps the users configure mappings from source data models into FHIR resources including the code system mappings, validates them against the rules exposed by the FHIR endpoint and them batch-transforms the source data into FHIR resources according to the mapping configurations, and finally puts them into the given FHIR endpoint. A video tutorial of the tool can be found here. The source code is open and can be found on GitHub together with installation releases.

A screenshot from the Data Curation Tool where the user maps source data fields to FHIR Resource elements. In this example, a FHIR Observation resource is filled with the columns of a selected Excel sheet.

I am the lead software architect of the Data Curation Tool. While the heavy burden of the coding was on the shoulders of my junior colleagues, I did lead the development while always keeping an eye on the source code by handling each and every pull request. We developed it with Electron framework which combines the Chromium rendering engine and the NodeJS runtime. We implemented with TypeScript under VueJS and used the Quasar framework for the graphical components. The full tech stack can be found on stackshare.


FAIR4Health Data Curation & Validation Tool