eMERGE Phase IV Clinical Center at Partners HealthCare

  • Funded by National Institutes of Health (NIH)
  • Total publications:6 publications

Grant number: 3U01HG008685-05S2

Grant search

Key facts

  • Disease

    COVID-19
  • Start & end year

    2020
    2025
  • Known Financial Commitments (USD)

    $883,131
  • Funder

    National Institutes of Health (NIH)
  • Principal Investigator

    Elizabeth W Karlson
  • Research Location

    United States of America
  • Lead Research Institution

    Brigham And Women'S Hospital
  • Research Priority Alignment

    N/A
  • Research Category

    Clinical characterisation and management

  • Research Subcategory

    Prognostic factors for disease severity

  • Special Interest Tags

    Data Management and Data Sharing

  • Study Type

    Non-Clinical

  • Clinical Trial Details

    N/A

  • Broad Policy Alignment

    Pending

  • Age Group

    Unspecified

  • Vulnerable Population

    Unspecified

  • Occupations of Interest

    Unspecified

Abstract

The Coronavirus Disease 2019 (COVID-19) pandemic has caught the world off guard, reshaping ways of life,the economy, and healthcare delivery. Data in electronic health records (EHRs) should be widely available to study COVID-19 but have not yet been effectively shared across clinical sites, with public health agencies, or with policy makers. There are several large, national and international projects to build informatics infrastructureto analyze the EHR data of patients with COVID-19. However, aggregating data from multiple EHRs only works if you can trust the final results. This means being able to go back to each site and talk to the people who know the data best, to understand the local clinical guidelines, coding practices, data quality problems, and other factors that affect the data. In March, 2020, we launched an international effort called the Consortium for Clinical Characterization of COVID-19 by EHR (4CE). It brings together more than 100 informatics experts, statisticians,and ICU doctors from around the world. The novel aspect of 4CE is that we recognize the complexities of EHR data and the need to directly involve the local data experts, not only in the data collection, but also in the development of research questions and the data analyses. We try to move fast, believing that early intelligenceis worth more than complete intelligence later. To do this, we avoid roadblocks that typically slow down informatics projects, such as building or installing new software, or the regulatory hurdles involved in sharing patient-level data. Instead, we ask participating sites to run analyses locally, using simple existing tools, likeSQL, R, and Python scripts, and only share aggregate counts and statistics centrally with the rest of the 4CEconsortium. We review and validate the data as a group, identify and fix data quality problems, and ask sites to repeat the analyses until everything is right. Through multiple cycles of data verification, we iteratively clean up the data and gain confidence that the findings we are seeing are real. Because we can do this quickly, we go from research question to results in just a few weeks. This proposed project will "productize" the 4CE approach, through three Specific Aims: (1) Transition 4CE to "Phase 2", where sites will begin more complex local analyses.We will develop Phase 2 analysis scripts; update our data upload, validation, and visualization websites; and,test the Phase 2 scripts at three sites before expanding to the rest of the consortium. (2) Demonstrate and evaluate 4CE through two use cases. We will refine and validate an algorithm for identifying COVID-19 patients with "severe" disease and use 4CE to characterize central nervous system complications in COVID-19. (3) Develop a plan for integrating with complementary efforts and long-term sustainability. As part of this, we willcreate a guide that shows sites how to use 4CE data extracts and quality checks to support other COVID-19 informatics projects, including the generation of OMOP files.

Publicationslinked via Europe PMC

Informative missingness: What can we learn from patterns in missing laboratory data in the electronic health record?

Acute respiratory distress syndrome after SARS-CoV-2 infection on young adult population: International observational federated study based on electronic health records through the 4CE consortium.

International electronic health record-derived post-acute sequelae profiles of COVID-19 patients.

Changes in laboratory value improvement and mortality rates over the course of the pandemic: an international retrospective cohort study of hospitalised patients infected with SARS-CoV-2.

Evolving phenotypes of non-hospitalized patients that indicate long COVID.

Individualized prediction of COVID-19 adverse outcomes with MLHO.