GHUCCTS N3C COVID data mapping

  • Funded by National Institutes of Health (NIH)
  • Total publications:0 publications

Grant number: 3UL1TR001409-06S3

Grant search

Key facts

  • Disease

    COVID-19
  • Start & end year

    2015
    2021
  • Known Financial Commitments (USD)

    $99,864
  • Funder

    National Institutes of Health (NIH)
  • Principal Investigator

    Joseph G Verbalis
  • Research Location

    United States of America
  • Lead Research Institution

    Georgetown University
  • Research Priority Alignment

    N/A
  • Research Category

    Research on Capacity Strengthening

  • Research Subcategory

    Systemic/environmental components of capacity strengthening

  • Special Interest Tags

    Data Management and Data SharingInnovation

  • Study Type

    Not applicable

  • Clinical Trial Details

    N/A

  • Broad Policy Alignment

    Pending

  • Age Group

    Not Applicable

  • Vulnerable Population

    Not applicable

  • Occupations of Interest

    Not applicable

Abstract

AbstractA major challenge to full utilization the available data and resources has been the complex nature of healthdata, and heterogeneity of data sources (including unstructured clinical notes) combined with a lack ofstandards. The lack of standards precludes semantic interoperability across platforms and between institutions.Instead, current approaches utilize resource intensive natural language processes to extract, transform, andcorrelate data from different sources for analysis. To improve translational science and accelerate research toimprove patient outcomes, many new and innovative studies are leveraging large volumes of available datathrough standardized and shared data initiatives. With current advances in computing and health data analysistools, methods and access, and to make data more meaningful, open, and accessible, research studies havemoved beyond traditional retroactive reporting to pragmatic interventions and predictive capabilities. Ongoingefforts focus on exploiting common data standards and models such as the Observational Medical OutcomesPartnership (OMOP) standard-defined by the Observational Health Data Sciences and Informatics (OHDSI)consortium, and accepted as canon by both the NIH and PCORI- will lead the way to discover insights intextual narrative, enforce data standardization, and promote scalability and sharing. The OHDSI Common DataModels (CDM) makes data more meaningful, open, and accessible, which drives translational science andallows for consistent development of predictive models across different data sources. The National COVIDCohort Collaborative (N3C), ACT, BD2K-NIH Data Commons, the National Center for Data to Health (CD2H),and others are among the efforts that will lead to new discoveries and informed decision making, driven bydata science and undergirded by mature Big Data technologies. We propose to design and establish novel,scalable, and standardized big data processes to massively abstract the raw electronic medical recorddatasets for observational studies. This project will develop a secure cloud-based environment to host thesedata, as well as the application programming and graphical user interfaces to support observational researchstudies leveraging these resources. By these means we will reduce the barriers to data standardization,annotation and sharing for reproducible analytics and begin to enforce complete semantic and syntacticinteroperability between the resources in the data ecosystem. This effort will enable our investigators to studythe effects of medical interventions and predict patients' health outcomes and generate the empirical evidencebase necessary to establish best practices in observational analysis.