GHUCCTS N3C COVID data mapping
- Funded by National Institutes of Health (NIH)
- Total publications:0 publications
Grant number: 3UL1TR001409-06S3
Grant search
Key facts
Disease
COVID-19Start & end year
20152021Known Financial Commitments (USD)
$99,864Funder
National Institutes of Health (NIH)Principal Investigator
Joseph G VerbalisResearch Location
United States of AmericaLead Research Institution
Georgetown UniversityResearch Priority Alignment
N/A
Research Category
Research on Capacity Strengthening
Research Subcategory
Systemic/environmental components of capacity strengthening
Special Interest Tags
Data Management and Data SharingInnovation
Study Type
Not applicable
Clinical Trial Details
N/A
Broad Policy Alignment
Pending
Age Group
Not Applicable
Vulnerable Population
Not applicable
Occupations of Interest
Not applicable
Abstract
AbstractA major challenge to full utilization the available data and resources has been the complex nature of healthdata, and heterogeneity of data sources (including unstructured clinical notes) combined with a lack ofstandards. The lack of standards precludes semantic interoperability across platforms and between institutions.Instead, current approaches utilize resource intensive natural language processes to extract, transform, andcorrelate data from different sources for analysis. To improve translational science and accelerate research toimprove patient outcomes, many new and innovative studies are leveraging large volumes of available datathrough standardized and shared data initiatives. With current advances in computing and health data analysistools, methods and access, and to make data more meaningful, open, and accessible, research studies havemoved beyond traditional retroactive reporting to pragmatic interventions and predictive capabilities. Ongoingefforts focus on exploiting common data standards and models such as the Observational Medical OutcomesPartnership (OMOP) standard-defined by the Observational Health Data Sciences and Informatics (OHDSI)consortium, and accepted as canon by both the NIH and PCORI- will lead the way to discover insights intextual narrative, enforce data standardization, and promote scalability and sharing. The OHDSI Common DataModels (CDM) makes data more meaningful, open, and accessible, which drives translational science andallows for consistent development of predictive models across different data sources. The National COVIDCohort Collaborative (N3C), ACT, BD2K-NIH Data Commons, the National Center for Data to Health (CD2H),and others are among the efforts that will lead to new discoveries and informed decision making, driven bydata science and undergirded by mature Big Data technologies. We propose to design and establish novel,scalable, and standardized big data processes to massively abstract the raw electronic medical recorddatasets for observational studies. This project will develop a secure cloud-based environment to host thesedata, as well as the application programming and graphical user interfaces to support observational researchstudies leveraging these resources. By these means we will reduce the barriers to data standardization,annotation and sharing for reproducible analytics and begin to enforce complete semantic and syntacticinteroperability between the resources in the data ecosystem. This effort will enable our investigators to studythe effects of medical interventions and predict patients' health outcomes and generate the empirical evidencebase necessary to establish best practices in observational analysis.