Establishing the GWAS Catalog as a resource for large-scale association studies
- Funded by National Institutes of Health (NIH)
- Total publications:0 publications
Grant number: 3U41HG007823-07S1
Grant search
Key facts
Disease
COVID-19Start & end year
20142022Known Financial Commitments (USD)
$168,829Funder
National Institutes of Health (NIH)Principal Investigator
Helen Elizabeth ParkinsonResearch Location
United States of AmericaLead Research Institution
European Molecular Biology LaboratoryResearch Priority Alignment
N/A
Research Category
Pathogen: natural history, transmission and diagnostics
Research Subcategory
Pathogen morphology, shedding & natural history
Special Interest Tags
Data Management and Data Sharing
Study Type
Clinical
Clinical Trial Details
Not applicable
Broad Policy Alignment
Pending
Age Group
Unspecified
Vulnerable Population
Unspecified
Occupations of Interest
Unspecified
Abstract
Project Summary: Accelerating access and sharing of COVID-19 human host genetic and phenotype dataEarly evidence from twin studies suggests that approximately 50% of COVID-19 disease burden isdetermined by host genetics. The identification of host factors for COVID-19 will directly influence thedevelopment of public health intervention strategies and the identification of drug targets. There are a varietyof existing cohort longitudinal studies with existing genetic and clinical data, e.g. UK Biobank, AllofUs,23andMe, Ancestry.com who are engaging existing cohort participants for information on COVID-19 diseaseburden. The COVID-19 Host Genetics Initiative (COVID-19-HGI) is an international consortium that aims toidentify host genetic associations of COVID-19 by combining data from human cohorts. The EuropeanGenome-phenome Archive (EGA) and the NHGRI Analysis, Visualization, and Informatics Lab-spaceAnVIL/Terra platforms are founding partners that form the data sharing and analysis platform. The EGA is aGA4GH driver project and can rapidly acquire these data enabling ethical genomic data sharing. This extendsthe international data sharing infrastructure and processes enabling access to human controlled access datarelevant to addressing the COVID-19 pandemic.Aim 1: Host submissions to the COVID-19-HGI data sharing platformThe EGA has previously received submissions from over 144 US submitters and US based users represent33% of the total user community which streams 8.6 PB of data last year. The COVID-19 pandemic is expectedto significantly increase this number through planned new industrial collaboration (e.g. Ancestry.com,Regeneron Pharmaceuticals). There is an opportunity to develop new submission templates and processesto enable more rapid submission of genetic and phenotype data.Aim 2: COVID-19 host metadata harmonisationRecording and collection of clinical patient data of COVID-19 disease burden is a critical requirement.Phenotype information is collected using a variety of formats, coding schemes, surveys, and ontologies.Using the COVID-19-HGI data dictionary, we will construct a common minimal metadata model that will mapacross COVID-19 studies for genetic association studies.Aim 3: Rapid integrated data access and flow into COVID-19-HGI analysis platformRapid integration of new human genotypes and phenotyping will be essential to determine reliable and wellsupported genetic associations. The NHGRI AnVIL and Terra platform will be the analysis platform for theCOVID-19-HGI. We will use GA4GH standards to provide rapid data access and integration of US COVID-19 data. This will result in more rapid and seamless human data flow between EGA and AnVIL to provideadditional power to COVID-19 host association studies