Establishing the GWAS Catalog as a resource for large-scale association studies

  • Funded by National Institutes of Health (NIH)
  • Total publications:0 publications

Grant number: 3U41HG007823-07S1

Grant search

Key facts

  • Disease

    COVID-19
  • Start & end year

    2014
    2022
  • Known Financial Commitments (USD)

    $168,829
  • Funder

    National Institutes of Health (NIH)
  • Principal Investigator

    Helen Elizabeth Parkinson
  • Research Location

    United States of America
  • Lead Research Institution

    European Molecular Biology Laboratory
  • Research Priority Alignment

    N/A
  • Research Category

    Pathogen: natural history, transmission and diagnostics

  • Research Subcategory

    Pathogen morphology, shedding & natural history

  • Special Interest Tags

    Data Management and Data Sharing

  • Study Type

    Clinical

  • Clinical Trial Details

    Not applicable

  • Broad Policy Alignment

    Pending

  • Age Group

    Unspecified

  • Vulnerable Population

    Unspecified

  • Occupations of Interest

    Unspecified

Abstract

Project Summary: Accelerating access and sharing of COVID-19 human host genetic and phenotype dataEarly evidence from twin studies suggests that approximately 50% of COVID-19 disease burden isdetermined by host genetics. The identification of host factors for COVID-19 will directly influence thedevelopment of public health intervention strategies and the identification of drug targets. There are a varietyof existing cohort longitudinal studies with existing genetic and clinical data, e.g. UK Biobank, AllofUs,23andMe, Ancestry.com who are engaging existing cohort participants for information on COVID-19 diseaseburden. The COVID-19 Host Genetics Initiative (COVID-19-HGI) is an international consortium that aims toidentify host genetic associations of COVID-19 by combining data from human cohorts. The EuropeanGenome-phenome Archive (EGA) and the NHGRI Analysis, Visualization, and Informatics Lab-spaceAnVIL/Terra platforms are founding partners that form the data sharing and analysis platform. The EGA is aGA4GH driver project and can rapidly acquire these data enabling ethical genomic data sharing. This extendsthe international data sharing infrastructure and processes enabling access to human controlled access datarelevant to addressing the COVID-19 pandemic.Aim 1: Host submissions to the COVID-19-HGI data sharing platformThe EGA has previously received submissions from over 144 US submitters and US based users represent33% of the total user community which streams 8.6 PB of data last year. The COVID-19 pandemic is expectedto significantly increase this number through planned new industrial collaboration (e.g. Ancestry.com,Regeneron Pharmaceuticals). There is an opportunity to develop new submission templates and processesto enable more rapid submission of genetic and phenotype data.Aim 2: COVID-19 host metadata harmonisationRecording and collection of clinical patient data of COVID-19 disease burden is a critical requirement.Phenotype information is collected using a variety of formats, coding schemes, surveys, and ontologies.Using the COVID-19-HGI data dictionary, we will construct a common minimal metadata model that will mapacross COVID-19 studies for genetic association studies.Aim 3: Rapid integrated data access and flow into COVID-19-HGI analysis platformRapid integration of new human genotypes and phenotyping will be essential to determine reliable and wellsupported genetic associations. The NHGRI AnVIL and Terra platform will be the analysis platform for theCOVID-19-HGI. We will use GA4GH standards to provide rapid data access and integration of US COVID-19 data. This will result in more rapid and seamless human data flow between EGA and AnVIL to provideadditional power to COVID-19 host association studies