COVID-19 - The Bioinformatics and Biostatistics Hub on the front line (GISAID Project)

  • Funded by Institut Pasteur International Network (IPIN)
  • Total publications:211 publications

Grant number: Unknown

Grant search

Key facts

  • Disease

  • Funder

    Institut Pasteur International Network (IPIN)
  • Principle Investigator

  • Research Location

  • Lead Research Institution

  • Research Category

    Pathogen: natural history, transmission and diagnostics

  • Research Subcategory

    Pathogen genomics, mutations and adaptations

  • Special Interest Tags


  • Study Subject


  • Clinical Trial Details


  • Broad Policy Alignment


  • Age Group

    Not Applicable

  • Vulnerable Population

    Not applicable

  • Occupations of Interest

    Not applicable


Like many other research teams at the Institut Pasteur, the Bioinformatics and Biostatistics Hub is engaged in fighting against the COVID-19 pandemic by participating in the curation of GISAID data (Global Initiative on Sharing All Influenza Data). The Hub is the service division of the Computational Biology Department, and is composed of 50 experts in biostatistics and bioinformatics. At the end of March 2020, following a discussion on phylodynamic questions with the Evolutive Bioinformatics Unit at the Institut Pasteur, GISAID asked for help with processing the increasingly abundant data that was being submitted as well as with maintaining its quality. A solution for managing this essential resource was reached rapidly. The Hub agreed to accommodate this request, and as of April 1st, thirteen of its members have been actively curating, on a daily basis, data received by the consortium. The GISAID initiative, launched in 2006 following the 2006 bird flu epidemic, fosters international sharing of sequences associated with this virus as well as related geographical, clinical and epidemiological information. Its scope is now being extended to species associated with avian and other animal viruses, today including SARS-CoV-2, to help the scientific community understand how viruses evolve, spread and potentially trigger pandemics. (To boot, the National Reference Center for Respiratory Viruses (Including Influenza) at the Institut Pasteur shared the two complete sequences of viruses taken from two of the first French cases on this platform on the 30th of January, 2020.) The Initiative guarantees that access to data in GISAID is free for everyone, provided that individuals log in and agree to respect the GISAID sharing mechanism governed by its database access agreement. As of the 15th of April 2020, more than 130 SARS-CoV-2 genomes had been submitted by Institut Pasteur teams since the month of January. Concretely, Hub members are on duty every day, from midday to midnight, to process the numerous genomes of SARS-CoV-2 submitted (which range from a few dozen to several hundred per day) in order to validate the quality and reliability of sequences and their metadata. The objective is to standardize the metadata in order to facilitate searching the database, and to check the consistency of the assemblies. More than 9,000 genomes are accessible on the GISAID web site today (April 15, 2020), of which almost 3,000 have been curated since the 1st of April with the help of the Hub. This data is used, among other things, by nextstrain, an open source project aimed at providing a snapshot of the evolution of populations of pathogens via a modern and reactive interface. In addition to this action, the Hub remains available to campus scientists. More than ever the Hub is ready to provide its skills in experimental design, data processing, analysis and modelling, as well as in software, pipelines and web application development on priority projects related to COVID-19 research.

Publicationslinked via Europe PMC

Last Updated:38 minutes ago

View all publications at Europe PMC

Medicine Non-Adherence: A New Viewpoint on Adherence Arising from Research Focused on Sub-Saharan Africa.

Genomic Engineering of Oral Keratinocytes to Establish In Vitro Oral Potentially Malignant Disease Models as a Platform for Treatment Investigation.

Microbial Diversity Impacts Non-Protein Amino Acid Production in Cyanobacterial Bloom Cultures Collected from Lake Winnipeg.

Maternal Diet High in Linoleic Acid Alters Offspring Lipids and Hepatic Regulators of Lipid Metabolism in an Adolescent Rat Model.

Effect of Flour Particle Size on the Glycemic Index of Muffins Made from Whole Sorghum, Whole Corn, Brown Rice, Whole Wheat, or Refined Wheat Flours.

Femtosecond Laser Machining of an X-ray Mask in a 500 Micron-Thick Tungsten Sheet.

The Use of Nautical Activities in Formal Education: A Systematic Review.

Assessing the Psychometric Validity of the Epistaxis Severity Score: Internal Consistency and Test-Retest Reliability.