Refining Predictive Models for Neglected and Emerging Infectious Diseases
- Funded by National Institutes of Health (NIH)
- Total publications:0 publications
Grant number: 5R35GM146612-03
Grant search
Key facts
Disease
COVID-19, UnspecifiedStart & end year
20222027Known Financial Commitments (USD)
$377,500Funder
National Institutes of Health (NIH)Principal Investigator
ASSOCIATE PROFESSOR Ye ShenResearch Location
United States of AmericaLead Research Institution
UNIVERSITY OF GEORGIAResearch Priority Alignment
N/A
Research Category
Clinical characterisation and management
Research Subcategory
Prognostic factors for disease severity
Special Interest Tags
Data Management and Data Sharing
Study Type
Non-Clinical
Clinical Trial Details
N/A
Broad Policy Alignment
Pending
Age Group
Not Applicable
Vulnerable Population
Not applicable
Occupations of Interest
Not applicable
Abstract
PROJECT SUMMARY Predictive models play an essential role in disease prevention and control. Recent advances in scientific research have allowed more thorough and in-depth data collection from epidemiological studies (e.g., GPS data, climate data, wearable device data). However, due to the many variables collected and the relatively short time frame for epidemiological data collection during some of the epidemics, missing information is unavoidable, and subsequent updates of the database may be necessary. How to incorporate data with partial information, i.e., with missingness, and predictors measured dynamically over time, into existing models to perform more accurate and efficient predictions remains a challenge. Recently, the PI and his team have developed predictive models for various purposes among several neglected and emerging infectious diseases, including schistosomiasis, COVID-19, and human seasonal influenza. While conducting these studies, we identified several practical issues prohibiting a broader implementation of the proposed models, such as missing data and a lack of adaptive mechanisms based on dynamic inflows of predictors. Existing models adopting the complete data analysis approach will significantly reduce the statistical power and cause potential bias. Moreover, predictive models applied in epidemiological infectious disease studies often rely on historical data collected up to a time point without taking into consideration of future data inputs. Meanwhile, the development in statistical and machine learning methods laid the foundation for new dynamic predictive models based on trajectory data, with recent progress in functional concurrent regression and incremental learning. However, these methodological advances have been poorly integrated into field applications. Even in recent COVID-19 research where advanced dynamic models have been developed, balancing the data flow and prediction window has not been well studied. In addition, existing models often require a large amount of variable collection, so a practical two-stage approach allowing limited data collection early on can be more time- and cost-effective. In this MIRA proposal, we aim at refining predictive models for several neglected and emerging infectious diseases. Specifically, three coherent projects with distinct research activities will be pursued, which include: 1) refining hotspot prediction models for schistosomiasis interventions; 2) development and validation of prognostic risk models for COVID-19 in the US, with methods development on missing data handling and functional regression for dynamic prediction; 3) development and validation of a vaccine benefits score for human seasonal influenza. The refined models are expected to be accompanied by new and more general predictive algorithms involving missing data processing and dynamic prediction mechanisms to enhance model performance and adaptability. The methodological development from this proposal will also inform other epidemiological studies with similar challenges and have a broader long-term impact beyond the scope of the infectious diseases covered in the currently proposed projects.