Give

Center for Data-Driven Drug Development and Treatment Assessment (DATA)

In this Section

Overview

Pharmaceutical development has a large impact on the nation’s economy and public health. Despite substantial annual outlays for pharmaceutical development, many drugs fail in clinical trial, while the majority of those making it to market fail to yield a profit. These costs and low returns hinder additional development. The Center for Data-Driven Drug Development and Treatment Assessment (DATA), through the pursuit of its research thrusts, has the potential to greatly enhance the national research infrastructure by increasing the capacity of the engineering/scientific workforce. In particular, DATA will produce new methodologies and infrastructure for industry-wide collaborative drug discovery, yielding new medicines at reduced cost.

The Center will focus on three main areas of unmet/underserved research needs within the (bio)pharmaceutical sector, with the goal of significantly accelerating the pace of drug discovery while reducing research costs:

1) the development, testing, and validation of machine learning methods for drug discovery and repurposing;

2) providing an industry-wide and vendor-agnostic Secure Data Hub for pharmaceutical and patient data with third-party private search capabilities; and

3) enable federated machine learning for drug repositioning over encrypted databases. Enabling these research thrusts are new developments in efficient fully homomorphic encryption and applications of coupled tensor-matrix and tensor-tensor completion methods to drug discovery and repurposing.

Awarded Projects

Scaffold Hopping Using GenAI and Limited Data Sets
PI: Peter Toogood, co-I: Emily Wittrup
STAR-guided machine learning prediction of clinical safety before clinical trials even begin
PI: Duxin Sun, co-PI: Kayvan Najarian
Predicting Metabolic-Mediated Drug Toxicity
PI: Dan Beard
Making causal inferences on the effects of drugs with CASM on EHR- based RCT data
PI: Cristian Minoccheri
Machine learning advances for multi-omics integration to improve microbiome-based precision medicine for colorectal cancer diagnosis and treatment
PI: Marcy Balunas
Combatting rapidly mutating viral targets using Thompson sampling
PI: Tim Cernak
Docking to Novel pocKets (DoNK): A Dense Synthetic Receptor-Ligand Binding Dataset
PI: Matt O’Meara
CASM-informed Reinforcement Learning (CASM-RL) to identify optimal treatment strategies for sepsis
PI: Cristian Minoccheri
Model-informed drug development for cancer using agent-based multivariate modeling
PI: Denise Kirschner, co-I: Maral Budak Marple
Generative Artificial Intelligence for Design and Optimization of New Therapeutic Antibodies
co-PIs: Pete Tessier & Kayvan Najarian
Utilizing Fully Homomorphic Encryption for Privacy Preserving Machine Learning in Drug Development
PI: Kayvan Najarian
Machine-Learning Based Optimization to Identify New Treatments for Tuberculosis Response
PI: Denise Kirschner, co-I: Maral Budak Marple
Developing Computational Phenotypes of Patient Reported Outcomes in Inflammatory Bowel Disease for Improved Assessment of Symptomatic Therapeutic Effects and Prediction of Medication Response
PI: Ryan Stidham, co-I: Cristian Minoccheri
Artificial Intelligence for Interpreting Signals Data from Bedside Portable Gas Chromatography in Ulcerative Colitis
co-PIs: Ryan Stidham & Sherman Fan

This project brings together data scientists, mathematicians, biomedical researchers, and healthcare providers to produce reproducible methodologies that will make a broad impact on drug discovery and biomedical applications of data science. The Center will support programs to educate the next generation of data science workforce members, research leaders, and citizens. By forming collaborations with industry, government, and community partners, the project will enable the dissemination and translation of research into impactful products and services for the betterment of society.

Website supported by the U.S. National Science Foundation (NSF) under the award number 2209546. Opinions, findings, conclusions, or recommendations expressed are those of the author(s) and do not necessarily reflect the views of the NSF.