Center for Data-Driven Drug Development and Treatment Assessment (DATA)

Call for Lightning Talks!

The Center for Data-Driven Drug Development and Treatment Assessment (DATA), an NSF-backed Industry-University Cooperative Research Center (NSF IUCRC), is inviting University of Michigan students (undergrad-doc), postdocs, and other researchers in training to submit their research projects by March 9, 2026 using our Google Form.

The lightning talks will be held at the Center’s Industry Advisory Board (IAB) meeting on March 30, 2026, over Zoom from 1-2pm EST. Students selected to give a lightning talk will be emailed a Zoom link closer to the start of the event.

The fully virtual meeting will host several companies from the pharmaceutical, healthcare, IT, and AI startup industries, offering the presenters ample opportunities to network and discuss their career plans with potential employers. Participants will be invited to attend the entire IAB meeting, which will also include a career panel, faculty presentations competing for DATA funding, and showcase of DATA industry partners.

Questions? Contact us at [email protected].

Pharmaceutical development has a large impact on the nation’s economy and public health. Despite substantial annual outlays for pharmaceutical development, many drugs fail in clinical trial, while the majority of those making it to market fail to yield a profit. These costs and low returns hinder additional development. The Center for Data-Driven Drug Development and Treatment Assessment (DATA), through the pursuit of its research thrusts, has the potential to greatly enhance the national research infrastructure by increasing the capacity of the engineering/scientific workforce. In particular, DATA will produce new methodologies and infrastructure for industry-wide collaborative drug discovery, yielding new medicines at reduced cost.

The Center will focus on three main areas of unmet/underserved research needs within the (bio)pharmaceutical sector, with the goal of significantly accelerating the pace of drug discovery while reducing research costs:

  1. The development, testing, and validation of machine learning methods for drug discovery and repurposing.
  2. Providing an industry-wide and vendor-agnostic Secure Data Hub for pharmaceutical and patient data with third-party private search capabilities.
  3. Enable federated machine learning for drug repositioning over encrypted databases. Enabling these research thrusts are new developments in efficient fully homomorphic encryption and applications of coupled tensor-matrix and tensor-tensor completion methods to drug discovery and repurposing.

Awarded Projects

Scaffold Hopping Using GenAI and Limited Data Sets

PI: Peter Toogood, co-I: Emily Wittrup

STAR-guided machine learning prediction of clinical safety before clinical trials even begin

PI: Duxin Sun, co-PI: Kayvan Najarian

Predicting Metabolic-Mediated Drug Toxicity

PI: Dan Beard

Making causal inferences on the effects of drugs with CASM on EHR- based RCT data

PI: Cristian Minoccheri

Machine learning advances for multi-omics integration to improve microbiome-based precision medicine for colorectal cancer diagnosis and treatment

PI: Marcy Balunas

Combatting rapidly mutating viral targets using Thompson sampling

PI: Tim Cernak

Docking to Novel pocKets (DoNK): A Dense Synthetic Receptor-Ligand Binding Dataset

PI: Matt O’Meara

CASM-informed Reinforcement Learning (CASM-RL) to identify optimal treatment strategies for sepsis

PI: Cristian Minoccheri

Model-informed drug development for cancer using agent-based multivariate modeling

PI: Denise Kirschner, co-I: Maral Budak Marple

Generative Artificial Intelligence for Design and Optimization of New Therapeutic Antibodies

co-PIs: Pete Tessier & Kayvan Najarian

Utilizing Fully Homomorphic Encryption for Privacy Preserving Machine Learning in Drug Development

PI: Kayvan Najarian

Machine-Learning Based Optimization to Identify New Treatments for Tuberculosis Response

PI: Denise Kirschner, co-I: Maral Budak Marple

Developing Computational Phenotypes of Patient Reported Outcomes in Inflammatory Bowel Disease for Improved Assessment of Symptomatic Therapeutic Effects and Prediction of Medication Response

PI: Ryan Stidham, co-I: Cristian Minoccheri

Artificial Intelligence for Interpreting Signals Data from Bedside Portable Gas Chromatography in Ulcerative Colitis

co-PIs: Ryan Stidham & Sherman Fan

Project Statement

This project brings together data scientists, mathematicians, biomedical researchers, and healthcare providers to produce reproducible methodologies that will make a broad impact on drug discovery and biomedical applications of data science. The Center will support programs to educate the next generation of data science workforce members, research leaders, and citizens. By forming collaborations with industry, government, and community partners, the project will enable the dissemination and translation of research into impactful products and services for the betterment of society.

Website supported by the U.S. National Science Foundation (NSF) under the award number 2209546. Opinions, findings, conclusions, or recommendations expressed are those of the author(s) and do not necessarily reflect the views of the NSF.

DATA Team

Lead Investigators

H. V. Jagadish

Edgar F Codd Distinguished University Professor of Electrical Engineering and Computer Science

Kayvan Najarian

Professor of Computational Medicine and Bioinformatics

Center Staff

Ivana Tullett

Compliance Coordinator

Chloe Winnie

Director of Operations