Michigan Center for Data Driven Drug Development and Treatment Assessment


The University of Michigan’s Institute for Data Science (MIDAS) has been awarded a grant by the National Science Foundation’s Industry-University Cooperative Research Centers (NSF IUCRC) program to establish a center for precompetitive research into drug design, repurposing and repositioning, patient phenotyping, treatment monitoring and assessment, and pharmacovigilance using novel machine learning and artificial intelligence techniques. We are seeking additional industry partners whose research efforts align with the Center’s focus and who will be active collaborators in shaping the future paradigm of integrated and cost-effective patient care. The motivation for the Center, its research areas, and your potential role within it are detailed below.


The Center for Data-Driven Drug Development and Treatment Assessment (DATA) intends to establish a center for precompetitive research into drug design, drug treatment assessment, drug repositioning, patient phenotyping and quantitative pharmacovigilance using novel machine learning and artificial intelligence techniques. We will work with our partners from industry and state / professional organizations whose research efforts align with the Center’s focus and who will be active collaborators in shaping the future paradigm of integrated and cost-effective drug design, health monitoring and patient care.

Center for Data-Driven Drug Development and Treatment Assessment (DATA) will focus on six main areas with the goal of significantly accelerating overall drug development and assessment of the outcomes associated with these treatments, while reducing national health care expenditures. These research areas represent the general capabilities that the Center will offer, but can be readily extended to meet the needs of participating industry partners:

  1. Development, testing, and validation of algebraic, machine learning, and artificial intelligence techniques for drug development, treatment monitoring and patient phenotyping with respect to drugs/treatments – State of the art algebraic, machine learning, and artificial intelligence techniques together with advances in treatment monitoring will be brought to bear on discovering new approaches for drug screening, drug repurposing/repositioning, estimation of patient phenotypes related to treatment outcomes/efficacy and exploring drug toxicity/adverse effects; using public, academic, and industry partner-provided patient and drug databases. These include advanced tensor algebraic techniques that were designed to systematically integrate auxiliary information currently ignored or under-utilized by extant methods. Additionally, novel machine learning-based methodologies for mobile/at-home patient monitoring will also be further developed.
  2. Design and development of advanced artificial intelligence methods for treatment assessment. The Center will focus on the design and implementation of monitoring systems and algorithms that can continuously monitor the health of individuals receiving different types of drugs and treatments.
  3. Provide an industry-wide and vendor-agnostic Secure Data Hub with third-party private search capabilities for the development, testing, validation and assessment of drugs and treatments – By applying advanced algebraic and statistical methods, the Center will create large, encrypted, and integrated databases of patient and pharmaceutical data for which the participating companies need only share full encryptions of their proprietary databases. Such databases enable third-party private searches, allowing partners to search securely and anonymously for data related to therapies or patient populations of interest.
  4. Enable federated machine learning for drug design, health informatics and pharmacovigilance over encrypted databases – Center investigators have adapted several machine learning algorithms for use over encrypted data. The Center will further integrate these algorithms into a federated learning framework. Such technologies will enable machine learning over databases that cannot be integrated into the Secure Data Hub.
  5. Enable data driven approaches to developing heath care policies with respect to drugs and treatments – Utilizing the patient phenotypes, health analytics, and comprehensive adverse event/drug interactions data developed and collected by the Center, policy makers, professional organizations, and patient advocacy organizations will be able to better revise standards of care and patient outreach methods to improve patient outcomes in response to drugs and treatments while reducing costs related to unnecessary or ineffective therapies.
  6. Provide unique lab resources for development and validation of patient phenotypes and new drug discovery – University of Michigan provides unique high throughput in-vitro and in-vivo capabilities, facilitated by multiple labs, centers and cores, which will be made available as resources for developing new drugs informed by discovered patient phenotypes and post-market surveillance data, closing the loop on the pharmaceutical development pipeline.

About the NSF IUCRC Program and Your Potential Collaboration

The Center will leverage the existing community of data science and domain knowledge experts of MIDAS and the University of Michigan. MIDAS and its affiliated faculty will actively engage with industry partners to construct and implement a computational platform using novel algorithmic approaches to extract, validate, and integrate information from different drug/target datasets in a secure and vendor-agnostic manner.

The NSF IUCRC program has been enabling long-term research partnerships between industry, academia, and government for over 40 years. The IUCRC program facilitates the establishment of a thematic center focused on pre-competitive research projects, in which participating members have the ability to solicit and select proposals for development. The NSF funds a single center at the national level for any one research area and has indicated its support for our Center to be the NSF-funded center fulfilling the national need for improved health monitoring and post-market surveillance.

Financial commitment letters will be required for all companies who wish to become members of the Center. Membership entails voting rights on the Industry Advisory Board (IAB), with each member receiving a pro-rated number of votes based on their membership status. We have established the following annual membership fee structure: Full Membership at $40,000 for one vote, $60,000 for one and one-half votes, or $80,000 for two votes; and Associate Membership at $20,000 for one-half vote. Members exercise their voting rights through the Center’s IAB, choosing which pre-competitive research projects to fund. The NSF provides a minimum of $150,000 annually in the first 5 years of the Center’s operations, with additional matching funds depending upon industry participation. The NSF IUCRC program stipulates that 90% of all membership and NSF monies directly fund research projects.

Benefits of Participating in DATA

  1. Define problems/projects, have a team of U-M/industry members work on the project, and have access to all resulting solutions at precompetitive level
    1. Note that the IDC rate for the projects defined within IUCRC is 10%, as opposed to 56% for direct projects with U-M.
    2. The process of agreeing/signing contracts with U-M (or any other university) is often very long and time-consuming, while the projects defined within IUCRC are processed and funded very quickly.
  2. Access to highly skilled talent (students, postdocs, etc.) for future hiring
    1. Ability to observe potential future employees throughout the course of one or more projects and assess their capabilities.
    2. We will have regular workshops through which students/postdocs can have direct discussions regarding their career plans and options.
  3. Access to additional funding directly from NSF
    1. NSF provides grants to industry partners through programs such as INTERN and REU.
    2. Students/interns learn the job before getting hired by a company while getting paid by the NSF.
  4. Ability to interact with other industry partners in a precompetitive setting and learn from each other’s interests, needs, and capabilities
    1. At least some parts of the ideas/solutions to data/algorithmic challenges may come from other industry partners.
    2. Larger companies have the opportunity to learn about the capabilities of smaller companies participating in the Center.
    3. Smaller companies have the opportunity to show their capabilities to larger entities.

Access to Resulting Solutions and IP

NSF emphasizes that centers must target common problems and pains by focusing on precompetitive partnerships. All solutions (data, algorithms, etc.) can be used by all participants but only within the framework of the Center. All companies can use the resulting solutions for the applications defined/funded by the Center but not as a product that they market. The Center focuses mainly on deployment of IP via a public, shared resource mechanism such as open source, open access (Creative Commons), or public domain. However, it is very possible that the work conducted under the Center will result in new IP. The ownership of the technology will be governed by the Center’s bylaws. U-M will work with the companies for technology transfer.

Resources Provided by University of Michigan

  • Data and lab resources related to health informatics and pharmacovigilance
  • Expertise in data science and algorithms for health informatics. Samples of the algorithms and computational resources provided by the Center include:
    • Computational drug toxicity screening and drug interaction modeling using fully homomorphic encryption and coupled tensor methods (CMMC/CTMC)
    • Efficient and effective models for mobile/outpatient monitoring and prediction of adverse events using physiological signals
    • Patient phenotyping using transparent artificial intelligence (AI), which enable interpretation of the recommendations/predictions created by computational methods. These include tensor-based machine learning methods for fusion of multimodal patient data for diagnostic and prognostic modeling

U-M has already established successful research cores, labs, and resources that will be instrumental to the success of DATA, such as:


For inquiries about the DATA Center, please contact Managing Director Ivana Tullett (734-647-4986) or Director Kayvan Najarian.

Mailing Address:

Center for Data-Driven Drug Development and Treatment Assessment (DATA)
University of Michigan Michigan Institute for Data Science
Weiser Hall, 500 Church Street, Suite 600A
Ann Arbor MI 48109