Research Pillars

Research activities in 5 interconnected “pillars”

MIDAS focuses on five pillars, described below, within its overall effort to support data science and Artificial Intelligence (AI) research. Across these pillars, MIDAS promotes responsible research practices, supports the adoption of new data types and methodologies, and facilitates convergence science. The pillars will change over time: current pillars will be retired or shift focus, and new pillars established. Furthermore, these pillars are interconnected where possible, to maximize the synergies between them and leverage the interdisciplinary MIDAS community and the collaborative spirit U-M is known for.

In addition to supporting research within the pillars, MIDAS fosters connections and collaborations in data science and AI for the entire University, maintains a large umbrella covering all relevant research areas, and supports an inclusive community of researchers. MIDAS also continues to promote cutting-edge methodological development as the foundation for data science and AI.

For more information or to participate in any of the activities, contact: midas-contact@umich.edu

Responsible Research Pillar: Enhancing Scientific and Societal Impact

MIDAS collaborates with our researchers to develop foundational principles, developing methodology and tools, and deploying such tools for the ethical use of data and algorithms and to improve the reproducibility and replicability of scientific findings. Current activities include:

Responsible Data Science and AI

Overview: As data science and AI become a major force in science and in society, increasingly complex analytical pipelines working with poorly understood data pose significant issues of bias, inclusion and fairness. MIDAS is mobilizing our researchers to promote ethical data science and AI. Our approaches include

raising awareness through research discussions and public events, and enabling the development of technical solutions through collaboration with our faculty researchers. An example of such events is the annual Future Leaders Summit. An example of technical solution development is FIDES (Framework for Integrative Data Equity Systems).

Who will benefit: All U-M researchers who conduct related research and those who want to examine the issues of bias, inclusion and fairness in their research.

Coordinators: H. V. Jagadish (Director, MIDAS | Professor, Electrical Engineering and Computer Science) and Jing Liu (Executive Director, MIDAS)

Data Feminism

Overview: The term “data feminism” was coined by Catherine D’Ignazio and Lauren Klein as the title of their 2020 book, which provides a framework for data scientists to address issues in data justice, equity and fairness through feminism principles and approaches. D’Ignazio and Klein’s keynote speech at the MIDAS 2020 Symposium caused a sensation on campus and many of our researchers responded to the call to action. A faculty reading group has been active since then, and MIDAS is working with other units to develop research on data justice based on feminism principles.

Who will benefit: Researchers whose work is related to data justice and who are interested in seeking collaboration.

Coordinator: H.V. Jagadish (Director, MIDAS | Professor, Electrical Engineering and Computer Science)

Data For Social Good

Overview: MIDAS researchers and students carry out projects to support the data strategy of government and community partners, including the City of Detroit, the Native American tribal nations and other organizations in Michigan. MIDAS coordinates and defines such projects together with our partners, and connects cutting-edge research and ethical data science approaches for positive societal impact. 

Who will benefit: Researchers who would like to increase the impact of their research; trainees who would like to gain real-world experience; government and community partners.

Coordinator: Jing Liu (Executive Director, MIDAS)

Data and AI in Society community forum

Overview: Humans constantly grapple with how new technological advances fit into our moral and ethical framework. As the intensive data use and AI tools become essential in almost every sector, every organization in our society, and every aspect of our daily lives, we all need to think about how we adapt to the co-existence of human and AI, and how these new technologies can support, instead of harm, our values. MIDAS convenes these forums to discuss emerging issues such as AI tools in the classroom and in creative endeavors, and catalyze stronger connections between research and policy.

Who Will Benefit: All data science and AI researchers and users of data and AI products who wonder how such technologies are shaping our work, our lives and our society.

Coordinator: Josh Pasek  (Associate Director, MIDAS | Associate Professor, Communications and Media)

Data Pillar: Measuring and Improving Society

Societal transformations have complicated traditional survey methods for data collection, while a plethora of new data sources are creating opportunities to measure human behaviors and the human social condition. MIDAS supports the development and use of data science and AI methods to better understand society through new data types such as text, video, sensor and digital trace data.  Current activities include:

Unstructured Data for Social Science

Overview: MIDAS supports the campus community to develop this research theme with a two-pronged approach that includes training and research incubation.  For training, MIDAS organizes a series of tutorials designed to introduce Natural Language Processing (NLP) to domain researchers who have little or no prior experience.  We are also developing a learning roadmap that provides researchers with a systematic approach to learning NLP.  To enable innovative research, MIDAS collaborates with the AI Lab and the Institute for Social Research to offer research connection meetings to connect experts for unstructured data (text, image, video, etc) and domain researchers who plan to address significant research questions using such data.

Who Will Benefit: Researchers who want to learn the fundamentals of research with unstructured data, researchers who have such data and want to connect with methodology experts, and methodology experts who want to seek domain collaborators to significant research questions.

Coordinators: Josh Pasek  (Associate Director, MIDAS | Associate Professor, Communications and Media) and Beth Uberseder (Research Manager, MIDAS)

Access to Twitter Data

Overview: MIDAS provides the U-M research community access to a large repository of raw Twitter data free of charge. Falling under the data pillar, The Twitter Decahose is an ongoing collection of a 10% sample of tweets with data going as far back as 2009. There are also subsets for specific purposes related to COVID-19 and US and Indian politicians.

Who Will Benefit: This is a shared effort between MIDAS, CSCAR, and ARC to provide data to researchers for a broad spectrum of applications in network analysis, behavior analysis, sociolinguistics, natural language processing, and information diffusion. 

Coordinators: Sean Meyer  (Senior Scientist, MIDAS) and Beth Uberseder (Research Manager, MIDAS)

Supporting the development of new data and their access

Overview: MIDAS supports and collaborates with our faculty and campus units who develop new data sources and data infrastructure for social science research, and enable the wide adoption of such new resources. To name just a few examples: 1) The Research Data Ecosystem, a major effort of ICPSR with NSF funding to modernize the existing software platform to increase the ability of researchers to safely and securely access, connect, store, and manipulate data. MIDAS is a collaborator for the grant application and the project implementations. 2) MIDAS provided initial funding and data access for Libby Hemphill (faculty member in the School of Information and ICPSR) to develop the Social Media Archive. 3) MIDAS pilot funding for faculty effort to develop new data for social science (such as a large scale data on romantic relationships and digitizing the G.I. Bill record data).

Who will benefit: Researchers and units that develop new data sources and infrastructure.

Coordinators: Jing Liu  (Executive Director, MIDAS) Beth Uberseder (Research Manager, MIDAS)

AI Pillar: AI for Science and Engineering

AI methods are revolutionizing research through being powerful tools for many steps of the research workflow, and through automating and accelerating the entire research workflow. We catalyze creative and transformative applications of AI with the potential to lead to major scientific breakthroughs; and enable a broader U-M research community to adopt AI in imagining, planning, executing, and supporting research applications across a range of science and engineering domains.

AI in Science and Engineering postdoc program

Overview: This campus-wide program provides outstanding early-career researchers with intensive training and research experience as they ready themselves for independent research in academia and other sectors. As one of the sites in a new global network of postdoc programs focusing on AI in Science and Engineering, we support postdocs who apply AI methodology to address significant research questions. AI is defined broadly to include machine learning, robotics, Bayesian inference, and simulation. Science and engineering includes mathematical sciences, physical sciences, earth and environmental sciences, basic biological sciences, and engineering. 

Who Will Benefit: Postdocs, their faculty mentors, and research collaborators.

Coordinators: H.V. Jagadish (Director, MIDAS | Professor, Electrical Engineering and Computer Science), Jing Liu (Executive Director, MIDAS), and William Currie (Professor and Associate Dean for Research and Engagement, SEAS)

AI in Science and Engineering for the campus

Overview: Using the postdoc training program as the core, we are developing research and training activities for campus researchers and trainees, to connect AI methodologists and domain researchers, to enable AI skills training for researchers, to build an AI in Science and Engineering research community.

Who Will Benefit: All researchers on campus who would like to apply AI methods to science and engineering research.

Coordinators: H.V. Jagadish (Director, MIDAS | Professor, Electrical Engineering and Computer Science), Jing Liu (Executive Director, MIDAS)

Automated Research Workflow

Overview: Significant advancements in scientific computing, AI, and the hardware and software research environment are enabling researchers to develop automated research workflows (ARWs): AI and machine learning (ML) as components for data processing and analytics, and as tools to design and monitor experiments. Institutions and researchers are capitalizing on this opportunity to significantly accelerate research. MIDAS has organized a colloquium and faculty discussions around this theme, and is now building collaboration among campus researchers to enable ARW development and adoption.

Who Will Benefit: All researchers on campus who would like to explore how ARWs could facilitate their research and researchers who are interested in developing ARWs.

Coordinator: Jing Liu (Executive Director, MIDAS)

Emerging Pillar: Cultivating New Strengths

We support team activities in strategic research areas that have the potential to grow into pillars. The focus is on areas that are, or are expected to be, national priorities and / or U-M strengths, and can be significantly boosted with Data Science and AI. Naturally, activities in this pillar are exploratory, and a strong focus will emerge with time.

Data Science for Environmental Research

Overview: As research on the environment, climate change and sustainability becomes a national priority, an increasing number of U-M environmental scientists are embracing data science and AI methods. MIDAS has been building collaboration and support for these researchers and our concerted effort lays the foundation for our next pillar. 

MIDAS offers a training program for environmental scientists to help U-M faculty and research scientists adopt data science and AI techniques to environmental science research and integrate data science into their grant applications. The program also helps foster a U-M research community that will advance the application of data science and AI to research that encompasses environmental, climate, and earth sciences; as well as ecology. The full training program has three phases: a half-day workshop, a week-long bootcamp, and two optional half-day follow-up sessions.

In addition, our Propelling Original Data Science (PODS) pilot funding program has been supporting a number of environmental research projects that use cutting-edge data science and AI methods, including:

  • Improving the efficiency of energy grids
  • Improving urban water quality
  • Better machine learning models to monitor air quality
  • Supporting decision making for navigating vital waterways in winter
  • Identifying communities vulnerable to climate change
  • Developing data systems of fish communities in the Great Lakes
  • Detecting illicit wildlife trading

Who Will Benefit: All environmental scientists who want to explore data science and AI methods for their research, and methodologists who seek to collaborate with environmental scientists.

Coordinator: Beth Uberseder (Research Manager, MIDAS)

Analytics Pillar: Transforming Health Interventions

Health intervention research and implementation is one of the biggest users, as well as inspirations, of cutting-edge data science and AI methods.  MIDAS collaborates with campus partners to enable the adoption of cutting-edge analytics and modeling of complex data to boost U-M’s biomedical and healthcare research.

Sequential Decision Making for Health Intervention

Overview: MIDAS enables the application of Sequential Decision Making (SDM) to health intervention and other related research through a methodology working group and training for researchers.

The working group aims to develop research grants to both advance cutting-edge SDM methodologies beyond micro-randomized trials / just-in-time adaptive interventions, and to enable the wide adoption of such methodologies. Through a series of meetings, the working group is identifying a) current and developing SDM methodologies that could be enhanced / extended by U-M researchers, b) identifying healthcare challenges that could be mitigated with the application of these methods, and c) identifying challenges in non-health care domain that could benefit from the same methods (such as industrial engineering). .

A workshop, planned for Summer 2023, will focus on methods for optimizing adaptive interventions.  This workshop will produce researchers with a strong grasp of the core components of adaptive interventions and available experimental tools for optimizing them, and produce and disseminate via a durable website a set of recommendations on how to teach / interpret the curriculum on optimizing adaptive interventions.

Who Will Benefit: Both SDM methodologists and researchers who may adopt these methods in their research.

Coordinator: Beth Uberseder (Research Manager, MIDAS)

NSF IUCRC : Center for Data-Driven Drug Development and Treatment Assessment (DATA)

Overview: MIDAS researchers and their U-M collaborators and industry and healthcare partners have established an NSF-funded Center for Data-Driven Drug Development and Treatment Assessment (DATA) for pre-competitive research in drug design, drug treatment assessment, drug repositioning, patient phenotyping, and quantitative pharmacovigilance using novel machine learning (ML) and Artificial Intelligence (AI) techniques.  The external partners include healthcare systems, non-profits, technology companies and pharmaceutical manufacturers, ranging in size from startups to multinational organizations.

Who Will Benefit: The Center will be a hub to identify and address the complex and time-consuming nature of drug design and treatment assessment, creating solutions that not only reduce the cost associated with the drug design process, but also those that assess, monitor, and optimize treatment and overall patient health.

Coordinator: Kayvan Najarian (PI, DATA; Associate Director, MIDAS; Professor, Computational Medicine and Bioinformatics) and Ivana Tullett (Managing Director, DATA)

Bootcamp on Introductory Data Science for Biomedical Scientists

Overview: The annual bootcamp is open to all U-M researchers and trainees, as well as biomedical scientists from the public and private sectors. We especially welcome faculty members who want to grasp the basic concepts and methods of data science, so that they can start building a data science component in their research, and work more effectively with their data science collaborators. Furthermore, we help like-minded researchers get to know each other through this bootcamp and develop collaboration. Participants will learn how to a) utilize supervised and unsupervised machine learning, including deep learning, for clinical applications; b) determine which data science/artificial intelligence techniques are appropriate for a given clinical application; c) implement these methodologies in Python code, and d) showcase success stories to further exhibit opportunities and approaches to enhance medical research with data science. After the bootcamp, we will offer follow-up sessions that focus on developing data science components for grant proposals, focusing on K grants. 

Who Will Benefit: All biomedical scientists who are interested in learning about incorporating data science into their research, but the content is geared towards junior faculty members and those from the public and private sectors. 

Coordinator: Kayvan Najarian (PI, DATA; Associate Director, MIDAS; Professor, Computational Medicine and Bioinformatics) and Beth Uberseder (Research Manager, MIDAS)

Wearables Data Platform

Overview: Data from wearable devices are increasingly central to research on health and human behavior, but a lack of standards in data collection and processing will hinder data sharing and the effectiveness of such research. A number of campus organizations, including the Biosciences Initiative, the Exercise and Sports Science Initiative and MIDAS, organized a Wearables Summit in 2021 for U-M researchers, and the key takeaway from the Summit was the urgent need of an effective process for data sharing and workflow management. MIDAS is one of the collaborators in an effort organized by multiple research units and researchers across campus to build a platform to house wearables data at scale and to develop best practices for data collection and sharing. 

Who Will Benefit: This potential offering to the U-M community can facilitate collaboration and data sharing among researchers who use internal and external wearables data collected in various studies.

Coordinator: Sean Meyer (Senior Scientist, MIDAS)