Marie O’Neill

My research interests include health effects of air pollution, temperature extremes and climate change (mortality, asthma, hospital admissions, birth outcomes and cardiovascular endpoints); environmental exposure assessment; and socio-economic influences on health.
Data science tools and methodologies include geographic information systems and spatio-temporal analysis, epidemiologic study design and data management.

Ken Kollman

I have been involved in the building of data infrastructure in the study of elections, political systems, violence, geospatial units, demographics, and topography. This infrastructure will eventually lead to the integration of data across many domains in the social, health, population, and behavioral sciences. My core research interests are in elections and political organizations.

Sara Lafia

I am a Research Fellow in the Inter-university Consortium for Political and Social Research (ICPSR) at the University of Michigan. My research is currently supported by an NSF project, Developing Evidence-based Data Sharing and Archiving Policies, where I am analyzing curation activities, automatically detecting data citations, and contributing to metrics for tracking the impact of data reuse. I hold a Ph.D. in Geography from UC Santa Barbara, and I have expertise in GIScience, spatial information science, and urban planning. My interests also include the Semantic Web, innovative GIS education, and the science of science. I have experience deploying geospatial applications, designing linked data models, and developing visualizations to support data discovery.

Mihaela (Miki) Banu

In the area of multi-scale modeling of manufacturing processes: (a) Models for understanding the mechanisms of forming and joining of lightweight materials. This new understanding enables the development of advanced processes that remove limitations of current state-of-the-art capabilities, namely the limited formability of high-strength lightweight alloys and the limited reproducibility of joining quality; (b) Innovative multi-scale finite element models for ultrasonic welding of battery tabs (resulting in models adopted by GM for designing and manufacturing batteries for the Chevy Volt), and multi-scale models for ultrasonic welding of short carbon fiber composites (resulting in models adopted by GM for designing and manufacturing assemblies made of carbon fiber composites with metallic parts); (c) Data-driven algorithms for predicting the geometrical and microstructural integrity of incrementally formed parts. Machine learning is used to develop fast and robust methods that can be integrated into the design process and replace finite element simulations.

Anthony Vanky

Anthony Vanky develops and applies data science and computational methods to design, plan, and evaluate cities, emphasizing their applications to urban planning and design. Broadly, his work focuses on the domains of transportation and human mobility; social behaviors and urban space; policy evaluation; quantitative social sciences; and the evaluation of urban form. Through this work, he has collaborated extensively with public and private partners. In addition, he considers creative approaches toward data visualization, public engagement and advocacy, and research methods.


Anthony Vanky’s Cityways project analyzed 2.2 million trips from 135,000 people over one year to understand the factors that influence outdoor pedestrian path choice. Factors considered included weather, urban morphology, businesses, topography, traffic, and the presence of green spaces, among others.

Robert Manduca

Professor Manduca’s research focuses on urban and regional economic development, asking why some cities and regions prosper while others decline, how federal policy influences urban fortunes, and how neighborhood social and economic conditions shape life outcomes. He studies these topics using computer simulations, spatial clustering methods, network analysis, and data visualization.

In other work, he explores the consequences of rising income inequality for various aspects of life in the United States, using descriptive methods and simulations applied to Census microdata. This research has shown how rising inequality has led directly to lower rates of upward mobility and increases in the racial income gap.

9.9.2020 MIDAS Faculty Research Pitch Video.

Screenshot from “Where Are The Jobs?” visualization mapping every job in the United States based on the unemployment insurance records from the Census LODES data.

Joshua Stein

As a board-certified ophthalmologist and glaucoma specialist, I have more than 15 years of clinical experience caring for patients with different types and complexities of glaucoma. In addition, as a health services researcher, I have developed expertise in several disciplines, including analyzing large health care claims databases to study utilization and outcomes of patients with ocular diseases, racial and other disparities in eye care, and associations between systemic conditions or medication use and ocular diseases. I have learned the nuances of various data sources and ways to maximize our use of them to answer important and timely questions. Leveraging my background in health services research (HSR) with new skills in bioinformatics and precision medicine, over the past 2-3 years I have been developing and growing the Sight Outcomes Research Collaborative (SOURCE) repository, a powerful tool that researchers can tap into to study patients with ocular diseases. My team and I have spent countless hours devising ways of extracting electronic health record data from Clarity, cleaning and de-identifying the data, and making it linkable to ocular diagnostic test data (OCT, HVF, biometry) and non-clinical data. Now that we have successfully developed such a resource here at Kellogg, I am collaborating with colleagues at more than two dozen academic ophthalmology departments across the country to assist them with extracting their data in the same format and sending it to Kellogg, so that we can pool the data and make it accessible to researchers at all of the participating centers for research and quality improvement studies.
I am also actively exploring ways to integrate data from SOURCE into deep learning and artificial intelligence algorithms, use SOURCE data for genotype-phenotype association studies and the development of polygenic risk scores for common ocular diseases, capture patient-reported outcome data for the majority of eye care recipients, enhance visualization of the data on easy-to-access dashboards to aid in quality improvement initiatives, and use the data to enhance quality of care, safety, and efficiency of care delivery and to improve clinical operations.

Yuri Zhukov

My research focuses on the causes, dynamics and outcomes of conflict, at the international and local levels. My methodological areas of interest include spatial statistics, mathematical/computational modeling and text analysis.

Map/time-series/network plot, showing the flow of information across battles in World War II. Z axis is time, X and Y axes are longitude and latitude, polygons are locations of battles, red lines are network edges linking battles involving the same combatants. Source:

Joseph Eisenberg

Dr. Eisenberg studies infectious disease epidemiology with a focus on waterborne pathogens. His expertise is in the areas of water, sanitation, and hygiene (WASH), quantitative microbial risk assessment (QMRA), and disease transmission modeling. Dr. Eisenberg has a long-standing research platform in northern coastal Ecuador, examining how changes in the social and natural environments, mediated by road construction, affect the epidemiology of enteric pathogens. Specific studies focus on enteric pathogens, antimicrobial resistance, the microbiome, and dengue. He is also involved in the NIGMS consortium, Models of Infectious Disease Agent Study (MIDAS), to examine mechanisms of transmission and the potential for intervention and control of enteric pathogens through water and sanitation interventions.

Jim Omartian

My research explores the interplay between corporate decisions and employee actions. I currently use anonymized mobile device data to observe individual behaviors, and employ both unsupervised and supervised machine learning techniques.