I am faculty at ICPSR, the largest social science data archive in the world. I manage an education research pre-registration site (sreereg.org) that is focused on transparency and replicability. I also manage a site for sharing work around record linkage, including code (linkagelibrary.org). I am involved in the LIFE-M project (life-m.org), recently classifying the mortality data. That project uses cutting-edge techniques for machine-reading handwritten forms.
“Neighborhood Environments as Socio-Techno-bio Systems: Water Quality, Public Trust, and Health in Mexico City (NESTSMX)” is an NSF-funded multi-year collaborative interdisciplinary project that brings together experts in environmental engineering, anthropology, and environmental health from the University of Michigan and the Instituto Nacional de Salud Pública. The PI is Elizabeth Roberts (anthropology), and the co-PIs are Brisa N. Sánchez (biostatistics), Martha M Téllez-Rojo (public health), Branko Kerkez (environmental engineering), and Krista Rule Wigginton (civil and environmental engineering). Our overarching goal for NESTSMX is to develop methods for understanding neighborhoods as “socio-techno-bio systems” and to understand how these systems relate to people’s trust in (or distrust of) their water. In the process, we will collectively contribute to our respective fields of study while we learn how to merge efforts from different disciplinary backgrounds.
NESTSMX works with families living in Mexico City, that participate in an ongoing longitudinal birth-cohort chemical-exposure study (ELEMENT (Early Life Exposures in Mexico to ENvironmental Toxicants, U-M School of Public Health). Our research involves ethnography and environmental engineering fieldwork which we will combine with biomarker data previously gathered by ELEMENT. Our focus will be on the infrastructures and social structures that move water in and out of neighborhoods, households, and bodies.
My research investigates social inequality and its maintenance across time and generations. Current projects focus on wealth inequality and its consequences for the next generation, the institutional context of social mobility processes and educational inequality in the United States and other industrialized countries. I also help expand the social science data infrastructure and quantitative methods needed to address questions on inequality and mobility. I serve as Principal Investigator of the Wealth and Mobility (WAM) study as well as Co-Investigator of the Panel Study of Income Dynamics (PSID). As such, my research draws on and helps construct nationally representative survey data as well as full-population administrative data. My methodological work has been focused on causal inference, multiple imputation, and measurement error.
Dr. VanEseltine is a sociologist and data scientist working with large-scale administrative data for causal and policy analysis. His interests include studying the effects of scientific infrastructure, training, and initiatives, as well as the development of open, sustainable, and replicable systems for data construction, curation, and dissemination. As part of the Institute for Research on Innovation and Science (IRIS), he contributes to record linkage and data improvements in the research community releases of UMETRICS, a data system built from integrated records on federal award funding and spending from dozens of American universities. Dr. VanEseltine’s recent work includes studying the impacts of COVID-19 on academic research activity.
My research focuses on building infrastructure for public health and health science research organizations to take advantage of cloud computing, strong software engineering practices, and MLOps (machine learning operations). By equipping biomedical research groups with tools that facilitate automation, better documentation, and portable code, we can improve the reproducibility and rigor of science while scaling up the kind of data collection and analysis possible.
Research topics include:
1. Open source software and cloud infrastructure for research,
2. Software development practices and conventions that work for academic units, like labs or research centers, and
3. The organizational factors that encourage best practices in reproducibility, data management, and transparency
The practice of science is a tug of war between competing incentives: the drive to do a lot fast, and the need to generate reproducible work. As data grows in size, code increases in complexity and the number of collaborators and institutions involved goes up, it becomes harder to preserve all the “artifacts” needed to understand and recreate your own work. Technical AND cultural solutions will be needed to keep data-centric research rigorous, shareable, and transparent to the broader scientific community.
My research interests include health effects of air pollution, temperature extremes and climate change (mortality, asthma, hospital admissions, birth outcomes and cardiovascular endpoints); environmental exposure assessment; and socio-economic influences on health.
Data science tools and methodologies include geographic information systems and spatio-temporal analysis, epidemiologic study design and data management.
I have been involved in the building of data infrastructure in the study of elections, political systems, violence, geospatial units, demographics, and topography. This infrastructure will eventually lead to the integration of data across many domains in the social, health, population, and behavioral sciences. My core research interests are in elections and political organizations.
Dr. Andrew Gronewold, P.E., is an Associate Professor with the School for Environment and Sustainability (SEAS) at the University of Michigan. He also holds adjunct faculty appointments in the University of Michigan’s Department of Civil and Environmental Engineering, and the Department of Earth and Environmental Sciences. Dr. Gronewold conducts research through a range of hydrological science projects that explore methods for quantifying and communicating uncertainties arising within long-term hydrological monitoring networks and data, and incorporating those uncertainties into models and risk-based water resources management decisions. Much of his recent research has focused on monitoring, analyzing, and forecasting the long-term water budget and water levels of the Laurentian Great Lakes.
I manage research activities for the College and Beyond II study at ICPSR, including survey development and data infrastructure planning. My research broadly focuses on issues of postsecondary access and success for undergraduate and graduate students and uses quantitative methodologies.