Back to Working Groups

Social Research with Unstructured Data

MIDAS Working Group


Social science increasingly uses data that requires advanced tools and algorithms. Preparing text, audio, and video data for analysis and running those analyses involves linking contemporary computational tools with datasets and major research challenges in ways that cut across disciplines. Building bridges between data scientists who develop the tools to analyze these data and social scientists who have datasets to address research questions in unprecedented ways that could benefit from them is critical for both opening the world of these new data and refining the tools for analysis.

The Social Research with Unstructured Data working group facilitates collaborations between NLP methodology experts and domain researchers who pose significant research questions that could benefit from NLP. One of the primary working group activities is organizing a regular connection series, with presentations on works in progress, funding opportunities, and new data sources. This ongoing series, jointly sponsored by MIDAS, the AI Lab and ISR, began in Fall 2022.

The working group also organized an NLP workshop series in Fall 2022. Past workshop series information is available here.

Working Group Members

Danai Koutra – Associate Professor, Electrical Engineering and Computer Science

Josh Pasek – Associate Professor, Communications and Media, Political Science

Elyse Thulin – Michigan Data Science Fellow

Julia Lippman – Senior Research Specialist, Center for Political Studies

Beth Uberseder – Research Manager, MIDAS

Social Research with Unstructured Data: Connection Series

The Social Research with Unstructured Data Connection Series is designed to build connections between social scientists and data scientists to improve scholarship in both arenas. The series connects faculty, research scientists, postdoctoral, and graduate students to help build major research projects from their vision, find the right methods for their data, identify collaborators, or find scholars who can help trial run the tools they have developed.

For more information, or if you are interested in presenting work in progress at this series, please fill out this form, or contact the MIDAS Research Manager, Beth Uberseder ( For the schedule of upcoming events as well as past speakers for this series, see below.


Upcoming Events

During the Winter 2023 Term, the Social Research with Unstructured Data: Connection Series events were held from 2:00-3:30 PM in ISR 6050 (426 Thompson Street).

Fall 2023 dates and speakers – TBA

Past Events and Speakers

May 5, 2023
Z. Tuba Suzer-Gurtekin, Assistant Research Scientist of the Survey Research Center and Institute for Social Research, presenting “Explaining Consumer Expectations using Big Data”.
David Jurgens, Assistant Professor of Information & Assistant Professor of Electrical Engineering and Computer Science, discussing “Recognizing Empathy in Conversation”.
April 18, 2023
Stella Yu, Professor of Electrical Engineering and Computer Science, discussing “Unsupervised Data-Driven Learning of Visual Hierarchy”.
Sara Lafia, Research Investigator, David Bleckley & Trent Alexander, Population Studies Center and Research Professor, on “Digitizing and parsing semi-structured documents from the GI Bill mortgage program”.
March 31, 2023
Dallas Card, Assistant Professor of Information, on “Measuring Language Ideology in Text Data Selection for Large Language Models”.
February 3, 2023
Walter Mebane, Professor of Political Science and Professor of Statistics, discussing “A Twitter Election Observatory Using Decahose Data”.
Viktoryia Kalesnikava and Aparna Ananthasubramaniam presenting on NLP methods focused on understanding circumstances around suicide deaths that occurred in 2020.
January 20, 2023
No presenters for this week. Elevator pitches about research, networking, and a brainstorming event to kick off the term.
December 9, 2022
Elle O’Brien, Lecturer & Research Investigator at the School of Information, discussing “Barriers to Adopting Data Science Methods for Research”.
Brady West, Research Professor in the Survey Research Center, on “Adjusting Estimated Regression Coefficients for Mismatch Error when Linking Twitter and Survey Data”.
November 11, 2022
Fred Conrad, Professor of Psychology and Director of the Survey Research Center, discussing recent work using stance detection to uncover hidden alignment between social media posts and survey responses.
Vinod Vydiswaran, Associate Professor of Learning Health Sciences and the School of Information, presenting “NLP works hand-in-hand with Qualitative Research — Solutions and Pitfalls”.