How quickly does coronavirus spread? MIDAS Fellow, Qianying Lin, works to answer the question


Read more here:

To see Qianying’s presentation, “COVID-19 Outbreak in Wuhan, China: in Retrospect and in Prospect,” please click here. A captioned version will be available soon.

Abstract: Since its first confirmation in December 2019, the novel coronavirus disease (COVID-19) has infected more than 50,000 people and claimed over 2,000 lives in Wuhan, China. It spread across the whole country within a short time and has now swept the world, causing more than 20,000 infections in countries other than China. Using officially reported cases and assuming a changing reporting ratio, we investigated the early stage of the COVID-19 epidemic in Wuhan and analysed its transmissibility. We then built a conceptual model that incorporates zoonotic introduction, emigration, individual reaction, and governmental action to simulate the trends of the outbreak in Wuhan, and predicted that the disease would be completely controlled by the end of April under current policies. These studies provide insights into not only the characteristics of COVID-19 itself but also the impact of governmental actions.
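
The kind of compartmental simulation the abstract describes can be illustrated with a minimal sketch. The code below is not the authors' model: it is a generic discrete-time SEIR model, and every parameter value (population size, rates, the day action starts, the size of the reduction) is a hypothetical placeholder chosen only for illustration. Individual reaction and governmental action are folded into one assumed reduction in the transmission rate, and emigration is modeled as a constant daily outflow that stops once action begins.

```python
from dataclasses import dataclass


@dataclass
class Params:
    """All values are hypothetical placeholders, not fitted estimates."""
    population: float = 11_000_000.0  # assumed city size
    beta0: float = 0.6                # assumed baseline transmission rate per day
    sigma: float = 0.2                # assumed daily rate of leaving the exposed state
    gamma: float = 0.2                # assumed daily removal (recovery or isolation) rate
    zoonotic: float = 1.0             # assumed animal-to-human infections per day
    zoonotic_end: int = 30            # assumed day the zoonotic source stops
    emigration: float = 0.01          # assumed daily outflow fraction before action
    action_day: int = 50              # assumed day governmental action starts
    action_effect: float = 0.8        # assumed fractional cut in transmission


def simulate(p: Params, days: int = 200) -> list[float]:
    """Return the number of infectious people at the end of each day."""
    s, e, i, r = p.population, 0.0, 0.0, 0.0
    infectious = []
    for day in range(days):
        action_on = day >= p.action_day
        # Transmission is cut once action starts; emigration stops too.
        beta = p.beta0 * (1.0 - p.action_effect) if action_on else p.beta0
        keep = 1.0 if action_on else 1.0 - p.emigration

        n = s + e + i + r
        new_exposed = beta * s * i / n
        if day < p.zoonotic_end:
            new_exposed += p.zoonotic  # spillover from the animal source
        new_exposed = min(new_exposed, s)  # cannot expose more than remain susceptible
        new_infectious = p.sigma * e
        new_removed = p.gamma * i

        # Move people between compartments, then apply the emigration outflow.
        s = (s - new_exposed) * keep
        e = (e + new_exposed - new_infectious) * keep
        i = (i + new_infectious - new_removed) * keep
        r = (r + new_removed) * keep
        infectious.append(i)
    return infectious
```

Comparing `simulate(Params())` with `simulate(Params(action_day=10_000))`, where action effectively never starts, gives a rough sense of how much the assumed intervention lowers the peak in this toy setup.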


MIDAS Fellow, Arya Farahi, publishes manuscript on how the age of Dark Matter halos governs their content.

Dark Matter halos are the most massive gravitationally bound objects in our Universe. These halos host the majority of baryonic matter in the Universe in the form of hot gas and a cold stellar phase. Determining how baryons are partitioned into these phases is challenging and requires detailed modeling of galaxy formation and assembly history. By employing a suite of cosmological simulations, Farahi et al. show that the formation time of halos of the same mass is strongly correlated with their gas and stellar content. This implies that formation time is one of the key factors governing the content and form of the baryons within dark matter halos. Incorporating this information has the potential to improve our understanding of the fundamental physics of our Universe using galaxy cluster abundance, and allows us to gain insight into the evolution of matter within these systems.
Read more on the publication here:

MDST Mentors

The Michigan Data Science Team, sponsored by MIDAS and composed of students of all levels from various schools and colleges at UM, is looking for additional faculty or post-doc mentors who could help guide student projects. If you are interested in providing domain-specific expertise or contributing to any of the following topics, please contact MIDAS Education Program Manager, Trisha Fountain, and she will connect you with the appropriate members of the team.
Student projects for this semester include:
  1. Education Deserts: Education deserts are geographic areas removed from post-secondary educational institutions. The presence of these institutions has a substantial impact not only on the educational access of people in their vicinity, but also on local economies and demographics. Take U of M and Ann Arbor as one outstanding example of this type of relationship. We would like to examine which features of these educational institutions affect local socioeconomic factors, and in what ways.
  2. Oscar Winners: How can we predict which movies will win the 2020 Academy Awards? Features students are currently considering include IMDB reviews, ratings, and potentially even Twitter responses.
  3. Music Generation: This team is working on generating music (MIDI files) using deep learning with a transformer model.
  4. r/rateme analysis: rateme is a subreddit where people post pictures of themselves and ask to be rated on appearance. We’re more interested in: What are the demographic distributions (age/gender) of posters and commenters? How do these differ, and how do they interact? How well do age and gender predict ratings? How does the rating-seeking language affect the ratings on a post (e.g., if you display less confidence in posting, are people less likely to rate you harshly)?
  5. Congestion Pricing: Some large cities have implemented congestion pricing policies in which they charge a price for vehicles which enter the city center during peak traffic hours. The idea is that this will incentivize public transportation usage and decrease traffic during rush hours. Students are looking at London traffic data to see how effective this policy has been (London is one of the cities with this type of policy).
  6. Blood Pressure Estimation: We are working with Dr. Byrd from the medical school on this project, so mentors are less necessary, but I figured I’d include this just to be comprehensive. Blood pressure tends to be in flux, so a single sample is less informative than an average over the course of a day. We’ll be looking at clinical trial data and data from the UM hospital clinical warehouse to see if lab results (such as complete blood count) can be used as a good predictor of average blood pressure.
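
The blood pressure item above rests on the idea that one reading is a noisier estimate of a person's typical blood pressure than an average over a day. A minimal sketch with purely synthetic numbers can illustrate this; the true mean, the noise level, the number of readings per day, and the Gaussian noise model are all assumptions made for illustration and are not taken from the project's data.

```python
import random
import statistics


def compare_errors(true_mean=120.0, noise_sd=10.0, n_readings=24,
                   n_days=2000, seed=0):
    """Compare the mean absolute error of one reading vs. a daily average.

    All defaults are hypothetical: a fixed true systolic value with
    independent Gaussian noise on each reading.
    """
    rng = random.Random(seed)
    single_errors = []
    average_errors = []
    for _ in range(n_days):
        readings = [rng.gauss(true_mean, noise_sd) for _ in range(n_readings)]
        # Error when only the first reading of the day is used.
        single_errors.append(abs(readings[0] - true_mean))
        # Error when all readings of the day are averaged.
        average_errors.append(abs(statistics.fmean(readings) - true_mean))
    return statistics.fmean(single_errors), statistics.fmean(average_errors)


single_err, average_err = compare_errors()
print(f"single reading error: {single_err:.2f}")
print(f"daily average error:  {average_err:.2f}")
```

Under these assumptions the averaged estimate has the smaller typical error, which is why the project targets average blood pressure rather than a single measurement.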

MIDAS Director, H.V. Jagadish, and affiliated faculty Levenstein and Hampshire, awarded NSF grant for data equity


View video on data ethics.  


U-M receives $2M NSF grant to explore data equity systems

By Alex Piazza

Data science is an important tool that can help researchers tackle important societal challenges ranging from mobility and health to public safety and education.

But data science techniques and technologies also pose enormous potential for harm by reinforcing inequity and leaking private information. As a result, many sensitive datasets are restricted from research use, impeding progress in areas that impact society.

The University of Michigan, with a $2 million grant from the National Science Foundation (NSF), plans to establish a framework for a national institute that would enable research using sensitive data, while preventing misuse and misinterpretation.

“Data science has proven time and time again to be an invaluable resource when addressing emerging challenges and opportunities in areas of broad potential impact,” said H.V. Jagadish, director of the Michigan Institute for Data Science. “But having access to information comes with a great deal of responsibility, so our first priority is to ensure data science is not misused to disproportionately harm underrepresented groups.”

U-M researchers will partner with colleagues at New York University and the University of Washington over the next two years to deploy new techniques and technologies that enable responsible data science, while establishing an interdisciplinary community focused on the study, design, deployment and assessment of equitable data systems.

Equity is an important facet of data science that NSF aims to strengthen in the coming years, as the federal agency partners with universities such as U-M to enable new modes of data-driven discovery that will transform the frontiers of science and engineering.

The centerpiece of its ongoing effort, called Harnessing the Data Revolution at NSF, is the development of national institutes that address multidisciplinary problems in big data. U-M will help lay the groundwork for developing these institutes, which will eventually serve as a point of convergence for researchers from multiple disciplines to share expertise and address pressing challenges in data science.

“Information is being gathered about all of us, from our Google searches and online purchases to property tax records and social media activity,” said Margaret Levenstein, director of the Inter-university Consortium for Political and Social Research at U-M, which maintains the world’s oldest and largest archive of research and instructional data for the social and behavioral sciences. “You would assume the usage of data to be accurate and fair, but that is not always the case. That is why building a framework is so important because, in order for us to harness the enormous potential of big data, we need to ensure equity and privacy.”

H.V. Jagadish (U-M) is the principal investigator on this grant. Robert Hampshire (U-M), Bill Howe (UW), Margaret Levenstein (U-M) and Julia Stoyanovich (NYU) are co-principal investigators.


MIDAS core faculty, Dr. Robert Hampshire, leads a team of MIDAS faculty to receive NSF Convergence Accelerator grant


Dr. Robert Hampshire, MIDAS core faculty and Associate Professor of Public Policy at the Ford School, and his team have received nearly $1 million in funding from the National Science Foundation’s Convergence Accelerator. The team leaders also include MIDAS faculty members Carol Flannagan, H.V. Jagadish and Margaret Levenstein. Read more at

MIDAS affiliated faculty, Dr. Mike Cafarella, receives funding from NSF’s Convergence Accelerator in Harnessing the Data Revolution


MIDAS affiliated faculty and Associate Professor in Computer Science and Engineering, Dr. Mike Cafarella, has received funding from the National Science Foundation through its Convergence Accelerator program in Harnessing the Data Revolution. The funded project, “Simultaneous Knowledge Network Programming and Extraction”, is a direct result of his team’s project funded by MIDAS. Read more at

Michigan Institute for Data Science Announces the First Cohort of Michigan Data Science Fellows


Seven outstanding young data scientists from the US, Asia and Europe will join the Michigan Institute for Data Science (MIDAS) at the University of Michigan (U-M), as the inaugural cohort of the Michigan Data Science Fellows program.  They will work at the boundaries of data science methods and domain sciences in an intellectually vibrant environment and develop collaborative relationships with the U-M data science community. The Fellows and their data science application areas are:

  • Arya Farahi, coming from Carnegie Mellon University: Cosmology and its intersection with fundamental physics.
  • Qianying (Ruby) Lin, coming from Hong Kong Polytechnic University: Epidemic inferences and trends. 
  • Patrick Park, currently at U-M: Structure and evolution of large-scale human social networks.
  • Elyas Sabeti, currently at U-M: Theory and algorithms for the analysis of medical Big Data.
  • Maria Veiga, coming from the University of Zurich: Developing techniques for multi-scale modeling.
  • Edgar Vivanco (joint postdoctoral fellow with the National Center for Institutional Diversity), coming from Stanford University: Utilizing machine learning to examine how colonial-era institutions and contemporary criminal violence shape economic underperformance.
  • Blair Winograd, currently at U-M: working with M-Write to combine conceptual writing prompts, automated peer review, natural language processing, and automated personalized feedback to create an infrastructure for writing at scale.

The two-year Fellows Program accepts recent PhDs who are stars in their respective fields and whose work is in data science. They are expected to be more independent than the average postdoctoral researcher at the same career juncture; however, each Fellow also has two faculty sponsors, one a methodology expert and the other an expert in an application domain, to ensure scientific and career guidance.

The Fellows program is a new component of MIDAS’ effort to catalyze the transformative use of Data Science in a wide range of disciplines to achieve lasting societal impact, through research, education, outreach and partnership. “This is the first postdoctoral training program at U-M, and one of the few in the nation, with data science as the explicit focus,” says Dr. H.V. Jagadish, MIDAS Director and Professor of Computer Science and Engineering, “and we hope this program will foster the next generation of data science leaders with both a strong scientific vision and a commitment to using data science for positive societal impact.”

One of the Fellows, Elyas Sabeti, expressed great enthusiasm: “This is such a unique opportunity.  It’s amazing that I will be working side by side with people who study Physics, Education, Political Science…  I can’t wait to find out how many great ideas we can come up with together.” 

For more information on the Fellows, please click here.