Michigan Center for Single-Cell Genomic Data Analytics

Research Overview

Single-cell genomics, rooted in single-cell sequencing, has great potential for providing insight into fundamental questions in biomedical science and drive new health science discoveries, such as: How many cell types and functional states are there in a given tissue?  What is the range of natural variation within a cell type and how is such variability affected by genetic and environmental factors?  What happens at the single-cell level during cell fate determination in the developmental process?  How does cellular heterogeneity within a tumor affect response to therapy and how can we address this with precision medicine?  The list is endless.  However, the explosive growth of single-cell sequencing technologies also brings new computational challenges.  One major challenge is the “sparse read counts data”: because of the minuscule amount of genetic material in a single cell, fragments of the genome are often missing from the sequencing read-out, and existing tools are ineffective in addressing this missing-data problem and piecing together reliable genomic information.

The research team will establish a Michigan Center for Single-Cell Genomic Data Analytics, and connect mathematicians and data scientists with biological researchers to develop, evaluate, and implement a variety of cutting-edge methodologies in sparse data analysis.  These methodologies will address issues in data normalization, batch effect detection and correction, marker selection, classification, rare class identification, differential expression, network and phylogenetic inference, develop tools for cyclic or time-series data, and enable information integration across data types.  The team will apply these methodologies to four biological questions to test their utility: 1) Intra-tumor heterogeneity, cancer stem cells in metastasis and treatment resistance, cancer genome evolution; 2) spermatogenesis as a model for cell fate determination during development; 3) transcriptional complexity and gene regulation at the single-cell level; 4) molecular changes at the single-cell level as a result of environmental exposures and windows of susceptibility.

The outcome from this research project will have much broader impact on biomedical research beyond the four research areas that will be used as test cases.  Sparse data analytics also has wide application beyond health sciences.  For example, Electronic Health Records are inherently sparse, as are consumer data (purchasing, rating, or video viewing habits), location and usage data of mobile devices, connectivity in social networks, medical imaging or land imaging by satellites.  In short, this line of research is conceptually connected with many areas of active research in data science and will produce general-purpose tools for many research areas.

Research Impact

Research Team

jun li

Jun Li
Co-Principal Investigator, Human Genetics and Computational Medicine and Bioinformatics

ann gilbert

Anna Gilbert
Co-Principal Investigator, Mathematics

ann gilbert

Laura Balzano
Electrical Engineering and Computer Science

Justin Colacino
Environmental Health Sciences and Nutritional Sciences

ann gilbert

Yuanfang Guan
Computational Medicine and Bioinformatics

Sue Hammoud
Human Genetics, Obstetrics and Gynecology, and Urology

ann gilbert

Gil Omenn
Computational Medicine and Bioinformatics, Human Genetics and School of Public Health

ann gilbert

Clay Scott
Electrical Engineering and Computer

Max Wicha
Internal Medicine

ann gilbert

Xiang Zhou

2019 Schedule of Events- All meetings are in Weiser Hall 619, 2-3:30 pm, unless indicted otherwise

May 24, 2019 Hongjiu Isoform imputation for single-cell RNAseq data using Seekmer
May 31, 2019

Qianhui Huang, Evaluation of Computational Methods to Deconvolute Cell Types in Single-Cell Transcriptomics data

June 7, 2019 Justin, High-content imaging of single cells
June 14, 2019 Lulu (Xiang’s group), Leveraging Gene Co-expression Pattern to Infer Trait-Relevant Tissues in Genome-wide Association Studies
June 21, 2019 Daniel (Yang Chen’s group), Probabilistic Single-Cell Data Integration
June 28, 2019 Alex (Anna’s group), Comparison of marker selection methods for high throughput scRNA-seq data
July 5, 2019 Yutong Wang, Integration of spatial and dissociated single-cell data for estimating anatomical information
July 19, 2019 Tasha (Justin’s group), Characterizing differences in normal breast stem cell biology between African American and European American women using single-cell analyses
July 26, 2019 Umang (Anna’s group), A Paucity of Data in Machine Learning: Applications in Single-Cell RNA Sequencing and Ranking
August 2, 2019 Mark Robinson seminar at 2 pm, Forum Hall, Statistical methods for flexible differential analysis of cross-sample single-cell RNA-seq datasets
September 6, 2019 Xianing Zheng, journal blub (Bonnie Berger papers)
September 20, 2019 Adrienne Shami “Comparative analysis of human, macaque, and mouse testes reveals conserved and divergent features of mammalian spermatogenesis at single cell resolution”

and Jun Li “Single-cell spatial analysis program”

September 27, 2019 Yutong Wang, Journal club (Smita’s publications)
October 4, 2019 Smita Krishnaswamy (Yale), 3 pm, Forum Hall, Individual meetings on Oct 7.
October 10, 2019 Hanchuan Peng, Allen Institute for Brain Science, LSI Seminars, “Industrial-level full neuron morphology screening of whole brains”, noon Forum Hall.  Individual meetings on Oct 11.
October 11, 2019 Jeff Regier (https://regier.stat.lsa.umich.edu), new faculty in Statistics
October 18, 2019 Can meet if someone runs it (Jun away, ASHG 2019)
October 25, 2019 Hengshi Yu (student in Josh Welch lab)
November 1, 2019 Julie Deeke (student in Johann Gagnon-Bartsch lab)
November 8, 2019 Can meet if someone runs it (Jun away to Houston)
November 15, 2019 No meeting, MIDAS Symposium on 11/14-15
November 19 (not a Friday) Gerald Guon (UC Davis), 2 pm, Forum Hall
November 22, 2019 Hojae Lee (student in Josh Welch lab)
November 29, 2019 Day after Thanksgiving, No meeting
December 6, 2019 Nigel Michki (student in Dawen Cai’s group)
December 13, 2019 Hyun Min Kang, faculty in Biostatistics
December 20, 2019 Likely no meeting: too close to end of semester