Explore ARCExplore ARC

Ho-Joon Lee

By |

Dr. Lee’s research in data science concerns biological questions in systems biology and network medicine by developing algorithms and models through a combination of statistical/machine learning, information theory, and network theory applied to multi-dimensional large-scale data. His projects have covered genomics, transcriptomics, proteomics, and metabolomics from yeast to mouse to human for integrative analysis of regulatory networks on multiple molecular levels, which also incorporates large-scale public databases such as GO for functional annotation, PDB for molecular structures, and PubChem and LINCS for drugs or small compounds. He previously carried out proteomics and metabolomics along with a computational derivation of dynamic protein complexes for IL-3 activation and cell cycle in murine pro-B cells (Lee et al., Cell Reports 2017), for which he developed integrative analytical tools using diverse approaches from machine learning and network theory. His ongoing interests in methodology include machine/deep learning and topological Kolmogorov-Sinai entropy-based network theory, which are applied to (1) multi-level dynamic regulatory networks in immune response, cell cycle, and cancer metabolism and (2) mass spectrometry-based omics data analysis.

Figure 1. Proteomics and metabolomics analysis of IL-3 activation and cell cycle (Lee et al., Cell Reports 2017). (A) Multi-omics abundance profiles of proteins, modules/complexes, intracellular metabolites, and extracellular metabolites over one cell cycle (from left to right columns) in response to IL-3 activation. Red for proteins/modules/intracellular metabolites up-regulation or extracellular metabolites release; Green for proteins/modules/intracellular metabolites down-regulation or extracellular metabolites uptake. (B) Functional module network identified from integrative analysis. Red nodes are proteins and white nodes are functional modules. Expression profile plots are shown for literature-validated functional modules. (C) Overall pathway map of IL-3 activation and cell cycle phenotypes. (D) IL-3 activation and cell cycle as a cancer model along with candidate protein and metabolite biomarkers. (E) Protein co-expression scale-free network. (F) Power-low degree distribution of the network E. (G) Protein entropy distribution by topological Kolmogorov-Sinai entropy calculated for the network E.


Emanuel Gull

By |

Professor Gull works in the general area of computational condensed matter physics with a focus on the study of correlated electronic systems in and out of equilibrium. He is an expert on Monte Carlo methods for quantum systems and one of the developers of the diagrammatic ‘continuous-time’ quantum Monte Carlo methods. His recent work includes the study of the Hubbard model using large cluster dynamical mean field methods, the development of vertex function methods for optical (Raman and optical conductivity) probes, and the development of bold line diagrammatic algorithms for quantum impurities out of equilibrium. Professor Gull is involved in the development of open source computer programs for strongly correlated systems.

Quantum impurities are small confined quantum systems coupled to wide leads. An externally applied time-dependent magnetic field induces a change in the population of spins on the impurity, leading to time-dependent switching behavior. The system's equations of motion are determined by a many-body quantum field theory and solved using a diagrammatic Monte Carlo approach. The computations were performed at Columbia University and the University of Michigan.

Quantum impurities are small confined quantum systems coupled to wide leads. An externally applied time-dependent magnetic field induces a change in the population of spins on the impurity, leading to time-dependent switching behavior. The system’s equations of motion are determined by a many-body quantum field theory and solved using a diagrammatic Monte Carlo approach. The computations were performed at Columbia University and the University of Michigan.

Shravan Veerapaneni

By |

Dr. Veerapaneni’s research group develops fast and scalable algorithms for solving differential and integral equations on complex moving geometries. Application areas of current interest include large-scale simulations of blood flow through arbitrary confined geometries, electrohydrodynamics of soft particles and heat flow on time-varying domains.


Jieping Ye

By |

Jieping Ye, PhD, is Associate Professor of Computational Medicine and Bioinformatics in the Medical School at the University of Michigan, Ann Arbor.

The Ye Lab has been conducting fundamental research in machine learning and data mining, developing computational methods for biomedical data analysis, and building informatics software. We have developed novel machine learning algorithms for feature extraction from high-dimensional data, sparse learning, multi-task learning, transfer learning, active learning, multi-label classification, and matrix completion. We have developed the SLEP (Sparse Learning with Efficient Projections) package, which includes implementations of large-scale sparse learning models, and the MALSAR (Multi-tAsk Learning via StructurAl Regularization) package, which includes implementations of state-of-the-art multi-task learning models. SLEP achieves state-of-the-art performance for many sparse learning models, and it has become one of the most popular sparse learning software packages. With close collaboration with researchers at the biomedical field, we have successfully applied these methods for analyzing biomedical data, including clinical image data and genotype data.

Peter X. K. Song

By |

Dr. Song interested in the development and application of theories and methodologies from Data Science to solve scientific problems arising from medical and public health sciences, in particular from the fields of environmental health sciences and nutritional sciences. People from his lab are strongly interested in interdisciplinary research in the areas of statistics, operation research, and machine learning, with the core interest in the statistical foundation of big data analytics, and with target applications in processing and analyzing big data from various applied sciences, including asthma, environmental health sciences, nephrology, and nutritional sciences. His research projects have been funded by NIH, NSF and DARPA funding agencies. Visit Song Lab webpage for detail: http://www.umich.edu/~songlab/

Judy Jin

By |

Judy Jin, PhD, is Professor of Industrial and Operations Engineering in the College of Engineering at the University of Michigan, Ann Arbor.

Prof. Jin’s research focuses on the development of new data fusion methodologies for improving system operation and quality with the emphasis on fusion of data and engineering knowledge collected from disparate sources by integrating multidisciplinary methods. Her research has been widely applied in both manufacturing and service industry by providing techniques for knowledge discovery and risk-informed decision making. Key research issues are being pursued:

  1. Advanced quality control methodologies for system monitoring, diagnosis and control with temporally and spatially dense operational/sensing data.
  2. Multi-scale data transform and high order tensor data analysis for modeling, analysis, classification, and making inferences of multistream sensing signals.
  3. Optimal sensor distribution and hierarchical variable selection methods for system abnormal detection and sensor fusion decisions, which integrates the causal probability network model, statistical change detection, set-covering algorithm, and hierarchical lasso regression.
  4. A unified approach for variation reduction in multistage manufacturing processes (MMPs) using a state space model, which blend the control theory with advanced statistics for MMPs sensing, monitoring, diagnosis and control, integrative design of process tolerance and maintenance policy considering the interaction between product quality and tool reliability.

Data science applications: (a) Smart manufacturing with sensor fusion, process monitoring, diagnosis and control (e.g., metal forming including stamping, forging, casting and rolling), assembly, ultrasonic welding, photovoltaic thin film deposition. (b) Travel time estimation and traffic prediction for intelligent transportation systems. (c) Multi-stream data analysis of human motion/vehicle crash testing data for improving vehicle design and safety. (d) Risk informed decision support for healthcare and clinical decisions. (e) Customer behavior modeling for fraud detection in healthcare and telecommunication. (f) Human decision-making behavior modeling in a dynamic/emergency environment.


Long Nguyen

By |

I am broadly interested in statistical inference, which is informally defined as the process of turning data into prediction and understanding. I like to work with richly structured data, such as those extracted from texts, images and other spatiotemporal signals. In recent years I have gravitated toward a field in statistics known as Bayesian nonparametrics, which provides a fertile and powerful mathematical framework for the development of many computational and statistical modeling ideas. My motivation for all this came originally from an early interest in machine learning, which continues to be a major source of research interest. A primary focus of my group’s research in machine learning to develop more effective inference algorithms using stochastic, variational and geometric viewpoints.