My lab has two main areas of focus: molecular characteristics of head and neck cancer, and the intersection of regulatory genomics and pathway analysis. With head and neck cancer, we study tumor subtypes and biomarkers of prognosis, treatment response, and recurrence. We perform integrative omics analyses, dimension reduction methods, and prediction techniques, with the ultimate goal of identifying patient subsets who would benefit from either an additional targeted treatment or de-escalated treatment to increase quality of life. For regulatory genomics and pathway analysis, we develop statistical tests taking into account important covariates and other variables for weighting observations.
Dr. Douville is a critical care anesthesiologist with an investigative background in bioinformatics and perioperative outcomes research. He studies techniques for utilizing health care data, including genotype, to deliver personalized medicine in the perioperative period and intensive care unit. His research background has focused on ways technology can assist health care delivery to improve patient outcomes. This began designing microfluidic chips capable of recreating fluid mechanics of atelectatic alveoli and monitoring the resulting barrier breakdown real-time. His interest in bioinformatics was sparked when he observed how methodology designed for tissue engineering could be modified to the nano-scale to enable genomic analysis. Additionally, his engineering training provided the framework to apply data-driven modeling techniques, such as finite element analysis, to complex biological systems.
My research is focused on a wide range of topics from computational social sciences to bioinformatics where I do pattern recognition, perform data analysis, and build prediction models. At the core of my effort, there lie machine learning methods by which I have been trying to address problems related to social networks, opinion mining, biomarker discovery, pharmacovigilance, drug repositioning, security analytics, genomics, food contamination, and concussion recovery. I’m particularly interested in and eager to collaborate on cyber security aspect of social media analytics that includes but not limited to misinformation, bots, and fake news. In addition, I’m still pursuing opportunities in bioinformatics, especially about next generation sequencing analysis that can be also leveraged for phenotype predictions by using machine learning methods.
A typical pipeline for developing and evaluating a prediction models to identify malicious Android mobile apps in the market
Dr. Jin Lu is an Assistant Professor of Computer and Information Science at the University of Michigan, Dearborn.
His major research interests include machine learning, data mining, optimization, matrix analysis, biomedical informatics, and health informatics. Two main directions are being pursued:
(1) Large-scale machine learning problems with data heterogeneity. Data heterogeneity is common across many high-impact application domains, ranging from recommendation system to Computer Vision, Bioinformatics and Health-informatics. Such heterogeneity can be present in a variety of forms, including (a) sample heterogeneity, where multiple resources of data samples are available as side information; (b) task heterogeneity, where multiple related learning tasks can be jointly learned to improve the overall performance; (c) view heterogeneity, where complementary information is available from various sources. My research interests focus on building efficient machine learning methods from such data heterogeneity, aiming to improve the learning model by making the best use of all data resources.
(2) Machine learning methods with provable guarantees. Machine learning has been substantially developed and has demonstrated great success in various domains. Despite its practical success, many of the applications involve solving NP-hard problems based on heuristics. It is challenging to analyze whether a heuristic scheme has any theoretical guarantee. My research interest is to employ granular data structure, e.g. sample clusters or features describing an aspect of the sample, to design new theoretically-sound models and algorithms for machine learning problems.