Samuel K Handelman, Ph.D., is Research Assistant Professor in the department of Internal Medicine, Gastroenterology, of Michigan Medicine at the University of Michigan, Ann Arbor. Prof. Handelman is focused on multi-omics approaches to drive precision/personalized-therapy and to predict population-level differences in the effectiveness of interventions. He tends to favor regression-style and hierarchical-clustering approaches, partially because he has a background in both statistics and in cladistics. His scientific monomania is for compensatory mechanisms and trade-offs in evolution, but he has a principled reason to focus on translational medicine: real understanding of these mechanisms goes all the way into the clinic. Anything less that clinical translation indicates that we don’t understand what drove the genetics of human populations.
Jinseok Kim, Ph.D., is Research Assistant Professor in the Institute for Social Research at the University of Michigan, Ann Arbor. Prof. Kim works on resolving named entity ambiguity in large-scale scholarly data (publication, patent, and funding records) in digital libraries. Especially, his current research is focused on developing methods for disambiguating author and affiliation names at a digital library scale using various supervised machine learning approaches trained on automatically labeled data . Disambiguated data from multiple sources will be integrated to be analyzed for insights into research production, scientific collaboration, funding evaluation, and research policy at a national level.
Michael Cafarella, PhD, is Associate Professor of Electrical Engineering and Computer Science, College of Engineering and Faculty Associate, Survey Research Center, Institute for Social Research, at the University of Michigan, Ann Arbor.
Prof. Cafarella’s research focuses on data management problems that arise from extreme diversity in large data collections. Big data is not just big in terms of bytes, but also type (e.g., a single hard disk likely contains relations, text, images, and spreadsheets) and structure (e.g., a large corpus of relational databases may have millions of unique schemas). As a result, certain long-held assumptions — e.g., that the database schema is always known before writing a query — are no longer useful guides for building data management systems. As a result, my work focuses heavily on information extraction and data mining methods that can either improve the quality of existing information or work in spite of lower-quality information.