My research interests are in natural language semantics and psycholinguistics, focusing on verbs. I conduct behavioral psycholinguistic experiments with methodologies such as self-paced reading and maze tasks, as well as surveys of linguistic and semantic judgments. I also study semantic variation using corpora and datasets such as the Twitter Decahose, to better understand how words have developed diverging meanings in different communities, age groups, or regions. I primarily use R and Python to collect, manage, and analyze data. I direct the UM WordLab in the linguistics department, working with students (especially undergraduates) on experimental and computational research focusing on lexical representations.
Study of Pandemic Publishing: How Scholarly Literature is Affected by COVID-19 Pandemic
This project addresses the quality of recent COVID-19 publications. During the COVID-19 pandemic, researchers have published much of their research as preprints. While preprints are an important development in scholarly publishing, they are works in progress that need further refinement to become a more rigorous final product. Scholarly publishers are also taking initiatives to accelerate the publication process, for example by asking reviewers to curtail requests for additional experiments upon revision. Sacrificing rigor for haste inevitably increases the likelihood of article correction and retraction, leading to the spread of false information within supposedly trustworthy sources that have a peer-review process in place to ensure proper verification. I study the quality of COVID-19-related scholarly works by using CADRE’s datasets to identify signs of incoherence, irreproducibility, and haste.
We have developed and tested machine learning approaches to integrate quantitative markers for the diagnosis and assessment of progression of temporomandibular joint osteoarthritis (TMJ OA). We have also extended the capabilities of 3D Slicer4 into web-based tools and disseminated open-source image analysis tools. Our aims use data processing and in-depth analytics combined with learning using privileged information, integrated feature selection, and testing the performance of longitudinal risk predictors. Our long-term goals are to improve diagnosis and risk prediction of TMJ OA in future multicenter studies.
The Spectrum of Data Science for Diagnosis of Osteoarthritis of the Temporomandibular Joint
Greg’s research primarily investigates information flow in financial markets and the actions of agents in those markets, both consumers and producers of that information. His approach draws on theory from the social sciences (economics, psychology, and sociology) combined with large data sets from diverse sources and a variety of data science approaches. Most projects combine data from multiple sources, including commercial databases, experimentally created data, and data extracted from sources designed for other uses (commercial media, web scraping, cellphone data, etc.). In addition to a wide range of econometric and statistical methods, his work has included applying machine learning, textual analysis, mining social media, processes for missing data, and combining mixed media.
The long temporal and large spatial scales of ecological systems make controlled experimentation difficult and the amassing of informative data challenging and expensive. The resulting sparsity and noise are major impediments to scientific progress in ecology, which therefore depends on efficient use of data. In this context, it has in recent years been recognized that the onetime playthings of theoretical ecologists, mathematical models of ecological processes, are no longer exclusively the stuff of thought experiments, but have great utility in the context of causal inference. Specifically, because they embody scientific questions about ecological processes in sharpest form—making precise, quantitative, testable predictions—the rigorous confrontation of process-based models with data accelerates the development of ecological understanding. This is the central premise of my research program and the common thread of the work that goes on in my laboratory.
Current research includes a project funded by Toyota that uses Markov models and machine learning to predict heart arrhythmia, an NSF-funded project to detect Acute Respiratory Distress Syndrome (ARDS) from X-ray images, and projects using tensor analysis on health care data (funded by the Department of Defense and the National Science Foundation).
Dr. Lee’s research in data science concerns biological questions in systems biology and network medicine by developing algorithms and models through a combination of statistical/machine learning, information theory, and network theory applied to multi-dimensional large-scale data. His projects have covered genomics, transcriptomics, proteomics, and metabolomics from yeast to mouse to human for integrative analysis of regulatory networks on multiple molecular levels, which also incorporates large-scale public databases such as GO for functional annotation, PDB for molecular structures, and PubChem and LINCS for drugs or small compounds. He previously carried out proteomics and metabolomics along with a computational derivation of dynamic protein complexes for IL-3 activation and cell cycle in murine pro-B cells (Lee et al., Cell Reports 2017), for which he developed integrative analytical tools using diverse approaches from machine learning and network theory. His ongoing interests in methodology include machine/deep learning and topological Kolmogorov-Sinai entropy-based network theory, which are applied to (1) multi-level dynamic regulatory networks in immune response, cell cycle, and cancer metabolism and (2) mass spectrometry-based omics data analysis.
Somangshu (Sam) Mukherji, PhD, is Assistant Professor of Music Theory in the School of Music, Theatre & Dance at the University of Michigan, Ann Arbor.
Sam Mukherji’s work lies at the interface of traditional Western tonal theory, the theory and practice of popular and non-Western idioms, and the cognitive science of music. Within this framework, the main focus of his research has been on the prolongational, grammatical aspects of Western tonality and their connection to the tonal structures of Indian music and the blues-based traditions within rock and metal. This emphasis makes his work similar to that of a linguist who explores relationships between the world’s languages, and Mukherji’s research has accordingly been influenced by ideas from linguistic theory, especially the Minimalist Program in contemporary generative linguistics. For this reason, he has investigated connections not only between different musical idioms but also, more generally, between music and language and between musical and linguistic theory. Much of his work explores overlaps between Minimalist linguistics and related generative approaches within music theory (such as those found in the writings of Heinrich Schenker), and he has also written extensively about what such ‘musicolinguistic’ connections imply for the wider study of human musical behavior, cognition, and evolution.
Lu Wei, DSc, is Assistant Professor in the Department of Electrical and Computer Engineering at the University of Michigan, Dearborn.
Prof. Wei studies the analytical properties of interacting particle systems relevant to both classical and quantum information theory.