Our data architecture combines naturally-occurring data from research grant inputs with scientific outputs including publications, citations, dissertations, and patents, as well as with biographic data on researchers scraped from the web and in databases. These data integrate with STAR METRICS administrative data on grant purchases and employment, which can in turn be linked to Longitudinal Employer-Household Dynamics (LEHD) Census data enabling individuals to be traced as they move across employers and start businesses. These data are then linked using cutting edge disambiguation/name-entity resolution, web scraping and entity extraction. This IRIS methodology is advancing the underlying computational sciences and creating more useful data for broader applications.
My research focuses on developing and applying computational and data-enabled methodology in the broader area of sustainability. Main thrusts are as follows:
1. Human mobility dynamics. I am interested in mining large-scale real-world travel trajectory data to understand human mobility dynamics. This involves the processing and analyzing travel trajectory data, characterizing individual mobility patterns, and evaluating environmental impacts of transportation systems/technologies (e.g., electric vehicles, ride-sharing) based on individual mobility dynamics.
2. Global supply chains. Increasingly intensified international trade has created a connected global supply chain network. I am interested in understanding the structure of the global supply chain network and economic/environmental performance of nations.
3. Networked infrastructure systems. Many infrastructure systems (e.g., power grid, water supply infrastructure) are networked systems. I am interested in understanding the basic structural features of these systems and how they relate to the system-level properties (e.g., stability, resilience, sustainability).
My research is broadly concerned with corporate governance and the effects of finance on society. Recent writings examine how ideas about corporate social responsibility have evolved to meet changes in the structures and geographic footprint of multinational corporations; whether “shareholder capitalism” is still a viable model for economic development; how income inequality in an economy is related to corporate size and structure; why theories about organizations do (or do not) progress; how architecture shapes social networks and innovation in organizations; why stock markets spread to some countries and not others; and whether there exist viable organizational alternatives to shareholder-owned corporations in the United States. Recent publications are available at http://webuser.bus.umich.edu/gfdavis/articles.htm.
Jun Li’s main research interests are empirical operations management and business analytics, with special emphases on revenue management, pricing, consumer behavior, economic and social networks. She has worked extensively with large-scale data, including transactions, pricing, inventory and capacity, consumer online search and click stream data, supply chain relationships and disruptions, clinical and healthcare claims. She is the Winner of INFORMS Revenue Management and Pricing Practice Award for her close collaboration with retailing practitioners in implementing best response pricing algorithms. Her paper on airline pricing and consumer behavior is the finalist for Best Management Science Papers in Operations Management 2012 to 2014. She is also the principal investigator of a National Science Foundation funded project: “Gaining Visibility Into Supply Network Risks Using Large-Scale Textual Analysis”. Her work has enjoyed coverage by The Economist, New York Times and Forbes.
My research group is engaged in fundamental research in the following areas: Statistical learning theory: We are developing theory and algorithms for predictions problems (e.g., learning to rank and multilabel learning) with complex label spaces and where the available human supervision is often weak. Sequential prediction in a game theoretic framework: We are trying to understand the power and limitations of sequential predictions algorithms when no probabilistic assumptions are placed on the data generating mechanism. High dimensional and network data analysis: We are developing scalable algorithms with provable performance guarantees for learning from high dimensional and network data. Optimization algorithms: We are creating incremental, distributed and parallel algorithms for machine learning problems arising in today’s data rich world. Reinforcement learning: We are synthesizing concepts and techniques from artificial intelligence, control theory and operations research for pushing the frontier in sequential decision making with a focus on delivering personalized health interventions via mobile devices. My research group is pursuing and continues to actively search for challenging machine learning problems that arise across disciplines including behavioral sciences, computational biology, computational chemistry, learning sciences, and network science.