My research interests lie in design and analysis of randomized controlled trials (RCTs), partial identification, identification and inference with multi-valued treatments and instruments, and quantile regression. In one recent paper I study the optimal stratified randomization procedure in RCTs, and found a certain kind of matched-pair design is optimal. In another paper (coauthored with Joe Romano and Azeem Shaikh), we provide asymptotically exact inference procedure for matched-pair designs. In another paper we study inference with moment inequalities whose dimension grows exponentially fast with the sample size. I also have a paper in which we study the sharp identified sets for various treatment effects with multi-valued instruments and multi-values treatments.
I am interested in how governance, communities, and inequality emerge in sociotechnical systems, and how the structure of sociotechnical systems encodes and reinforces these processes. To those ends, I develop empirical data and computational methods, focusing on latent variable models; statistical inference in networks; empirical design to study governance in organizations, platforms, and computational social systems; and causal inference and measurement in observational data.
Several sample projects:
> developing empirical populations of networks to infer social and ecological processes encoded in networks
> using probabilistic methods to infer the structure and dynamics of the illicit wildlife trade
> building from theory from political science, statistics, and education to disentangle issues of “bias” in computational systems