Welcome to the RTG project page
The emerging era of big data has brought with it new unique challenges in both research and training in Statistics. For the new types of statistical problems researchers now aim to solve, the size of available data has grown immensely in many cases, and the nature of the data has changed no less dramatically. Statisticians now work routinely with data that combine many different kinds of observations, from genetic data to brain images to smartphone data. This creates a need for new training approaches and their close integration with current research directions, so that PhD students and postdocs are prepared to take on new challenges as they become independent researchers. It also creates an opportunity for recruiting undergraduates into the field, increasing and diversifying the domestic STEM workforce. This project will train undergraduate and graduate students and postdocs in modern techniques for dynamic big data with complex structures, in modern teaching methods for statistics, and provide mentoring on all aspects of professional development.
This project brings together three interlinked research streams: (1) statistical network analysis, (2) inference for dynamic systems, and (3) sequential decision making. This project will contribute to each of these areas, developing (1) realistic models for network community detection, link prediction and dynamically evolving networks, and tools for utilizing network connections to improve prediction of outcomes of interest on network-linked data; (2) practical algorithms with provably good properties for fitting complex partially observed Markov process models, with an emphasis on scalability; (3) sequential decision making algorithms based on reinforcement learning, with the goal of achieving excellent prediction performance and discovering interpretable decision variables. Each research stream will offer a short intensive graduate course and a regular interdisciplinary student workshop. Equally importantly, the streams will collaborate on topics that cut across these areas, such as inference for dynamically evolving networks or the role of social connections in predicting behavior and their impact on sequential decision making. Training undergraduates, PhD students, and postdocs in topics at the cutting edge of modern statistics will contribute to supplying much-needed statisticians and data scientists to both academia and industry, increasing and diversifying the STEM workforce. All three research streams have broad applications to areas beyond Statistics, such as neuroimaging, infectious disease transmission, and mobile health interventions. The project is thus expected to have wide-ranging impact on how the problems statisticians study are approached by domain scientists.
Faculty Investigators
- Liza Levina (PI)
- Xuming He (co-PI)
- Edward Ionides (co-PI)
- Ambuj Tewari (co-PI)
- Moulinath Banerjee
- Johann Gagnon-Bartsch
- Ben Hansen
- Stilian Stoev
- Ji Zhu
Trainee News
- Daniel Zhang graduates and starts a new job at Facebook, July 2019
- Jack Goetz starts a summer internship at Facebook, May 2019
- Zhiyuan (Julian) Lu successfully defends his PhD thesis. He will be joining the National Center for Toxicological Research, May 2019
- Caleb Ki and Drew Yarger win prestigious NSF Graduate Research Fellowships, April 2019
Trainees
Faculty advisors are mentioned (in parentheses) next to the trainee names.
Postdoctoral Scholars
- Mark Fredrickson (Hansen) | More info
- Keith Levin (Levina) | More info
- Asad Lodhia (Levina)
Graduate Students
- Robyn Ferg (Gagnon-Bartsch)
- Jonathan (Jack) Goetz (Tewari) | More info
- Greg Hunt (Gagnon-Bartsch) | More info
- Dan Kessler (Levina)
- Caleb Ki (Ionides)
- Tim Lycurgus (Hansen) | More info
- Laura Niss (Tewari) | More info
- Zoe Rehnberg (Gagnon-Bartsch) | More info
- James (Ed) Wu (Gagnon-Bartsch) | More info
- Drew Yarger (Stoev)
Undergraduate Students
- David Geering (Banerjee)
- William Klinenberg (Levina)
- John Nowland (Stoev)
- Evan Pesch (Stoev)
Alumni
- Zhiyuan (Julian) Lu (Banerjee) | Next step: Researcher, National Center for Toxicological Research
- Jarvis Miller (Ionides) | Next step: Data Scientist, BuzzFeed
- Adam Rauh (Hansen) | Next step: Statistical computation specialist, MIT
- Daniel Zhang (Tewari) | More info | Next step: Software Engineer, Facebook