I work on understanding how training data shapes the behavior of machine learning models. I'm interested in scaling this kind of analysis to frontier models and using it to study generalization, interpretability, and data valuation. I completed my PhD in computer science at the University of Toronto in 2025.