Profile

Juhan Bae

baejuhan21 [at] gmail [dot] com
CV    Github    Google Scholar   

Latest Updated: 01. 16. 2025


About Me

I am a researcher in machine learning and artificial intelligence, interested in understanding the relationship between training data and the properties of trained models. I completed my PhD at the University of Toronto in 2025, studying under Roger Grosse. I received my HBSc in Computer Science and Statistics from the same university in 2019.


Selected Publications

Studying Large Language Model Generalization with Influence Functions

Roger Grosse*, Juhan Bae*, Cem Anil*, Nelson Elhage, Alex Tamkin, Amirhossein Tajdini, Benoit Steiner, Dustin Li, Esin Durmus, Ethan Perez, Evan Hubinger, Kamilė Lukošiūtė, Karina Nguyen, Nicholas Joseph, Sam McCandlish, Jared Kaplan, Samuel Bowman

Paper Blog Code Video Slides

Training Data Attribution via Approximate Unrolled Differentiation

Juhan Bae, Wu Lin, Jonathan Lorraine, Roger Grosse

Paper Code Video Slides Poster

What is Your Data Worth to GPT? LLM-Scale Data Valuation with Influence Functions

Sang Keun Choe, Hwijeen Ahn*, Juhan Bae*, Kewen Zhao*, Minsoo Kang, Youngseog Chung, Adithya Pratapa, Willie Neiswanger, Emma Strubell, Teruko Mitamura, Jeff Schneider, Eduard Hovy, Roger Grosse, Eric Xing

Paper Code Slides

If Influence Functions are the Answer, Then What is the Question?

Juhan Bae, Nathan Ng, Alston Lo, Marzyeh Ghassemi, Roger Grosse

Paper Code (PyTorch) Code (Jax) Video Poster

Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models

Laura Ruis, Maximilian Mozes, Juhan Bae, Siddhartha Rao Kamalakara, Dwaraknath Gnaneshwar, Acyr Locatelli, Robert Kirk, Tim Rocktaschel, Edward Grefenstette, Max Bartolo

Paper Blog Code Demo

Influence Functions for Scalable Data Attribution in Diffusion Models

Bruno Mlodozeniec, Runa Eschenhagen, Juhan Bae, Alexander Immer, David Krueger, Richard Turner

Paper

Benchmarking Neural Network Training Algorithms

George Dahl*, Frank Schneider*, Zachary Nado*, Naman Agarwal*, Chandramouli Sastry, Philipp Hennig, Sourabh Medapati, Runa Eschenhagen, Priya Kasimbeg, Daniel Suo, Juhan Bae, Justin Gilmer and 13 more authors

Paper Blog Code

Accelerating Neural Network Training: An Analysis of the AlgoPerf Competition

Priya Kasimbeg, Frank Schneider, Runa Eschenhagen, Juhan Bae, Chandramouli Shama Sastry, Mark Saroufim, Boyuan Feng, Less Wright, Edward Yang, Zachary Nado, Sourabh Medapati, Philipp Hennig, Michael Rabbat, George Dahl

Paper Blog Code

Amortized Proximal Optimization

Juhan Bae*, Paul Vicol*, Jeff HaoChen, Roger Grosse

Paper Code Video Slides Poster

Teaching