I’m currently a researcher at OpenAI. I work on reinforcement learning, small models, and synthetic data. I led the release of 4o-mini and have contributed to other models such as o*-mini and o3.

Before that, I worked at Hudson River Trading and Meta AI, where I researched flavors of sequential decision making and deep learning.

I graduated from UC Berkeley, where I worked on reinforcement learning and modeling offline sequence data. I was fortunate to be advised by Pieter Abbeel and Igor Mordatch.

Email: matches my arXiv papers

Blog

Papers
