I am currently a researcher at Thinking Machines.

I was previously a researcher at OpenAI, where I worked on reinforcement learning, small models, and synthetic data. I led the release of 4o-mini, and contributed to other models such as o*-mini and o3 (see About).

I graduated from UC Berkeley, where I worked on reinforcement learning and offline sequence modeling. I was fortunate to be advised by Pieter Abbeel and Igor Mordatch.

Email: matches my arxiv papers

Blog

Papers

Profile Picture