I’m a researcher currently at OpenAI, working on reinforcement learning and synthetic data; some models I’ve contributed to are 4o-mini, o*-mini, and o3.
I graduated from UC Berkeley, where I was advised by Pieter Abbeel and Igor Mordatch. Email: matches my arxiv papers
Quick highlights:
Blogs
- Mar 2024 — Spending Inference Time
- Feb 2024 — LoRAs as Composable Programs
- Feb 2024 — Unifying RLHF Objectives
Papers
- Jun 2021 — Decision Transformer
- Mar 2021 — Pretrained Transformers as Universal Computation Engines
