I’m a researcher currently at OpenAI, working on reinforcement learning and synthetic data. I graduated from UC Berkeley, where I was advised by Pieter Abbeel and Igor Mordatch.
Email: matches my arxiv papers
Quick highlights:
Blogs
- Mar 2024 — Spending Inference Time
- Feb 2024 — LoRAs as Composable Programs
- Feb 2024 — Unifying RLHF Objectives
Papers
- Jun 2021 — Decision Transformer
- Mar 2021 — Pretrained Transformers as Universal Computation Engines
