Blogs
My blog covers research ideas I think about, which are not necessarily what I work on day to day. All opinions are my own and do not represent those of my employer.
-
Jul 2025 — The Only Important Technology Is The Internet
Why you should care about product-research co-design, and what is the dual of RL?
-
Jun 2025 — AI Models for Pokemon Games
What can Pokemon teach us about designing interactive agents?
-
Mar 2024 — Spending Inference Time
How should we structure inference compute to maximize performance?
-
Feb 2024 — LoRAs as Composable Programs
How can we design LLMs to be future-proof operating systems?
-
Jan 2024 — Unifying RLHF Objectives
What are different RL algorithms actually doing?
Papers
-
Summary — Towards a Universal Decision Making Paradigm
How can we design a universal learning method for sequential decision making?
-
Jun 2021 — Decision Transformer
How can we perform reinforcement learning with autoregressive sequence models?
-
Mar 2021 — Pretrained Transformers as Universal Computation Engines
What are the limits of transfer of large pretrained language models?
Study Guides
-
May 2021 — Probability and Random Processes