Blogs

My blog covers research ideas I think about (not necessarily what I do day-to-day at work). All opinions are my own and do not represent those of my employer.

If you’d like to chat (about my blogs, research, or whatever), feel free to reach out to me on X or by email!

Jul 2025 — The Only Important Technology Is The Internet
Why you should care about product-research co-design, and what is the dual of RL?
Jun 2025 — AI Models for Pokemon Games
What can Pokemon teach us about designing interactive agents?
Mar 2024 — Spending Inference Time
How should we structure inference compute to maximize performance?
Feb 2024 — LoRAs as Composable Programs
How can we design LLMs to be future-proof operating systems?
Jan 2024 — Unifying RLHF Objectives
What are different RL algorithms actually doing?

Papers

Summary — Towards a Universal Decision Making Paradigm
How can we design a universal learning method for sequential decision making?
Jun 2021 — Decision Transformer
How can we perform reinforcement learning with autoregressive sequence models?
Mar 2021 — Pretrained Transformers as Universal Computation Engines
What are the limits of transfer of large pretrained language models?

Study Guides

May 2021 — Probability and Random Processes