Blogs
My blog represents ideas about research I think about (but not necessarily what I do day-to-day at work). All opinions are my own and do not represent those of my employer.
If you’d like to chat (about my blogs, research, or whatever), feel free to reach out to me on X or email!
-
Jun 2025 — Agentic Models for Pokemon Games
What can Pokemon teach us about designing interactive agents?
-
Mar 2024 — Spending Inference Time
How should we structure inference compute to maximize performance?
-
Feb 2024 — LoRAs as Composable Programs
How can we design LLMs to be future-proof operating systems?
-
Jan 2024 — Unifying RLHF Objectives
What are different RL algorithms actually doing?
Papers
-
Summary — Towards a Universal Decision Making Paradigm
How can we design a universal learning method for sequential decision making? -
Jun 2021 — Decision Transformer
How can we perform reinforcement learning with autoregressive sequence models? -
Mar 2021 — Pretrained Transformers as Universal Computation Engines
What are the limits of transfer of large pretrained language models?
Study Guides
- May 2021 — Probability and Random Processes