AI Agents from First Principles
A Guide for Debugging LLM Training Data
Llama 4: The Challenges of Creating a Frontier-Level LLM
Vision Large Language Models (vLLMs)
nanoMoE: Mixture-of-Experts (MoE) LLMs from Scratch in PyTorch
Demystifying Reasoning Models
Mixture-of-Experts (MoE) LLMs
Scaling Laws for LLMs: From GPT-3 to o3