11

stage · curriculum

Agents

An LLM in a loop with tools. The architecture is simple; making it reliable in production is not. Build a 100-line single-agent loop without a framework first, then ask whether a framework would actually help.

7 articles
33 min to read
5 demos
1 book
if you only do one thing

An LLM in a loop with tools. The architecture is simple; the failure modes aren't. Step through three traces — happy, modal-interrupt, layout-shift — to see what 'reliable' looks like.

Articles in this stage

  1. 01 Agent Loop & Architecture
  2. 02 Browser & Vision Agents
  3. 03 Guardrails & Safety for Agents
  4. 04 Memory Systems
  5. 05 Multi-Agent Orchestration
  6. 06 Planning & Reflection
  7. 07 Tool Use & Function Calling

Stage 11 — Agents

An agent is an LLM in a loop with tools. The model takes an action, observes the result, decides what to do next, and repeats until done. This loop is the basic unit of every “AI agent” you’ve seen — Claude Code, ChatGPT browsing, code-fixing bots, customer support agents, browser agents.

The architecture is simple. Making it reliable in production is not.

Prerequisites

  • Stage 08 (prompting, structured outputs)
  • Stage 09 (RAG, for retrieval-augmented agents)

Learning ladder

  1. Agent loop & architecture — the core pattern
  2. Tool use & function calling — how the model invokes external capabilities
  3. Memory systems — working, episodic, semantic
  4. Planning & reflection — ReAct, plan-and-execute, reflexion
  5. Multi-agent orchestration — supervisor, swarm, debate
  6. Guardrails & safety — keeping agents inside their lane
  7. Browser & vision agents — the embodied frontier

MVU

You can:

  • Build a single-agent loop in <100 lines of code without a framework
  • Define tools with clean schemas and good error semantics
  • Articulate when to add a second agent vs more tools to one
  • Prevent the most common failure modes (loops, drift, runaway cost)

Exercise

Build an agent that can search the web, read pages, and answer multi-hop questions. No agent framework allowed for the first version. Then add: a tool registry, basic memory (summarize old turns), retry logic, a budget cap. Then ask: would a framework actually help me here?

Why this stage matters

In 2026, “agents” is what most product teams want to ship. Most of them ship something that works in demos but fails in production. The difference is in this stage’s content: tool design, memory management, error handling, evaluation.

Hands-on companions

This stage has the most code-side companion content on the site. After the theory:

Ship the agent stack:

See it as a real product — three case studies, increasing in complexity:

See also

Further reading

Books move slower than papers in this field — treat these as foundations, not replacements for the latest research. Real authors, real publishers, real editions. Free badges mark books with author-authorized full text online.