case studies · whole products · what you'd ship
Real products from the /ship stack.
/build taught you the model. /ship taught you the production stack. These case studies show what real products look like when you compose that stack — architecture, the code unique to each product, the eval story, and an honest retrospective. Every study cross-references the /ship steps it builds on, so you can plug back in to the foundations any time.
If /ship taught you the parts, this teaches the products
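Five products, four shapes
The studies cover four product shapes: retrieval-heavy, agent-heavy, multi-agent-heavy, and all-of-the-above. CS-05 revisits the retrieval-heavy shape of CS-01 as a distilled model, which is why five products make four shapes. The studies are numbered but not sequential: pick whichever shape your product looks most like.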
- 01 Docs assistant with citations: RAG over a real corpus, strict citation requirements, zero hallucination tolerance. The product /ship/06–08 wants to be.
- 02 Code-review agent: Reads a PR diff, runs tests, comments inline, produces a verdict. The product /ship/09–10 wants to be.
- 03 Research assistant: Multi-agent fan-out for cited briefs. The product /ship/11 wants to be, with the cost/latency trade-offs in real numbers.
- 04 Customer-support bot: RAG + a tools layer that hits a (mock) CRM + escalation logic. The product that composes the entire /ship stack.
- 05 The cheapest version of itself: Take the docs assistant from CS-01 and distill it into a 7B student. Same retrieval, same citation contract, ~7× cheaper, ~5pp eval gap. The product that pays for itself.
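To make "citation contract" concrete before you open CS-01 or CS-05, here is a minimal sketch of what such a check could look like. It is not the code from either study. The `[doc-id]` marker format, the function name `check_citation_contract`, and the rule "every sentence cites at least one retrieved chunk" are all assumptions chosen for illustration.

```python
import re
from dataclasses import dataclass

# Assumed marker format (hypothetical): a chunk id in square brackets, e.g. "[auth-3]".
CITATION = re.compile(r"\[([A-Za-z0-9_-]+)\]")


@dataclass
class ContractResult:
    ok: bool
    uncited_sentences: list
    unknown_citations: list


def check_citation_contract(answer: str, retrieved_ids: set) -> ContractResult:
    """Check an answer against an assumed contract:
    1. every sentence carries at least one citation marker, and
    2. every cited id belongs to the set of retrieved chunk ids.
    """
    # Naive sentence split: break after ., ! or ? when followed by whitespace.
    # Assumes the citation marker sits before the sentence's final punctuation.
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", answer) if s.strip()]

    uncited = []
    unknown = []
    for sentence in sentences:
        cited = CITATION.findall(sentence)
        if not cited:
            uncited.append(sentence)
        # Ids that were cited but never retrieved count as violations.
        unknown.extend(c for c in cited if c not in retrieved_ids)

    return ContractResult(
        ok=not uncited and not unknown,
        uncited_sentences=uncited,
        unknown_citations=unknown,
    )


if __name__ == "__main__":
    result = check_citation_contract(
        "Tokens expire after 24 hours [auth-3]. Refresh is automatic.",
        {"auth-3", "auth-7"},
    )
    print(result)
```

A check like this only verifies that citations are present and point at retrieved chunks. It does not verify that the cited chunk actually supports the sentence, which is a job for the eval story in each study.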
How this fits the rest of the site
Case studies are the third leg. /build teaches the model from scratch; /ship teaches the production stack; /case-studies shows what real products look like when the stack composes. Where /ship is "step by step from nothing to a deployed service," each case study is "here's a finished product, here's what made it hard, here's the code unique to it."
Each study cross-links aggressively into /ship for foundations and /articles for theory, so you don't have to re-read everything to follow along. If something seems too compressed, the underlying step is one click away.
Status: 5 live · 0 wip · 0 stubbed.