read · learn · build

114 articles. 15 stages. One coherent path.

The theory side of the site — the full organized/ learning path, vendored for offline-friendly publishing. Each article cross-links to the hands-on companion track that turns the theory into running code. Browse below, or open the curriculum view if you'd rather have a starting line picked for you.

114

articles

16

stages

7h 50m

of reading

when you want to build, not just read

Write a tiny LLM from scratch. 17-step from-zero curriculum.

Ship a production stack from open source. Foundations → building → production.

/case-studies →

Four full products composed from the /ship stack — with architecture, eval numbers, retrospectives.

if you read three things, read these

Editor's picks

The Transformer Block

The keystone. If you read one thing on this site, read this — it composes self-attention, residual streams, and the MLP into the unit every modern model is built from.

4 min · 3 demos

RAG Fundamentals

Retrieval-augmented generation, end to end. The single most-deployed pattern in production AI right now.

5 min · 1 demo

Why GPT-3 was undertrained and LLaMA-3 trains 70B on 15T tokens. Chinchilla in plain language.

4 min · 1 demo

path overview

Read me first

Top-level docs that frame the path itself.

overview · 7 min

Learning Path — Detailed Walk-Through

overview · 5 min

AI / ML / AI Engineering — Learning Path

overview · 6 min

Track A — Software Engineer → AI Product Engineer

overview · 6 min

Track B — ML Engineer → LLM Specialist

overview · 7 min

Track C — Complete from Scratch

jump to a stage

Browse by stage

01 Math Foundations 5 02 Ml Fundamentals 7 03 Neural Networks 7 04 Language Modeling 5 05 Tokens Embeddings 5 06 Transformers 6 07 Modern Llms 8 08 Prompting 6 09 Rag 8 10 Fine Tuning 10 11 Agents 8 12 Multimodal 7 13 Production 9 14 Applications 6 15 Engineering Career 4 exercise_solutions 8

01

Math Foundations

Linear algebra, probability, calculus, info theory.

Stage overview & further reading → 4 articles · 21 min

Calculus & Optimization

▶ 1 demo · animated

Information Theory for ML

Linear Algebra for ML

Probability & Statistics for ML

02

Ml Fundamentals

Supervised, unsupervised, evaluation, regularization.

Stage overview & further reading → 6 articles · 23 min

Classical ML Algorithms

Evaluation & Metrics

Loss Functions & Optimization

Regularization & Generalization

Supervised Learning

Unsupervised Learning

▶ 2 demos · animated

03

Neural Networks

Backprop, activations, optimizers, regularization.

Stage overview & further reading → 6 articles · 21 min

Activations & Initialization

Architectures: CNNs and RNNs

Backpropagation

▶ 1 demo · animated

Optimizers

Perceptrons & MLPs

Regularization Techniques

04

Language Modeling

n-grams to RNNs to why transformers won.

Stage overview & further reading → 4 articles · 14 min

n-gram Models

Neural Language Models

RNNs and LSTMs as Language Models

▶ 1 demo · animated

Why Transformers Won

05

Tokens Embeddings

How text becomes vectors.

Stage overview & further reading → 4 articles · 16 min

Contextual Embeddings

Semantic Geometry

Static Embeddings

Tokenization

06

Transformers

Self-attention, multi-head, KV caching, GPT from scratch.

Stage overview & further reading → 5 articles · 20 min

GPT From Scratch

Multi-Head Attention

Positional Encoding

▶ 1 demo · animated

Self-Attention (KQV)

▶ 1 demo · animated

The Transformer Block

07

Modern Llms

Scaling laws, MoE, reasoning models, long context.

Stage overview & further reading → 7 articles · 33 min

Field report: DeepSeek-R1 — reasoning from pure RL, in the open

Frontier Architectures

Long Context

Mixture of Experts (MoE)

▶ 1 demo · animated

Reasoning Models

Scaling Laws

Why LLMs Excel at Code

08

Prompting

Few-shot, CoT, structured outputs, sampling.

Stage overview & further reading → 5 articles · 20 min

Advanced Prompting Techniques

Few-Shot & Chain-of-Thought

Prompt Fundamentals

Sampling & Decoding

▶ 2 demos · animated

Structured Outputs

09

Rag

Chunking, embeddings, hybrid search, reranking, evals.

Stage overview & further reading → 7 articles · 34 min

Advanced Retrieval Patterns

Chunking Strategies

Embedding Models for Retrieval

Evaluating RAG

Hybrid Search & Reranking

▶ 1 demo · animated

RAG Fundamentals

Vector Databases

10

Fine Tuning

SFT, LoRA, RLHF, DPO, GRPO, embedding fine-tuning.

Stage overview & further reading → 9 articles · 51 min

Data & Tooling

Distillation

Embedding Fine-Tuning

Field report: Llama 3 — frontier post-training, in 92 pages

LoRA & QLoRA

Field report: Phi-3 — synthetic data and distillation, in the open

RLHF, DPO, GRPO — Preference and Reward Training

Supervised Fine-Tuning (SFT)

When to Fine-Tune

11

Agents

Loops, tools, memory, planning, multi-agent, browser/vision.

Stage overview & further reading → 7 articles · 33 min

Agent Loop & Architecture

▶ 1 demo · animated

Browser & Vision Agents

Guardrails & Safety for Agents

Memory Systems

Multi-Agent Orchestration

▶ 1 demo · animated

Planning & Reflection

Tool Use & Function Calling

12

Multimodal

CLIP, VLMs, diffusion, video, speech, synthetic data.

Stage overview & further reading → 6 articles · 27 min

Multimodal Embeddings (CLIP)

Speech & TTS

Synthetic Data

Text-to-Image Diffusion

▶ 1 demo · animated

Video Generation

Vision-Language Models (VLMs)

13

Production

Deployment, evals, guardrails, observability, cost, data systems.

Stage overview & further reading → 8 articles · 46 min

Cost & Latency

Data Systems for AI Products

Deployment Architectures

Enterprise Considerations

Evaluation & Benchmarks

Guardrails

Hallucination Mitigation

Observability & Tracing

▶ 1 demo · animated

14

Applications

Text-to-SQL, code, browser agents, finance, case studies.

Stage overview & further reading → 5 articles · 25 min

Browser Agents

Case Studies

Financial Reasoning

Text-to-Code

Text-to-SQL

15

Engineering Career

Roles, learning roadmap, staying current.

Stage overview & further reading → 3 articles · 15 min

AI Engineer Roles

Learning Roadmap

Staying Current

exercise_solutions

Stage overview & further reading → 7 articles · 13 min

Cross-Stage Projects

Stage 01 — Math Foundations: Solutions

Stage 02 — ML Fundamentals: Solutions

Stage 03 — Neural Networks: Solutions

Stage 06 — Transformers: Solutions

Stage 09 — RAG: Solutions

Stage 11 — Agents: Solutions