Machine Learning Systems
  • Read
    • Volume I: Foundations
    • Volume II: At Scale

    • Volume I PDF
    • Volume I EPUB

    • Volume II PDF
    • Volume II EPUB
  • Build
    • Labs
    • TinyTorch
    • Hardware Kits
    • MLSys·IM
  • Teach
    • Course Map
    • Lecture Slides
    • Instructor Hub
  • Prepare
    • StaffML
    • Study Plans
    • Gauntlet Mode
  • Connect
    • Newsletter
    • Global Network
    • Workshops & Events
    • Partners & Sponsors
  • About
    • Mission
    • Our Story

    • People
    • Contributors

    • License
  • Support
  • Star
  • Subscribe
  • GitHub
    • Discussions
    • Edit this page
    • Report an issue
    • View source

Machine Learning Systems

TWO-VOLUME TEXTBOOK

Machine Learning
Systems.

The physics of AI engineering.

A rigorous, principles-first treatment of how ML systems are built, optimized, and deployed — from a single machine to fleet-scale infrastructure.

Harvard University · MIT Press 2026

Volume I cover

Volume I

Introduction to Machine Learning Systems

Volume I downloads: HTML PDF EPUB

Volume II cover

Volume II

Machine Learning Systems at Scale

Volume II downloads: HTML PDF EPUB

Explore the Curriculum
↓

A complete curriculum for AI engineering.

For Students & Learners

EXPLORE

Labs

Interactive Marimo notebooks. Change a parameter, see what breaks, build intuition.

Lab 15 · Sustainable AI Explore

BUILD

TinyTorch

Build your own ML framework from scratch across 20 progressive modules. Zero magic.

tinytorch — tensor.py class Tensor: def __init__(self, data): self.data = data self.grad = 0.0 self._backward = lambda: None

MODEL

MLSys·im

First-principles performance modeling. One command, every bottleneck.

$ mlsysim eval llama-3-70b --batch 1 mem-bound compute-bound b=1 b=32 b=128 Arithmetic Intensity FLOP/s

DEPLOY

Hardware Kits

Deploy ML to Arduino, Raspberry Pi, and Jetson. Real memory limits, real power budgets.

Arduino · Raspberry Pi · Jetson

For Career & Instructors

PRACTICE

StaffML

Physics-grounded interview questions for ML systems roles. Vault, drills, and mock interviews.

Systems Design L5 · Staff A 70B model needs 1,000 req/s. Walk through your hardware selection and parallelism strategy. Hardware Parallelism Trade-offs Cloud Edge Mobile TinyML

ADOPT

Instructor Hub

The AI Engineering Blueprint: two-semester syllabi, pedagogy guide, rubrics, and TA handbook.

The Blueprint — Course Architecture ML Systems · Two-Semester Curriculum Semester 1: Foundations 16 wks · Vol I · 8 assignments Semester 2: At Scale 16 wks · Vol II · capstone Assessment Rubrics · Peer review · Grading Teaching Staff Pedagogy · TA handbook READY

TEACH

Lecture Slides

35 Beamer decks with speaker notes and 266 original SVG diagrams. Drop in and teach.

Intro Systems DNN Training Accel Deploy Ethics The Iron Law of ML Systems T = D/BW + O/(R·η) + L Data Term — memory bandwidth Compute Term — utilization η ≤ 0.7 Latency Term — orchestration overhead Harvard University · ML Systems 12 / 38

FOLLOW

Newsletter

Updates on the curriculum, new chapters, and what the community is building.

MLSysBook Weekly 4 New: Vol II Ch. 14 — Fault Tolerance Updated: TinyTorch Module 12 Community: 500+ PRs merged Milestone: 23,000 GitHub stars Join 12,000+ subscribers
Support the Mission
↓

OUR MISSION

AI education should be
free and open to everyone.

Everyone calls AI the new electricity — but electricity is useless without engineers who can build the grid. For AI to be efficient, reliable, and safe, the world needs engineers who understand how to build it.

That knowledge should be accessible to anyone willing to learn. This curriculum is our commitment to making it so.

Live readership — 180+ countries

23,000+ stars · 243,000+ readers · 180+ countries

Our goal: 1,000,000 AI engineers by 2030

Next milestone: 100,000 ★ — we're at 23,000+.
Every star helps others discover this resource.

Star on GitHub

Meet our sponsors & partners →

© 2024-2026 Harvard University. Licensed under CC-BY-NC-SA 4.0

Volume I · Volume II · About · Community · Newsletter