API Reference

Public API surface for MLSys·im models, hardware registries, solvers, and analytical engines.

Core API

Primary objects and resolvers.

hardware
models
infrastructure
systems
platforms	Platform deployment envelopes.
datasets	Dataset zoo — canonical data corpus profiles.
literature
ops
core
engine
solvers	Canonical public solver import surface.
core.provenance.Provenance	How we know a numeric value (package audit trail; not BibTeX).
core.provenance.ProvenanceKind
core.provenance.Sourced	Scalar with mandatory `Provenance`. Subclasses `float` so appendix
engine.calibration	Parameters for analytical solvers and the roofline engine.
fmt.fmt	Format a Pint Quantity (or plain number) for narrative text.
fmt.fmt_int	Format a value as an integer for narrative text.
physics	Canonical physics and accounting formulas for ML systems.
solvers.SingleNodeModel	Resolves single-node hardware Roofline bounds and feasibility.
solvers.NetworkRooflineModel	Analyzes the Distributed Performance Bounds (The Network Wall).
solvers.EfficiencyModel	Models the gap between peak and achieved FLOPS (Wall 3: Software Efficiency).
solvers.ForwardModel	Forward-evaluating mechanistic engine (Y = f(X)).
solvers.ServingModel	Analyzes the two-phase LLM serving lifecycle: Pre-fill vs. Decoding.
solvers.TrainingMemoryModel	Decomposes per-accelerator training memory into teachable components.
solvers.ServingCapacityModel	Sizes an LLM serving deployment from a QPS and tail-latency target.
solvers.ContinuousBatchingModel	Analyzes production LLM serving with Continuous Batching and PagedAttention.
solvers.WeightStreamingModel	Analyzes Wafer-Scale inference (e.g., Cerebras CS-3) using Weight Streaming.
solvers.TailLatencyModel	Analyzes queueing delays and P99 tail latency for deployed inference models.
solvers.DataModel	Analyzes the ‘Data Wall’ — the throughput bottleneck between storage and compute.
solvers.TransformationModel	Quantifies the CPU preprocessing bottleneck (Wall 9: Transformation).
solvers.TopologyModel	Models bisection bandwidth for different network topologies (Wall 10).
solvers.ScalingModel	Analyzes the ‘Scaling Physics’ of model training (Chinchilla Laws).
solvers.InferenceScalingModel	Models inference-time compute scaling (Wall 12: Reasoning/CoT Cost).
solvers.CompressionModel	Analyzes model compression trade-offs (Accuracy vs. Efficiency).
solvers.DistributedModel	Resolves fleet-wide communication, synchronization, and pipelining constraints.
solvers.MoERoutingModel	Models first-order MoE routing imbalance and expert-parallel all-to-all cost.
solvers.ReliabilityModel	Calculates Mean Time Between Failures (MTBF) and optimal checkpointing intervals.
solvers.OrchestrationModel	Analyzes Cluster Orchestration and Queueing (Little’s Law).
solvers.EconomicsModel	Calculates Total Cost of Ownership (TCO) including Capex and Opex.
solvers.SustainabilityModel	Calculates Datacenter-scale Sustainability metrics.
solvers.CheckpointModel	Analyzes the storage constraints and I/O burst penalties of saving model states.
solvers.ResponsibleEngineeringModel	Models the computational cost of responsible AI practices (Wall 20: Safety).
solvers.SensitivitySolver	Identifies the binding constraint via numerical sensitivity analysis (Wall 21).
solvers.SynthesisSolver	Given an SLA, synthesizes the required hardware specs (Wall 22: Inverse Solve).
solvers.ParallelismOptimizer	Searches for the optimal 3D/4D parallelism split (DP, TP, PP, EP).
solvers.BatchingOptimizer	Finds the maximum batch size that satisfies a P99 latency SLA.
solvers.PlacementOptimizer	Finds the optimal datacenter location to minimize TCO and Carbon.
engine.dse.DSE	Declarative Design Space Exploration (DSE) Engine.