API Reference

Core API

Primary objects and solvers.

hardware
models
infra
systems
core
core.solver.SingleNodeModel Resolves single-node hardware Roofline bounds and feasibility.
core.solver.ServingModel Analyzes the two-phase LLM serving lifecycle: Pre-fill vs. Decoding.
core.solver.DistributedModel Resolves fleet-wide communication, synchronization, and pipelining constraints.
core.solver.DataModel Analyzes the ‘Data Wall’ — the throughput bottleneck between storage and compute.
core.solver.ScalingModel Analyzes the ‘Scaling Physics’ of model training (Chinchilla Laws).
core.solver.OrchestrationModel Analyzes Cluster Orchestration and Queueing (Little’s Law).
core.solver.CompressionModel Analyzes model compression trade-offs (Accuracy vs. Efficiency).
core.solver.SustainabilityModel Calculates Datacenter-scale Sustainability metrics.
core.solver.EconomicsModel Calculates Total Cost of Ownership (TCO) including Capex and Opex.
core.solver.ContinuousBatchingModel Analyzes production LLM serving with Continuous Batching and PagedAttention.
core.solver.WeightStreamingModel Analyzes Wafer-Scale inference (e.g., Cerebras CS-3) using Weight Streaming.
core.solver.TailLatencyModel Analyzes queueing delays and P99 tail latency for deployed inference (M/M/c).
core.solver.ReliabilityModel Calculates Mean Time Between Failures (MTBF) and optimal checkpointing intervals.
core.solver.CheckpointModel Analyzes checkpoint I/O burst penalties and MFU impact.
Back to top