Skip to content

Counterfactual Routing Replay

Pick a routing rule. We replay your last 30 days of real traffic against it and show the projected cost, latency, and quality difference vs your current rule. No competitor ships this feature in 2026.

Replay window
30 days · 17,787 requests
Counterfactual model under "Cost-first": llama-3.3-70b
Actual cost (last 30d)
$1210.30
Projected cost (counterfactual)
$26.66
Projected savings
$1183.64
97.8% cheaper
Actual p50 latency (weighted)
910 ms
Projected p50 latency
480 ms(-430 ms)
How this works: we keep your last 30 days of real traffic (request count, token sizes, current model). When you select a counterfactual rule, we recompute the cost and latency as if the new rule had been in effect — no actual model invocation needed.