BrainstormRouter
Public beta · governed AI routing

Your AI agents need HR.
We built it.

You wouldn’t let a human employee access every system, spend unlimited money, and operate without oversight. Why are you letting your AI agents do exactly that?

BrainstormRouter gives every agent an ID badge (Trust Envelope), a corporate card (per-agent budget with atomic reserve), a job description (scoped capabilities), and a performance review (Thompson-sampled quality scoring) — on top of one OpenAI- and Anthropic-compatible API routing across 36 curated models from 8 providers.

Models
36
curated, ranked
Providers
8
routed on merit
Intelligence
13
wired systems
Overhead
<1ms
p95 guardian
Why BrainstormRouter

Most gateways are pass-through.
This one learns.

Every request feeds Thompson sampling, validity scoring, and a cost/quality frontier that re-ranks providers in real time. Every response ships with a receipt: which model, which tier, what it cost, what it would have cost.

Learned routing

Thompson sampling chooses per-request, not per-config. Winners get more traffic; losers get fewer. No static rules to maintain.

Verifiable identity

Short-lived mTLS certs via the Cryptographic Agent Framework. Every agent signs in; every call carries a provable origin.

Memory-aware context

SOUL, MEMORY, SKILL, HEARTBEAT, WORKSPACE, REFERENCE — six attributes, 4000 tokens, injected consistently across providers.

Counterfactual receipts

The shadow comparator logs what static routing would have spent. The savings widget shows the delta, live.

Budget enforcement

Per-agent, per-tenant daily caps. Budget forecaster predicts burn and degrades to cheaper tiers before you hit a ceiling.

Semantic cache

pgvector + in-memory hybrid at a 90% similarity threshold. Streaming responses get reconstructed from cache, not just JSON.

Counterfactual proof

Every request knows
what static routing would’ve cost.

The shadow comparator records BR’s Thompson decision alongside what a fixed price-only or quality-only router would have picked. Welch’s t-test, Cohen’s d, win-rate confidence interval — exposed over /v1/intelligence/benchmark.

POST /v1/chat/completions
# request headers returned
X-BR-Model:              claude-sonnet-4-6
X-BR-Estimated-Cost:      $0.00412
X-BR-Actual-Cost:         $0.00389
X-BR-Routing-Savings:     $0.00124  # vs. static price-only
X-BR-Guardian-Status:     ok
X-BR-Phases:              planning,execution
X-BR-Efficiency:          0.94
X-BR-Guardian-Overhead-Ms: 0.7

# link headers for AI discoverability
Link: </openapi.yaml>; rel="describedby"; type="application/yaml"
Link: </llms.txt>; rel="alternate"; type="text/plain"

Stop paying for pass-through.
Route with evidence.

Drop-in compatible with the OpenAI SDK. Bring your own provider keys — the gateway is free, all 13 intelligence systems on, full receipts.