Public beta · governed AI routing

Your AI agents need HR.
We built it.

You wouldn’t let a human employee access every system, spend unlimited money, and operate without oversight. Why are you letting your AI agents do exactly that?

BrainstormRouter gives every agent an ID badge (Trust Envelope), a corporate card (per-agent budget with atomic reserve), a job description (scoped capabilities), and a performance review (Thompson-sampled quality scoring) — on top of one OpenAI- and Anthropic-compatible API routing across 45 curated models from 8 providers.

Start routing — free tier See the architecture

Models

curated, ranked

Providers

routed on merit

Intelligence

wired systems

Overhead

<1ms

p95 guardian

Why BrainstormRouter

Most gateways are pass-through.
This one learns.

Every request feeds Thompson sampling, validity scoring, and a cost/quality frontier that re-ranks providers in real time. Every response ships with a receipt: which model, which tier, what it cost, what it would have cost.

Learned routing

Thompson sampling chooses per-request, not per-config. Winners get more traffic; losers get fewer. No static rules to maintain.

Verifiable identity

Short-lived mTLS certs via the Cryptographic Agent Framework. Every agent signs in; every call carries a provable origin.

Memory-aware context

SOUL, MEMORY, SKILL, HEARTBEAT, WORKSPACE, REFERENCE — six attributes, 4000 tokens, injected consistently across providers.

Counterfactual receipts

The shadow comparator logs what static routing would have spent. The savings widget shows the delta, live.

Budget enforcement

Per-agent, per-tenant daily caps. Budget forecaster predicts burn and degrades to cheaper tiers before you hit a ceiling.

Semantic cache

pgvector + in-memory hybrid at a 90% similarity threshold. Streaming responses get reconstructed from cache, not just JSON.

Five governance pillars

The control plane, not the crayon box.

Thirteen intelligence systems across five pillars. Every pillar is wired into the hot path — no dashboards without runtime, no runtime without receipts.

Identity

SOUL-first bootstrap

Upload identity → profile, HR record, budget, workspace, JWT. Zero admin ceremony.

Auth

Cryptographic Agent Framework

5-minute mTLS certificates signed by BR's CA. Agent JWTs with tenant scope.

Runtime

Thompson sampling + cascade

Bandit chooses, cascade recovers, validity scores, frontier re-ranks.

Economics

Budget forecaster & guardian

Per-tenant usage, daily caps, pre-request anomaly, counterfactual savings.

Evidence

Receipts & shadow comparator

Every response headers cost, savings, phase prediction, efficiency score.

Counterfactual proof

Every request knows
what static routing would’ve cost.

The shadow comparator records BR’s Thompson decision alongside what a fixed price-only or quality-only router would have picked. Welch’s t-test, Cohen’s d, win-rate confidence interval — exposed over /v1/intelligence/benchmark.

Savings surface on the dashboard and via the X-BR-Routing-Savings header on every response.

POST /v1/chat/completions

# request headers returned
X-BR-Model:              claude-sonnet-4-6
X-BR-Estimated-Cost:      $0.00412
X-BR-Actual-Cost:         $0.00389
X-BR-Routing-Savings:     $0.00124  # vs. static price-only
X-BR-Guardian-Status:     ok
X-BR-Phases:              planning,execution
X-BR-Efficiency:          0.94
X-BR-Guardian-Overhead-Ms: 0.7

# link headers for AI discoverability
Link: </openapi.yaml>; rel="describedby"; type="application/yaml"
Link: </llms.txt>; rel="alternate"; type="text/plain"

Stop paying for pass-through.
Route with evidence.

Drop-in compatible with the OpenAI SDK. Bring your own provider keys — the gateway is free, all 13 intelligence systems on, full receipts.

Start free Read the SDK docs

Your AI agents need HR. We built it.

Most gateways are pass-through. This one learns.