Learned routing
Thompson sampling chooses per-request, not per-config. Winners get more traffic; losers get fewer. No static rules to maintain.
You wouldn’t let a human employee access every system, spend unlimited money, and operate without oversight. Why are you letting your AI agents do exactly that?
BrainstormRouter gives every agent an ID badge (Trust Envelope), a corporate card (per-agent budget with atomic reserve), a job description (scoped capabilities), and a performance review (Thompson-sampled quality scoring) — on top of one OpenAI- and Anthropic-compatible API routing across 36 curated models from 8 providers.
Every request feeds Thompson sampling, validity scoring, and a cost/quality frontier that re-ranks providers in real time. Every response ships with a receipt: which model, which tier, what it cost, what it would have cost.
Thompson sampling chooses per-request, not per-config. Winners get more traffic; losers get fewer. No static rules to maintain.
Short-lived mTLS certs via the Cryptographic Agent Framework. Every agent signs in; every call carries a provable origin.
SOUL, MEMORY, SKILL, HEARTBEAT, WORKSPACE, REFERENCE — six attributes, 4000 tokens, injected consistently across providers.
The shadow comparator logs what static routing would have spent. The savings widget shows the delta, live.
Per-agent, per-tenant daily caps. Budget forecaster predicts burn and degrades to cheaper tiers before you hit a ceiling.
pgvector + in-memory hybrid at a 90% similarity threshold. Streaming responses get reconstructed from cache, not just JSON.
Thirteen intelligence systems across five pillars. Every pillar is wired into the hot path — no dashboards without runtime, no runtime without receipts.
Upload identity → profile, HR record, budget, workspace, JWT. Zero admin ceremony.
5-minute mTLS certificates signed by BR's CA. Agent JWTs with tenant scope.
Bandit chooses, cascade recovers, validity scores, frontier re-ranks.
Per-tenant usage, daily caps, pre-request anomaly, counterfactual savings.
Every response headers cost, savings, phase prediction, efficiency score.
The shadow comparator records BR’s Thompson decision alongside
what a fixed price-only or quality-only router would have picked.
Welch’s t-test, Cohen’s d, win-rate confidence interval
— exposed over /v1/intelligence/benchmark.
X-BR-Routing-Savings header on every
response.
# request headers returned X-BR-Model: claude-sonnet-4-6 X-BR-Estimated-Cost: $0.00412 X-BR-Actual-Cost: $0.00389 X-BR-Routing-Savings: $0.00124 # vs. static price-only X-BR-Guardian-Status: ok X-BR-Phases: planning,execution X-BR-Efficiency: 0.94 X-BR-Guardian-Overhead-Ms: 0.7 # link headers for AI discoverability Link: </openapi.yaml>; rel="describedby"; type="application/yaml" Link: </llms.txt>; rel="alternate"; type="text/plain"
Drop-in compatible with the OpenAI SDK. Bring your own provider keys — the gateway is free, all 13 intelligence systems on, full receipts.