autoThompson sampling picks based on learned quality posteriors. General workloads.
base_url.
Hand-curated catalog from Anthropic, OpenAI, Google, x-AI, Groq,
Perplexity, DeepSeek, and Moonshot. OpenAI- and Anthropic-compatible
endpoints. Set model="auto" and Thompson
sampling picks the best one per request.
Per-model quality, latency, and success variance tracked live. The leaderboard is public to every caller via GET /v1/intelligence/rankings.
gpt-4.1 · gpt-4o · o3 · o3-mini · …
claude-opus-4-6 · claude-sonnet-4-6 · claude-haiku-4-5
gemini-2.5-pro · gemini-2.5-flash · gemini-1.5-pro
llama-3.3-70b-versatile · mixtral-8x7b-32768
grok-3 · grok-3-mini
sonar-pro · sonar
deepseek-chat · deepseek-reasoner
kimi-k2.5 · kimi-k2.6
model to a strategy, not a name.autoThompson sampling picks based on learned quality posteriors. General workloads.
auto:fastLowest p50 latency model. Real-time UX, streaming interfaces.
auto:floorCheapest model above the quality threshold. Bulk processing, classification, triage.
auto:bestHighest quality regardless of cost. Critical reasoning, code review, legal.
curl. Auto-routing across 31 models.