Guides

Multi-provider Routing

InferexAI can route requests to multiple LLM providers — Simplismart, OpenAI, Anthropic, Groq — based on model name, cost, or availability.

🚀

Coming soon

Multi-provider routing is on the roadmap for Q3 2026. When released, you will be able to configure provider priority, cost-based routing, and automatic fallback from the admin panel without changing any application code.

What to expect

Route by model name — use "gpt-4o", "claude-3-5-sonnet", or "llama-3" directly
Automatic failover — if one provider returns 5xx, retry on the next
Cost-based routing — always pick the cheapest provider for a given capability
Load balancing — spread traffic across providers to avoid rate limits
Per-route configuration in the admin panel — no code changes needed