Multi-provider Routing
InferexAI will be able to route requests across multiple LLM providers (Simplismart, OpenAI, Anthropic, Groq) based on model name, cost, or availability.
Coming soon
Multi-provider routing is on the roadmap for Q3 2026. When released, you will be able to configure provider priority, cost-based routing, and automatic fallback from the admin panel without changing any application code.
What to expect
→ Route by model name: use "gpt-4o", "claude-3-5-sonnet", or "llama-3" directly
→ Automatic failover: if one provider returns a 5xx error, the request is retried on the next provider
→ Cost-based routing: pick the cheapest provider for a given capability
→ Load balancing: spread traffic across providers to avoid rate limits
→ Per-route configuration in the admin panel: no code changes needed
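
Because the feature has not shipped, no routing API or configuration format has been published. The sketch below is only an illustration of how cost-based routing and 5xx failover could fit together. Every name in it (Provider, ProviderError, candidates, route, fake_send) is hypothetical and not part of InferexAI, and the costs are made-up numbers, not real pricing.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass(frozen=True)
class Provider:
    """Hypothetical provider entry; not an InferexAI type."""
    name: str
    cost_per_1k_tokens: float  # illustrative figure, not real pricing
    models: frozenset[str]


class ProviderError(Exception):
    """Raised by a send function when a provider answers with an HTTP error."""

    def __init__(self, status: int) -> None:
        super().__init__(f"provider returned HTTP {status}")
        self.status = status


def candidates(providers: Sequence[Provider], model: str) -> list[Provider]:
    """Return the providers that serve `model`, cheapest first."""
    eligible = [p for p in providers if model in p.models]
    return sorted(eligible, key=lambda p: p.cost_per_1k_tokens)


def route(
    providers: Sequence[Provider],
    model: str,
    send: Callable[[Provider, str], str],
) -> tuple[str, str]:
    """Try providers cheapest first; move to the next one only on a 5xx error."""
    last_error: ProviderError | None = None
    for provider in candidates(providers, model):
        try:
            return provider.name, send(provider, model)
        except ProviderError as err:
            if not 500 <= err.status <= 599:
                raise  # client errors (4xx) are not retried elsewhere
            last_error = err
    if last_error is None:
        raise LookupError(f"no provider serves model {model!r}")
    raise last_error  # every eligible provider failed with 5xx


# Usage with a stand-in send function (assumption: a real one would call the provider).
providers = [
    Provider("provider-a", 0.50, frozenset({"llama-3"})),
    Provider("provider-b", 0.20, frozenset({"llama-3", "gpt-4o"})),
]


def fake_send(provider: Provider, model: str) -> str:
    if provider.name == "provider-b":
        raise ProviderError(503)  # simulate an outage at the cheapest provider
    return f"{model} answered by {provider.name}"


print(route(providers, "llama-3", fake_send))
# ('provider-a', 'llama-3 answered by provider-a')
```

In this sketch the cheapest provider is tried first and fails with a 503, so the request falls through to the next cheapest one. Load balancing and per-route settings are left out; in the released feature these are expected to be configured from the admin panel rather than in application code.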