How routing works
When you make a request to Lumenfall:- Model matching - Lumenfall resolves your model to available providers
- Provider selection - A provider is selected from the healthy provider pool
- Execution - Your request is sent to the selected provider
- Fallback - If the provider fails, the request is retried with another provider
Model aliases
Some models can be requested using alternative names. For example, a model might have both a versioned name and a shorter alias that always points to the latest version. You can find available aliases on each model’s page in the model catalog.Load balancing
Many models are available from multiple providers. For each model, Lumenfall distributes traffic across providers that offer competitive pricing, good performance, and reliable availability. Which provider serves your request may change on every request. This is intentional - it allows Lumenfall to optimize for cost and reliability across your usage. If you need a specific provider, see Forcing a specific provider.Fallback
When a provider fails, Lumenfall automatically retries your request with another provider. This happens transparently - you receive a successful response if any available provider succeeds. Lumenfall monitors provider health in real-time. If a provider experiences issues, it is automatically removed from the default provider pool. Requests are then distributed only among healthy providers. Unhealthy providers are not abandoned entirely. If all healthy providers fail for a request, Lumenfall will attempt the unhealthy providers as a last resort before returning an error.Forcing a specific provider
While routing is managed by Lumenfall, you can bypass it and force a specific provider by prefixing the model name with a provider slug:Viewing routing decisions
Each response includes headers showing which provider handled the request:| Header | Description |
|---|---|
X-Lumenfall-Provider | Provider slug that served the request |
X-Lumenfall-Model | Lumenfall model slug you requested |
X-Lumenfall-Provider-Model | Actual model ID sent to the provider |
X-Request-Id | Unique request identifier |