Lumenfall automatically routes your requests to the best available AI provider, based on the model you specify and current provider availability. Lumenfall manages this routing for you to keep requests reliable and performant.

How routing works

When you make a request to Lumenfall:
  1. Model matching — Lumenfall matches your requested model to available providers using routing rules
  2. Provider selection — The system selects providers based on priority and weight
  3. Execution — Your request is sent to the selected provider
  4. Failover — If a provider fails, the request automatically routes to the next available provider
Request → Model Matching → Provider Selection → Execute → Response
                                                   │ (on failure)
                                                   ↓
                                  Next Provider → Execute → Response
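
For instance, a routed request specifies only the model name and lets Lumenfall choose the provider. The sketch below assumes the same images endpoint and model used in the examples later on this page; the prompt is only illustrative.

curl https://api.lumenfall.ai/openai/v1/images/generations \
  -H "Authorization: Bearer $LUMENFALL_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "imagen-3",
    "prompt": "A mountain landscape"
  }'

Because the model name carries no provider prefix, routing decides which provider actually serves the request.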

Priority groups

Providers are organized into priority groups. Lumenfall tries providers in the highest priority group first (priority 0), then falls back to lower priority groups if needed.
Routing configuration is currently managed by Lumenfall. If you need a specific routing setup, contact support.
For example, a routing configuration might look like:
Provider        Priority   Weight
Google Vertex   0          100%
OpenAI          1          70%
Replicate       1          30%
In this setup:
  • Google Vertex is always tried first (priority 0)
  • If Google Vertex fails, the request falls back to priority group 1
  • Within priority group 1, 70% of requests go to OpenAI and 30% to Replicate
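
Conceptually, the selection for this example behaves like the shell sketch below. This is an illustration only: Lumenfall performs the selection server-side, and the providers, priorities, and weights come from the hypothetical table above.

# Illustration only: Lumenfall runs this selection logic server-side.
# Priority 0 wins whenever it is healthy; priority 1 splits traffic 70/30.
vertex_healthy=false                  # pretend the priority-0 provider just failed

if [ "$vertex_healthy" = true ]; then
  provider="vertex"
else
  roll=$((RANDOM % 100))              # random number 0-99
  if [ "$roll" -lt 70 ]; then
    provider="openai"                 # ~70% of priority-1 traffic
  else
    provider="replicate"              # ~30% of priority-1 traffic
  fi
fi

echo "Selected provider: $provider"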

Weighted load balancing

Within a priority group, Lumenfall can split traffic between providers using weights. This enables:
  • Cost optimization — Routing more traffic to cheaper providers
  • Capacity management — Distributing load across providers
  • Quality optimization — Favoring providers with better output for specific models

Model pattern matching

Routing rules use glob patterns to match model names:
Pattern     Matches
imagen-3    Exact match for imagen-3
imagen-*    Any model starting with imagen-
dall-e-*    Any model starting with dall-e-
*           Any model (catch-all)
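
Shell case statements use the same style of glob matching, so a quick way to see how these patterns behave is a sketch like the one below. The model name being tested is made up for illustration.

model="imagen-3-fast"    # hypothetical model name to test against the patterns

case "$model" in
  imagen-3)  echo "exact match: imagen-3" ;;
  imagen-*)  echo "prefix match: imagen-*" ;;
  dall-e-*)  echo "prefix match: dall-e-*" ;;
  *)         echo "catch-all: *" ;;
esac

In a case statement the patterns are tried in order, which is why the exact match is listed before the broader globs here.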

Forcing a specific provider

While routing is managed by Lumenfall, you can bypass it and force a specific provider by prefixing the model name:
curl https://api.lumenfall.ai/openai/v1/images/generations \
  -H "Authorization: Bearer $LUMENFALL_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "vertex/imagen-3",
    "prompt": "A mountain landscape"
  }'
Supported prefixes:
  • vertex/ — Google Vertex AI
  • openai/ — OpenAI
  • replicate/ — Replicate
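
For example, to force OpenAI instead, prefix the model with openai/. The model name below is an assumption for illustration; substitute whichever OpenAI model you actually use.

curl https://api.lumenfall.ai/openai/v1/images/generations \
  -H "Authorization: Bearer $LUMENFALL_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "openai/dall-e-3",
    "prompt": "A mountain landscape"
  }'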

Automatic failover

If a provider returns an error or times out, Lumenfall automatically:
  1. Logs the failure with timing information
  2. Selects the next provider based on priority/weight
  3. Retries the request
  4. Returns the result from the successful provider
This happens transparently—you always receive a successful response if any provider in the chain succeeds.
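
Conceptually, the failover sequence behaves like the loop sketched below. This is purely an illustration of the order of operations: Lumenfall runs it server-side, so you never implement retries yourself, and the provider order and stubbed request helper are hypothetical.

# Illustration only: Lumenfall performs this failover for you, server-side.
# send_request_via is a stub standing in for the real provider call.
send_request_via() {
  [ "$1" = "openai" ]                 # pretend only OpenAI succeeds in this run
}

for provider in vertex openai replicate; do
  echo "Trying provider: $provider"
  if send_request_via "$provider"; then
    echo "Success: returning the response from $provider"
    break
  fi
  echo "$provider failed or timed out; failure logged, moving to the next provider"
done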

Viewing routing decisions

Each response includes headers showing which provider handled the request:
X-Lumenfall-Provider: vertex
X-Lumenfall-Model: imagen-3
X-Lumenfall-Request-Id: req_abc123
Use these headers for debugging and monitoring your provider distribution.
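
For example, you can spot-check your provider distribution from the command line by dumping the response headers with curl's -D flag. The sketch below reuses the images request from earlier on this page; the five-request sample size is arbitrary.

# Tally which provider handled each of a handful of requests.
for i in 1 2 3 4 5; do
  curl -s -o /dev/null -D - https://api.lumenfall.ai/openai/v1/images/generations \
    -H "Authorization: Bearer $LUMENFALL_API_KEY" \
    -H "Content-Type: application/json" \
    -d '{"model": "imagen-3", "prompt": "A mountain landscape"}' \
  | grep -i '^X-Lumenfall-Provider'
done | sort | uniq -c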