13 built-in LLM providers — Anthropic, OpenAI, Google, Mistral, Groq, DeepSeek, xAI, Together, Fireworks, Perplexity, Cerebras, Ollama, OpenRouter.

Providers

Cersei supports 13 providers out of the box. Most use the OpenAI-compatible API format, which means adding a provider is just an env var — no new code.

Beyond text, messages can carry images, video, audio, and PDFs. See Multimodal Input for the file-passing API and which media each provider accepts.

Model String Format

provider/model

Pass this to the Agent builder or the Abstract CLI:

Agent::builder()
    .provider(cersei_provider::from_model_string("groq/llama-3.1-70b-versatile")?.0)

abstract "fix the tests" --model groq/llama-3.1-70b-versatile

Bare model names auto-detect the provider from known prefixes:

abstract "fix the tests" --model gpt-4o          # → openai
abstract "fix the tests" --model claude-sonnet-4-6  # → anthropic
abstract "fix the tests" --model gemini-2.0-flash   # → google

Anthropic

Claude models. Uses Anthropic's native API format (different from OpenAI).


API Base	`https://api.anthropic.com`
Env Var	`ANTHROPIC_API_KEY` or `ANTHROPIC_KEY`
Format	Anthropic (native)
Default Model	`claude-sonnet-4-6`

Models:

Model	Context Window	Vision	Thinking	Tool Use
`claude-opus-4-6`	200K	Yes	Yes	Yes
`claude-sonnet-4-6`	200K	Yes	Yes	Yes
`claude-haiku-4-5`	200K	Yes	No	Yes

export ANTHROPIC_API_KEY=sk-ant-...
abstract "fix the tests" --model anthropic/claude-sonnet-4-6
abstract "fix the tests" --model sonnet   # alias

let (provider, model) = from_model_string("anthropic/claude-sonnet-4-6")?;

OpenAI

GPT and o-series models.


API Base	`https://api.openai.com/v1`
Env Var	`OPENAI_API_KEY`
Format	OpenAI-compatible
Default Model	`gpt-4o`

Models:

Model	Context Window
`gpt-4o`	128K
`gpt-4-turbo`	128K
`o1`	200K
`o3`	200K

export OPENAI_API_KEY=sk-...
abstract "fix the tests" --model openai/gpt-4o
abstract "fix the tests" --model 4o   # alias

Google

Gemini models via the OpenAI-compatible endpoint.


API Base	`https://generativelanguage.googleapis.com/v1beta/openai`
Env Var	`GOOGLE_API_KEY` or `GEMINI_API_KEY`
Format	OpenAI-compatible
Default Model	`gemini-3.1-pro-preview`

Models:

Model	Context Window
`gemini-3.1-pro-preview`	2M
`gemini-3.0-flash`	1M
`gemini-2.0-flash`	1M
`gemini-2.0-pro`	1M
`gemini-1.5-pro`	2M

export GOOGLE_API_KEY=AIza...
abstract "fix the tests" --model google/gemini-2.0-flash
abstract "fix the tests" --model gemini   # alias

Mistral

Mistral and Codestral models.


API Base	`https://api.mistral.ai/v1`
Env Var	`MISTRAL_API_KEY`
Format	OpenAI-compatible
Default Model	`mistral-large-latest`

Models:

Model	Context Window
`mistral-large-latest`	128K
`codestral-latest`	256K

export MISTRAL_API_KEY=...
abstract "fix the tests" --model mistral/mistral-large-latest
abstract "fix the tests" --model mistral   # alias

Groq

Llama, Mixtral, and other open models on Groq's inference hardware.


API Base	`https://api.groq.com/openai/v1`
Env Var	`GROQ_API_KEY`
Format	OpenAI-compatible
Default Model	`llama-3.1-70b-versatile`

Models:

Model	Context Window
`llama-3.1-70b-versatile`	128K
`llama-3.1-8b-instant`	128K
`mixtral-8x7b-32768`	32K

export GROQ_API_KEY=gsk_...
abstract "fix the tests" --model groq/llama-3.1-70b-versatile
abstract "fix the tests" --model llama   # alias

DeepSeek

DeepSeek Chat, Reasoner, and Coder models via DeepSeek's native endpoint.


API Base	`https://api.deepseek.com/v1`
Env Var	`DEEPSEEK_API_KEY` (the only variable required)
Format	OpenAI-compatible
Default Model	`deepseek-chat`

Models:

Model	Context Window
`deepseek-chat`	64K
`deepseek-reasoner`	64K
`deepseek-coder`	64K

Minimal working example:

export DEEPSEEK_API_KEY=sk-...
abstract --provider deepseek --model deepseek-chat
abstract --provider deepseek --model deepseek-reasoner

# Equivalent model-string forms:
abstract "fix the tests" --model deepseek/deepseek-chat
abstract "fix the tests" --model deepseek            # alias → deepseek/deepseek-chat

Selecting --provider deepseek (or any deepseek-* model name) always resolves DEEPSEEK_API_KEY and the native DeepSeek base URL — it never falls back to the OpenAI endpoint. To point DeepSeek at a proxy/gateway instead, set DEEPSEEK_BASE_URL (see Custom Endpoints).

xAI

Grok models.


API Base	`https://api.x.ai/v1`
Env Var	`XAI_API_KEY`
Format	OpenAI-compatible
Default Model	`grok-2`

export XAI_API_KEY=xai-...
abstract "fix the tests" --model xai/grok-2
abstract "fix the tests" --model grok   # alias

Together

Open-source models hosted by Together AI.


API Base	`https://api.together.xyz/v1`
Env Var	`TOGETHER_API_KEY`
Format	OpenAI-compatible
Default Model	`meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo`

export TOGETHER_API_KEY=...
abstract "fix the tests" --model together/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo

Fireworks

Fast inference for open models.


API Base	`https://api.fireworks.ai/inference/v1`
Env Var	`FIREWORKS_API_KEY`
Format	OpenAI-compatible
Default Model	`accounts/fireworks/models/llama-v3p1-70b-instruct`

export FIREWORKS_API_KEY=fw_...
abstract "fix the tests" --model fireworks/accounts/fireworks/models/llama-v3p1-70b-instruct

Perplexity

Search-augmented models.


API Base	`https://api.perplexity.ai`
Env Var	`PERPLEXITY_API_KEY`
Format	OpenAI-compatible
Default Model	`llama-3.1-sonar-large-128k-online`

export PERPLEXITY_API_KEY=pplx-...
abstract "fix the tests" --model perplexity/llama-3.1-sonar-large-128k-online

Cerebras

Wafer-scale inference.


API Base	`https://api.cerebras.ai/v1`
Env Var	`CEREBRAS_API_KEY`
Format	OpenAI-compatible
Default Model	`llama3.1-70b`

export CEREBRAS_API_KEY=csk-...
abstract "fix the tests" --model cerebras/llama3.1-70b

Ollama

Local models. No API key needed.


API Base	`http://localhost:11434/v1`
Env Var	(none)
Format	OpenAI-compatible
Default Model	`llama3.1`

ollama pull llama3.1
abstract "fix the tests" --model ollama/llama3.1

Ollama must be running locally on port 11434. No API key is required.

OpenRouter

Aggregator that proxies to multiple providers with a single key.


API Base	`https://openrouter.ai/api/v1`
Env Var	`OPENROUTER_API_KEY`
Format	OpenAI-compatible
Default Model	`anthropic/claude-3.5-sonnet`

export OPENROUTER_API_KEY=sk-or-...
abstract "fix the tests" --model openrouter/anthropic/claude-3.5-sonnet

Auto-Detection

When you pass a bare model name (no /), the router auto-detects the provider:

Prefix	Provider
`claude-*`	anthropic
`gpt-`, `o1`, `o3*`	openai
`gemini-*`	google
`mistral-`, `codestral-`	mistral
`deepseek-*`	deepseek
`grok-*`	xai
`llama*`	groq (if `GROQ_API_KEY` set), else together, else ollama

If no prefix matches, the router picks the first provider with a valid API key in the environment.

Aliases

The Abstract CLI supports short aliases:

Alias	Resolves To
`opus`	`anthropic/claude-opus-4-6`
`sonnet`	`anthropic/claude-sonnet-4-6`
`haiku`	`anthropic/claude-haiku-4-5`
`4o`, `gpt4o`	`openai/gpt-4o`
`gemini`	`google/gemini-3.1-pro-preview`
`llama`	`groq/llama-3.1-70b-versatile`
`deepseek`	`deepseek/deepseek-chat`
`grok`	`xai/grok-2`
`mistral`	`mistral/mistral-large-latest`

Adding Custom Providers

Any OpenAI-compatible endpoint works with the OpenAI provider directly:

let provider = OpenAi::builder()
    .base_url("https://my-custom-api.com/v1")
    .api_key("my-key")
    .model("my-model")
    .build()?;

let output = Agent::builder()
    .provider(provider)
    .tools(cersei::tools::coding())
    .run_with("fix the tests")
    .await?;

For the CLI, any OpenAI-compatible provider's base URL can be overridden with a <PROVIDER>_BASE_URL environment variable. The provider id is upper-cased, so:

Provider	Override variable
`openai`	`OPENAI_BASE_URL` (legacy alias: `OPENAI_API_BASE`)
`deepseek`	`DEEPSEEK_BASE_URL`
`openrouter`	`OPENROUTER_BASE_URL`
…any OpenAI-compatible provider	`<ID>_BASE_URL`

# Route the OpenAI provider at any OpenAI-compatible gateway:
OPENAI_API_KEY=my-key OPENAI_BASE_URL=https://my-custom-api.com/v1 \
  abstract "fix the tests" --provider openai --model my-model

When an override is in effect, Abstract prints a redacted target line before the first request — e.g. provider=openai host=my-custom-api.com model=my-model key_present=true — so a misrouted endpoint is obvious instead of surfacing as an opaque 401. If you intend to talk to DeepSeek, prefer --provider deepseek with DEEPSEEK_API_KEY; only use the openai provider + OPENAI_BASE_URL route for endpoints without a dedicated provider entry.

Providers

Providers

Model String Format

Model Router

Anthropic

OpenAI

Google

Mistral

Groq

DeepSeek

xAI

Together

Fireworks

Perplexity

Cerebras

Ollama

OpenRouter

Auto-Detection

Aliases

Adding Custom Providers

On this page