Providers

open-multi-agent keeps the agent config shape stable across hosted, cloud, and local providers. Change provider, model, and the relevant credential; the rest of your team definition stays the same.

const agent = {
  name: 'my-agent',
  provider: 'anthropic',
  model: 'claude-sonnet-4-6',
  systemPrompt: 'You are a helpful assistant.',
}

Built-In Provider Shortcuts

The framework ships a wired-in provider name for each of these. Set provider and the env var, and the adapter handles the endpoint.

Under the hood, Anthropic, Gemini, and Bedrock use provider-specific APIs. The other built-in shortcuts are pre-configured wrappers around OpenAI-compatible endpoints; same wire format as the OpenAI-compatible table below, with the baseURL already supplied.

Provider	Config	Env var	Example model	Notes
Anthropic (Claude)	`provider: 'anthropic'`	`ANTHROPIC_API_KEY`	`claude-sonnet-4-6`	Native Anthropic SDK.
Gemini	`provider: 'gemini'`	`GEMINI_API_KEY`	`gemini-2.5-pro`	Native Google GenAI SDK. Requires `npm install @google/genai`.
OpenAI (GPT)	`provider: 'openai'`	`OPENAI_API_KEY`	`gpt-4o`
Azure OpenAI	`provider: 'azure-openai'`	`AZURE_OPENAI_API_KEY`, `AZURE_OPENAI_ENDPOINT`	`gpt-4`	Optional `AZURE_OPENAI_API_VERSION`, `AZURE_OPENAI_DEPLOYMENT`.
GitHub Copilot	`provider: 'copilot'`	`GITHUB_COPILOT_TOKEN` (falls back to `GITHUB_TOKEN`)	`gpt-4o`	Custom token-exchange flow on top of OpenAI protocol.
Grok (xAI)	`provider: 'grok'`	`XAI_API_KEY`	`grok-4`	OpenAI-compatible; endpoint is `api.x.ai/v1`.
DeepSeek	`provider: 'deepseek'`	`DEEPSEEK_API_KEY`	`deepseek-v4-flash`	OpenAI-compatible. `deepseek-v4-flash` (default) or `deepseek-v4-pro` (flagship for coding); both support 1M context and 384K max output. Legacy `deepseek-chat` / `deepseek-reasoner` retire 2026-07-24.
Doubao (Volcengine)	`provider: 'doubao'`	`ARK_API_KEY`	`doubao-seed-1-8-251228`	OpenAI-compatible. ByteDance Volcengine Ark endpoint `https://ark.cn-beijing.volces.com/api/v3`. See `providers/doubao`.
Hunyuan (Tencent MaaS / TokenHub)	`provider: 'hunyuan'`	`HUNYUAN_API_KEY`	`hy3-preview`	OpenAI-compatible. Default endpoint `https://tokenhub.tencentmaas.com/v1` (Tencent’s current platform; `sk-...` keys, Hunyuan 3 models). Tool calling verified on `hy3-preview`. See `providers/hunyuan`.
Hunyuan (legacy Tencent Cloud)	`provider: 'hunyuan'` + `HUNYUAN_BASE_URL`	`HUNYUAN_API_KEY`	`hunyuan-turbos-latest`	Legacy endpoint `https://api.hunyuan.cloud.tencent.com/v1` (console.cloud.tencent.com/hunyuan key; separate key namespace). Tencent has announced this platform is being retired (sales stop 2026-06-30, full shutdown 2026-09-30). Set `HUNYUAN_BASE_URL=https://api.hunyuan.cloud.tencent.com/v1` to target it until then. Tool calling verified on `hunyuan-turbos` and `hunyuan-functioncall`.
MiniMax (global)	`provider: 'minimax'`	`MINIMAX_API_KEY`	`MiniMax-M3`	OpenAI-compatible.
MiniMax (China)	`provider: 'minimax'` + `MINIMAX_BASE_URL`	`MINIMAX_API_KEY`	`MiniMax-M3`	Set `MINIMAX_BASE_URL=https://api.minimaxi.com/v1`.
MiMo	`provider: 'mimo'`	`MIMO_API_KEY` (+ optional `MIMO_BASE_URL`)	`mimo-v2.5-pro`	OpenAI-compatible. Defaults to pay-as-you-go endpoint `https://api.xiaomimimo.com/v1`; Token Plan keys (`tp-...`) require the cluster base URL from your subscription page, such as `https://token-plan-cn.xiaomimimo.com/v1`. Supports reasoning/tool-call loops through the built-in MiMo adapter. See `providers/mimo`.
Qiniu	`provider: 'qiniu'`	`QINIU_API_KEY`	`deepseek-v3`	OpenAI-compatible. Endpoint `https://api.qnaigc.com/v1`; multiple model families, see Qiniu AI docs.
AWS Bedrock	`provider: 'bedrock'`	none (AWS SDK credential chain)	`anthropic.claude-3-5-haiku-20241022-v1:0`	No API key. Set `AWS_REGION` or pass `region` as the 4th arg to `createAdapter`. Credentials come from env vars, shared config, or IAM role. Newer Claude models can require a cross-region inference profile prefix such as `us.`. Also supports Llama, Mistral, and Cohere. See `providers/bedrock`. Requires `npm install @aws-sdk/client-bedrock-runtime`.

OpenAI-Compatible Providers

No bundled shortcut is needed when a server speaks OpenAI Chat Completions. Use provider: 'openai' and point baseURL at the service.

Service	Config	Env var	Example model	Notes
Ollama (local)	`provider: 'openai'` + `baseURL: 'http://localhost:11434/v1'`	none	`llama3.1`
vLLM (local)	`provider: 'openai'` + `baseURL`	none	server-loaded
LM Studio (local)	`provider: 'openai'` + `baseURL`	none	server-loaded
llama.cpp server (local)	`provider: 'openai'` + `baseURL`	none	server-loaded
OpenRouter	`provider: 'openai'` + `baseURL: 'https://openrouter.ai/api/v1'` + `apiKey`	`OPENROUTER_API_KEY`	`openai/gpt-4o-mini`
Groq	`provider: 'openai'` + `baseURL: 'https://api.groq.com/openai/v1'`	`GROQ_API_KEY`	`llama-3.3-70b-versatile`
Mistral	`provider: 'openai'` + `baseURL: 'https://api.mistral.ai/v1'`	`MISTRAL_API_KEY`	`mistral-large-latest`	See `providers/mistral`.
MiMo	`provider: 'openai'` + `baseURL: 'https://api.xiaomimimo.com/v1'`	`MIMO_API_KEY`	`mimo-v2.5-pro`	Prefer the built-in `mimo` provider when using tool-calling agent loops. Token Plan users should set their `token-plan-*.xiaomimimo.com/v1` base URL.
Zhipu GLM	`provider: 'openai'` + `baseURL: 'https://open.bigmodel.cn/api/paas/v4'`	`ZHIPU_API_KEY`	`glm-4-plus`	See `providers/zhipu`.
Qwen (DashScope)	`provider: 'openai'` + `baseURL: 'https://dashscope.aliyuncs.com/compatible-mode/v1'`	`DASHSCOPE_API_KEY`	`qwen-plus`	See `providers/qwen`.
Moonshot AI (Kimi)	`provider: 'openai'` + `baseURL: 'https://api.moonshot.ai/v1'`	`MOONSHOT_API_KEY`	`kimi-k2.5`	See `providers/moonshot`.
LiteLLM (proxy)	`provider: 'openai'` + `baseURL: 'http://localhost:4000/v1'` + `apiKey`	`LITELLM_API_KEY` (if proxy auth enabled)	any model on your proxy	LiteLLM unifies 100+ providers (OpenAI, Anthropic, Azure, Bedrock, Vertex, etc.) behind one OpenAI-compatible endpoint. Run `litellm --config config.yaml` and point `baseURL` at the proxy.

Other services can be connected the same way if they implement the OpenAI Chat Completions API, but they are not listed as verified providers here. For services where the key is not OPENAI_API_KEY, pass it explicitly via apiKey; otherwise the openai adapter falls back to OPENAI_API_KEY.

Local Model Tool-Calling

The framework supports tool-calling with local models served by Ollama, vLLM, LM Studio, or llama.cpp. Tool-calling is handled natively through the OpenAI-compatible API.

Verified local models include Gemma 4, Llama 3.1, Qwen 3, Mistral, and Phi-4. Ollama publishes its tool-capable models at ollama.com/search?c=tools.

If a local model returns tool calls as text instead of the tool_calls wire format, the framework automatically extracts them from the text output. This helps with thinking models or misconfigured local servers.

Use timeoutMs on AgentConfig for slow local inference:

const localAgent = {
  name: 'local',
  model: 'llama3.1',
  provider: 'openai',
  baseURL: 'http://localhost:11434/v1',
  apiKey: 'ollama',
  tools: ['bash', 'file_read'],
  timeoutMs: 120_000,
}

Highly quantized MoE models on consumer hardware can fall into repetition loops or hallucinate tool-call schemas under default sampling. AgentConfig exposes topK, minP, frequencyPenalty, presencePenalty, parallelToolCalls, and extraBody for server-specific knobs such as vLLM’s repetition_penalty. See providers/local-quantized for a complete setup.

Troubleshooting

Model not calling tools? Confirm it appears in Ollama’s Tools category.
Using Ollama? Update to the latest version with ollama update.
Proxy interfering with local servers? Use no_proxy=localhost.