# LLM Providers
VM-X currently supports seven providers, each implemented as a
plug-in class behind the CompletionProvider interface. Pick one (or
several — VM-X is built for multi-provider routing) by creating an
AI Connection.
| Provider | Provider id | Endpoint shape on the wire | Auth |
|---|---|---|---|
| OpenAI | openai | OpenAI Chat Completions | API key |
| Anthropic | anthropic | Anthropic Messages (native SDK) | API key |
| Google Gemini | gemini | OpenAI-compatible endpoint | API key |
| Groq | groq | OpenAI-compatible endpoint | API key |
| Perplexity | perplexity | OpenAI-compatible endpoint (search-augmented) | API key |
| AWS Bedrock (Converse) | aws-bedrock | Bedrock Converse | IAM role |
| AWS Bedrock-Invoke (Anthropic on AWS) | aws-bedrock-invoke | Anthropic Messages via Bedrock InvokeModel | IAM role |
## Endpoints
Every provider can be reached through any of the three VM-X endpoints — the gateway converts request and response shapes when they don't match the provider's native wire format. See API Endpoints for the full per-endpoint contract.
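For instance, a stock OpenAI SDK client pointed at the gateway can talk to an Anthropic-backed route without changing request shape. A minimal sketch, assuming an OpenAI-compatible /v1 route on the gateway; the base URL, env var, and model id below are illustrative placeholders, not the real contract:

```ts
import OpenAI from "openai";

// Hypothetical gateway URL and credential; substitute your deployment's
// values. See API Endpoints for the actual route contract.
const client = new OpenAI({
  baseURL: "https://vmx.example.com/v1",
  apiKey: process.env.VMX_API_KEY!,
});

// An OpenAI-shaped request. If the route resolves to Anthropic, the
// gateway's direct converter rewrites it into Anthropic Messages on the
// wire, with no pivot format in between.
const completion = await client.chat.completions.create({
  model: "claude-sonnet-4-20250514", // illustrative model id
  messages: [{ role: "user", content: "One-line summary of the Converse API." }],
});

console.log(completion.choices[0].message.content);
```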
The gateway's per-pair conversion matrix (no internal pivot — every non-passthrough cell has a direct converter that targets the provider's wire format in one hop):
| Provider | Chat Completions | Responses | Anthropic Messages |
|---|---|---|---|
| OpenAI | ✅ passthrough | ✅ passthrough (native responses.create) | direct ↔ Responses (single hop) |
| Anthropic (native SDK) | direct ↔ Anthropic | direct ↔ Anthropic | ✅ passthrough |
| Gemini | ✅ passthrough¹ | direct ↔ Chat Completions | direct ↔ Chat Completions |
| Groq | ✅ passthrough | direct ↔ Chat Completions | direct ↔ Chat Completions |
| Perplexity | ✅ passthrough | direct ↔ Chat Completions | direct ↔ Chat Completions |
| AWS Bedrock (Converse) | direct ↔ Converse | direct ↔ Converse | direct ↔ Converse |
| AWS Bedrock-Invoke | direct ↔ Anthropic + Bedrock-wire | direct ↔ Anthropic + Bedrock-wire | ✅ passthrough |
¹ Gemini's chat-completion path automatically routes to Google's
native @google/genai SDK when the request carries a Gemini-only
tool descriptor (googleSearch, googleSearchRetrieval,
urlContext, codeExecution, fileSearch) or web_search_options.
The native path supports streaming + non-streaming for grounded
flows; everything else stays on the OpenAI-compat endpoint. See
Gemini for the supported
subset.
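A sketch of that dispatch rule, reusing the gateway client from the Endpoints example above; the Gemini model id is illustrative. Because the request carries web_search_options (a standard Chat Completions field), the gateway flips this call onto the native path:

```ts
// web_search_options is a real Chat Completions field; per the footnote,
// its presence routes the call to Google's native @google/genai SDK
// instead of the OpenAI-compat endpoint.
const grounded = await client.chat.completions.create({
  model: "gemini-2.5-flash", // illustrative Gemini model id
  messages: [{ role: "user", content: "What changed in the latest Node.js LTS?" }],
  web_search_options: {},
});

console.log(grounded.choices[0].message.content);
```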
"Passthrough" means no information loss: provider-only fields
(cache_control, thinking, top_k, service_tier, server tools,
betas, …) round-trip unchanged on the wire. "Direct ↔ X" means the
gateway has a single-hop converter into the provider's wire format —
no pivot through a third format. Cross-format details live on the
API Endpoints pages.
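As a concrete sketch of passthrough, assuming the gateway exposes an Anthropic-SDK-compatible base URL (the URL and model id below are illustrative): the Anthropic-only thinking and cache_control fields travel to the provider untouched:

```ts
import Anthropic from "@anthropic-ai/sdk";

// Hypothetical gateway route for the Anthropic Messages endpoint.
const anthropic = new Anthropic({
  baseURL: "https://vmx.example.com/anthropic",
  apiKey: process.env.VMX_API_KEY!,
});

const msg = await anthropic.messages.create({
  model: "claude-sonnet-4-20250514", // illustrative model id
  max_tokens: 2048,
  // Anthropic-only fields: on a passthrough cell these are forwarded
  // unchanged, so extended thinking and prompt caching behave natively.
  thinking: { type: "enabled", budget_tokens: 1024 },
  system: [
    {
      type: "text",
      text: "You are a terse release-notes assistant.",
      cache_control: { type: "ephemeral" }, // prompt-caching hint
    },
  ],
  messages: [{ role: "user", content: "Draft notes for v1.2." }],
});

console.log(msg.content);
```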
## Capabilities at a glance
| Feature | OpenAI | Anthropic | Gemini | Groq | Perplexity | Bedrock | Bedrock-Invoke |
|---|---|---|---|---|---|---|---|
| Streaming | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| Function / tool calling | ✅ | ✅ | ✅ | ✅¹ | — | ✅ | ✅ |
| Vision input | ✅ | ✅ | ✅ | — | — | ✅ | ✅ |
| Prompt caching | — | ✅ | — | — | — | — | ✅ |
| Extended thinking | — | ✅ | — | — | — | — | ✅ |
| Server tools (web_search, …) | — | ✅ | — | — | ✅² | — | ✅ |
| Reasoning tokens reported | ✅ | ✅ | ✅ | — | — | ✅ | ✅ |
| Per-call retries (maxRetries) | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| Native cost-tracking columns | ✅ | ✅ (cache breakdown) | ✅ | ✅ | ✅ | ✅ | ✅ (cache breakdown) |
¹ Best on the larger Llama models (llama-3.3-70b-versatile); the
smaller 8b-instant is fast but inconsistent at function calling.
² Built-in for Sonar models — every completion is search-augmented;
custom tools aren't exposed.
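A sketch of the tool-calling row, heeding footnote ¹ by picking the larger Llama model on Groq. It reuses the gateway client from the Endpoints sketch, and the weather tool is purely illustrative:

```ts
// A standard OpenAI-shaped tools array works unchanged through the
// gateway; only the model id steers the call to Groq.
const toolTurn = await client.chat.completions.create({
  model: "llama-3.3-70b-versatile",
  messages: [{ role: "user", content: "What's the weather in Lisbon?" }],
  tools: [
    {
      type: "function",
      function: {
        name: "get_weather", // hypothetical tool
        description: "Current weather for a city",
        parameters: {
          type: "object",
          properties: { city: { type: "string" } },
          required: ["city"],
        },
      },
    },
  ],
});

// The model should answer with a tool_calls entry rather than text.
console.log(toolTurn.choices[0].message.tool_calls);
```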
## Adding a connection
- Navigate to AI Connections in the UI.
- Click Create Connection.
- Pick the provider, fill in the credentials form, and (optionally) set capacity / discovered-rate-limit defaults.
- Save — the connection becomes available for any AI Resource in the same workspace + environment.
See AI Connections for the full walkthrough including credential encryption, capacity, discovered limits, and the shared IAM-role pattern for AWS providers.
## Adding a new provider (contributors)
The provider classes live in
packages/api/src/ai-provider/<provider-id>/ — one folder per
provider with the same 4-file shape (shared.ts,
openai-chat-completion.provider.ts, openai-response.provider.ts,
anthropic-messages.provider.ts, plus an index.ts composer). The
contributor doc walks through the three "paths" for adding a new
one — extending OpenAIProvider, modeling after AnthropicProvider
/ AWSBedrockInvokeProvider, or full custom — with code skeletons,
DI registration, factory wiring, and the live-flow test matrix:
contributing-docs/ai-providers.md.
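For orientation, here is a skeleton of the first path (extending the OpenAI provider classes for an OpenAI-compatible vendor). Apart from the folder layout, every name in it is an assumption; the real base-class names, override hooks, and DI wiring live in contributing-docs/ai-providers.md:

```ts
// packages/api/src/ai-provider/acme/openai-chat-completion.provider.ts
// (hypothetical provider "acme" following the four-file folder shape)
import { OpenAIChatCompletionProvider } from "../openai/openai-chat-completion.provider"; // assumed base class

// Path 1: the vendor speaks the OpenAI wire format, so the subclass only
// repoints the client and overrides the rare divergent spots.
export class AcmeChatCompletionProvider extends OpenAIChatCompletionProvider {
  protected baseURL = "https://api.acme.example/v1"; // hypothetical endpoint

  // Override request/response mapping hooks here only where Acme's
  // dialect differs from OpenAI's; everything else is inherited.
}
```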
## Next steps
- OpenAI
- Anthropic
- Google Gemini
- Groq
- Perplexity
- AWS Bedrock (Converse)
- AWS Bedrock-Invoke
- AI Connections — credential & capacity setup
- AI Resources — pick the provider/model strategy your endpoint resolves to
- API Endpoints — Chat Completions / Anthropic Messages / Responses