Prerequisites
- A running vLLM server with its OpenAI-compatible API enabled
- The server must be reachable from the OpenClaw gateway host
Activation
vLLM auto-activates whenVLLM_API_KEY is set in the environment. The default base URL is http://127.0.0.1:8000/v1.
Configuration
| Key | Type | Description |
|---|---|---|
baseUrl | string | Base URL of the vLLM OpenAI-compatible endpoint |
api | string | API style — openai-completions for vLLM |
apiKey | string | Name of the env var holding the API key |
models[].id | string | Model ID as loaded by your vLLM server |
models[].contextWindow | integer | Context window size in tokens |
models[].maxTokens | integer | Max output tokens |
models[].input / cost | number | Cost per token (use 0 for self-hosted) |
Running vLLM
Verify the connection
vLLM does not expose a model list endpoint by default. The
--probe flag sends a test completion to verify connectivity.