NVIDIA NIM
NVIDIA NIM provides optimised, containerised model inference through an OpenAI-compatible API endpoint. Models are tuned for NVIDIA GPU hardware and served via NVIDIA’s cloud infrastructure.Prerequisites
Create an API key at build.nvidia.com and set it on the gateway host:Configuration
NVIDIA NIM uses a custom base URL that must be set explicitly:apiKey is the name of the environment variable, not the key value itself. Keep secrets in .env, not in committed config files.Selecting a model
| Model | Notes |
|---|---|
meta/llama-3.3-70b-instruct | Meta Llama 3.3 70B — strong general-purpose |
nvidia/llama-3.1-nemotron-70b-instruct | NVIDIA fine-tuned Nemotron variant |
mistralai/mistral-large-2-instruct | Mistral Large 2 on NVIDIA infrastructure |
openclaw models list --provider nvidia for the current catalog.
Troubleshooting
401 Unauthorized
401 Unauthorized
404 Model not found
404 Model not found
Confirm the full model ID including the vendor prefix (e.g.
meta/llama-3.3-70b-instruct). NVIDIA NIM requires the scoped name.Connection error
Connection error
Confirm
baseUrl is set to https://integrate.api.nvidia.com/v1 in your provider config — the provider will not work without it.