Groq
Groq delivers ultra-low-latency inference via its LPU (Language Processing Unit) hardware. It is OpenAI API-compatible, making integration straightforward.Prerequisites
Create an API key at console.groq.com and set it on the gateway host:Configuration
apiKey is the name of the environment variable, not the key value itself. Keep secrets in .env, not in committed config files.Selecting a model
| Model | Notes |
|---|---|
llama-3.3-70b-versatile | Best general-purpose Llama model on Groq |
llama-3.1-8b-instant | Fastest — lowest latency |
mixtral-8x7b-32768 | Large context window (32k tokens) |
gemma2-9b-it | Google Gemma 2 9B instruction-tuned |
openclaw models list --provider groq for the full current catalog.
Troubleshooting
401 Unauthorized
401 Unauthorized
429 Rate limited
429 Rate limited
Groq free-tier limits are generous but finite. Check your usage at the Groq console or add a paid plan.
Model not available
Model not available
Groq’s model catalog changes frequently. Run
openclaw models list --provider groq for the current list.