Realtime voice - WednesdayAI

Realtime voice enables spoken conversations with the AI. Audio is captured, transcribed, processed, and synthesized back to speech in real time.
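
The four stages named above can be pictured as a simple chain. The following is a minimal sketch only: `VoiceTurnPipeline`, `handle_turn`, and the stage signatures are hypothetical names chosen for illustration, not part of WednesdayAI or any provider plugin, and a real realtime provider may stream audio or merge stages rather than run them one after another.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical stage signatures; none of these names come from WednesdayAI.
Transcribe = Callable[[bytes], str]   # captured audio -> text
Respond = Callable[[str], str]        # user text -> reply text
Synthesize = Callable[[str], bytes]   # reply text -> audio


@dataclass
class VoiceTurnPipeline:
    transcribe: Transcribe
    respond: Respond
    synthesize: Synthesize

    def handle_turn(self, captured_audio: bytes) -> bytes:
        """Run one captured utterance through every stage and return reply audio."""
        text = self.transcribe(captured_audio)
        reply = self.respond(text)
        return self.synthesize(reply)


# Stub stages so the sketch runs without audio hardware or a model.
pipeline = VoiceTurnPipeline(
    transcribe=lambda audio: audio.decode("utf-8"),
    respond=lambda text: f"You said: {text}",
    synthesize=lambda reply: reply.encode("utf-8"),
)

reply_audio = pipeline.handle_turn(b"hello")
```

Keeping each stage behind a plain callable makes it easy to swap a stub for a real implementation when experimenting.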

Requirements

A realtime voice provider plugin (e.g. @wednesdayai/voice-openai)
A channel with voice support (currently: Discord voice channels)

Quick start

Install a voice provider plugin

openclaw plugins install @wednesdayai/voice-openai

Configure the provider

plugins:
  voice-openai:
    apiKey: ""   # or set the OPENAI_API_KEY environment variable
    model: "gpt-4o-realtime-preview"
    voice: "alloy"

Enable voice on a channel

channels:
  discord:
    voice:
      enabled: true
      provider: "voice-openai"

Restart the gateway

openclaw restart

Configuration reference

channels:
  discord:
    voice:
      enabled: false
      provider: ""
      vad:
        sensitivity: 0.5    # 0–1; higher = less noise-sensitive
        silenceMs: 800      # ms of silence to end a turn
      maxSessionMinutes: 60
      requirePermission: false

| Key | Type | Default | Description |
| --- | --- | --- | --- |
| `enabled` | boolean | `false` | Enable voice for this channel |
| `provider` | string | `""` | Voice provider plugin name |
| `vad.sensitivity` | number | `0.5` | Voice activity detection sensitivity (0–1); higher values are less sensitive to noise |
| `vad.silenceMs` | integer | `800` | Milliseconds of silence that end a turn |
| `maxSessionMinutes` | integer | `60` | Auto-end sessions after this many minutes (0 = no limit) |
| `requirePermission` | boolean | `false` | Restrict voice to users with the `voice` permission |
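
To make the two `vad` settings concrete, here is a toy end-of-turn detector. It is a sketch under stated assumptions, not how any voice provider actually works: it treats `sensitivity` as an energy threshold on a normalized 0 to 1 scale (so a higher value ignores more background noise), and `TurnDetector`, `push`, and `frame_ms` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass
class TurnDetector:
    """Toy end-of-turn detector driven by per-frame audio energy."""
    sensitivity: float = 0.5   # documented default; used as an energy threshold (assumption)
    silence_ms: int = 800      # documented default for vad.silenceMs
    frame_ms: int = 20         # hypothetical frame length
    _silent_ms: int = 0
    _heard_speech: bool = False

    def push(self, energy: float) -> bool:
        """Feed one frame's normalized energy (0-1); return True when the turn ends."""
        if energy >= self.sensitivity:
            # Speech frame: remember it and reset the silence counter.
            self._heard_speech = True
            self._silent_ms = 0
            return False
        if not self._heard_speech:
            return False  # leading silence never ends a turn
        self._silent_ms += self.frame_ms
        if self._silent_ms >= self.silence_ms:
            # Enough trailing silence: close the turn and reset for the next one.
            self._heard_speech = False
            self._silent_ms = 0
            return True
        return False


# Short silence window so the example stays small: 5 silent frames of 20 ms.
detector = TurnDetector(silence_ms=100, frame_ms=20)
frames = [0.0, 0.9, 0.8, 0.1, 0.1, 0.1, 0.1, 0.1]
ended_at = [i for i, energy in enumerate(frames) if detector.push(energy)]
```

In this run the turn closes on the last frame, once five consecutive quiet frames have followed the speech. Raising `silence_ms` tolerates longer pauses inside a turn at the cost of slower responses.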

Troubleshooting

Run `openclaw doctor --check voice` to diagnose configuration issues.
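
For a sense of the kinds of problems worth checking by hand, the sketch below validates a voice mapping against the ranges and defaults in the configuration reference. It does not reproduce what `openclaw doctor` does internally; `check_voice_config` is a hypothetical helper, and the rule that `vad.silenceMs` must be positive is an assumption.

```python
from typing import Any

# Defaults copied from the configuration reference.
DEFAULTS: dict[str, Any] = {
    "enabled": False,
    "provider": "",
    "vad": {"sensitivity": 0.5, "silenceMs": 800},
    "maxSessionMinutes": 60,
    "requirePermission": False,
}


def check_voice_config(voice: dict[str, Any]) -> list[str]:
    """Return human-readable problems found in a channel's `voice` mapping."""
    # Merge user values over the defaults, including the nested vad block.
    vad = {**DEFAULTS["vad"], **voice.get("vad", {})}
    cfg = {**DEFAULTS, **voice, "vad": vad}
    problems: list[str] = []

    if cfg["enabled"] and not cfg["provider"]:
        problems.append("voice is enabled but no provider is set")
    if not 0 <= vad["sensitivity"] <= 1:
        problems.append("vad.sensitivity must be between 0 and 1")
    # Assumption: a zero or negative silence window is treated as invalid.
    if not isinstance(vad["silenceMs"], int) or vad["silenceMs"] <= 0:
        problems.append("vad.silenceMs must be a positive integer")
    if not isinstance(cfg["maxSessionMinutes"], int) or cfg["maxSessionMinutes"] < 0:
        problems.append("maxSessionMinutes must be an integer >= 0 (0 = no limit)")
    return problems
```

For example, `check_voice_config({"enabled": True})` reports the missing provider, which is the most likely mistake after copying the reference block and flipping only `enabled`.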
