OpenAI Realtime speech-to-speech voice agent using the s2s verb over WebSocket transport.
Connects the caller to OpenAI's Realtime API for a real-time voice conversation with server-side VAD turn detection. Demonstrates:
- The `s2s` verb with `vendor: 'openai'`
- `response_create` configuration (voice, modalities, instructions)
- `session_update` for VAD turn detection and audio transcription
- API key authentication
- The `actionHook` callback for session completion
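
To make the list above concrete, here is a minimal sketch of what an `s2s` verb payload might look like. The field layout (`auth`, `llmOptions`, the nesting of `response_create` and `session_update`) is an assumption based on the options named above, and the model name, hook path, voice, and VAD thresholds are illustrative placeholders rather than values taken from this project. Check the jambonz documentation for the exact schema.

```javascript
// Hypothetical helper: builds a plain-object s2s verb payload.
// The property layout is an assumption, not the confirmed jambonz schema.
function buildS2sVerb(apiKey) {
  if (!apiKey) {
    throw new Error('OPENAI_API_KEY is required');
  }
  return {
    verb: 's2s',
    vendor: 'openai',
    model: 'gpt-4o-realtime-preview', // assumed model name
    auth: { apiKey },                 // API key authentication
    actionHook: '/s2s-complete',      // hypothetical path, called when the session ends
    llmOptions: {
      // Sent to OpenAI as the initial response.create configuration.
      response_create: {
        modalities: ['text', 'audio'],
        voice: 'alloy',
        instructions: 'You are a helpful voice assistant. Keep answers brief.',
      },
      // Sent to OpenAI as session.update: server-side VAD and transcription.
      session_update: {
        turn_detection: {
          type: 'server_vad',
          threshold: 0.8,            // illustrative values
          prefix_padding_ms: 300,
          silence_duration_ms: 500,
        },
        input_audio_transcription: { model: 'whisper-1' },
      },
    },
  };
}
```

Building the payload in one function keeps the OpenAI-specific options in a single place, so the voice or VAD settings can be changed without touching the connection handling.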
Install dependencies with `npm install`.

The following variables are configured in the jambonz portal and passed via `session.data.env_vars`:

| Variable | Default | Description |
|---|---|---|
| `OPENAI_API_KEY` | (none) | OpenAI API key (required) |
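
As a rough illustration of reading this value, the sketch below pulls `OPENAI_API_KEY` out of `session.data.env_vars` and fails early when it is missing. The helper name `getOpenAiKey` is hypothetical, and the only assumption about the session object is the `data.env_vars` shape described above.

```javascript
// Hypothetical helper: extracts the required API key from a jambonz session.
// Assumes only that session.data.env_vars is a plain object of strings.
function getOpenAiKey(session) {
  const envVars = (session && session.data && session.data.env_vars) || {};
  const key = envVars.OPENAI_API_KEY;
  if (typeof key !== 'string' || key.trim() === '') {
    throw new Error('OPENAI_API_KEY is not set in the jambonz application');
  }
  return key.trim();
}
```

Failing at the start of the session, rather than when the connection to OpenAI is attempted, makes a missing key easier to diagnose from the logs.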

| Variable | Default | Description |
|---|---|---|
| `PORT` | `3000` | Port the WebSocket server listens on |
| `LOG_LEVEL` | `info` | Pino log level (`debug`, `info`, `warn`, `error`) |
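
The defaults in this table could be applied along the following lines. This is a sketch that assumes `PORT` and `LOG_LEVEL` are read from the process environment (the README does not say where they are set). The helper name `loadServerConfig` is hypothetical, and the accepted log levels simply mirror the four listed in the table.

```javascript
// Log levels accepted here mirror the table above; Pino itself defines more.
const LOG_LEVELS = ['debug', 'info', 'warn', 'error'];

// Hypothetical helper: resolves server settings, falling back to the documented defaults.
function loadServerConfig(env = process.env) {
  const port = Number.parseInt(env.PORT ?? '3000', 10);
  if (!Number.isInteger(port) || port < 1 || port > 65535) {
    throw new Error(`Invalid PORT: ${env.PORT}`);
  }
  const logLevel = String(env.LOG_LEVEL ?? 'info').toLowerCase();
  if (!LOG_LEVELS.includes(logLevel)) {
    throw new Error(`Invalid LOG_LEVEL: ${env.LOG_LEVEL}`);
  }
  return { port, logLevel };
}
```

For example, `loadServerConfig({})` yields the documented defaults, and `loadServerConfig({ PORT: '8080', LOG_LEVEL: 'DEBUG' })` yields port 8080 with the level normalized to lowercase.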

Start the server with `npm start`.

Configure your jambonz application to use the WebSocket URL `ws://your-server:3000/`.