DeepL Voice API
The DeepL Voice API provides real-time speech transcription and translation. A POST to /v3/voice/realtime returns an ephemeral token and a WebSocket streaming URL. The client then opens a WSS connection to that URL, streams source audio in chunks, and receives incremental source-language transcriptions, translated transcriptions, and (in closed beta) synthesized translated audio. The REST APIs document no webhook callback URL; document translation remains polling-based.
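The two-step handshake described above (REST call first, WebSocket second) can be sketched roughly as follows. This is a minimal illustration, not the documented client. Only the /v3/voice/realtime path and the token-plus-URL response come from the description. The host `https://api.deepl.com`, the `DeepL-Auth-Key` header scheme, the response field names `streaming_url` and `token`, the request body, and passing the token as a query parameter are all assumptions; check the OpenAPI and AsyncAPI files listed below before relying on any of them. The sketch only builds the request and the final URL; it does not send anything.

```python
import json
import urllib.request
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Assumption: the Voice API is served from the general DeepL API host.
API_BASE = "https://api.deepl.com"


def build_session_request(auth_key: str, body: dict) -> urllib.request.Request:
    """Build (but do not send) the POST that asks for a streaming session."""
    return urllib.request.Request(
        url=f"{API_BASE}/v3/voice/realtime",
        data=json.dumps(body).encode("utf-8"),
        headers={
            # Assumption: same auth header scheme as the other DeepL REST APIs.
            "Authorization": f"DeepL-Auth-Key {auth_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def websocket_url_from_response(payload: dict) -> str:
    """Combine the streaming URL and ephemeral token into one WSS URL.

    Assumptions: the response fields are named "streaming_url" and "token",
    and the token is passed as a "token" query parameter.
    """
    parts = urlsplit(payload["streaming_url"])
    if parts.scheme != "wss":
        raise ValueError(f"expected a wss:// URL, got scheme {parts.scheme!r}")
    query = parse_qsl(parts.query) + [("token", payload["token"])]
    return urlunsplit(parts._replace(query=urlencode(query)))


if __name__ == "__main__":
    # Hypothetical request body and response payload, for illustration only.
    request = build_session_request("YOUR_AUTH_KEY", {"source_language": "en"})
    print(request.get_method(), request.full_url)
    print(websocket_url_from_response(
        {"streaming_url": "wss://example.invalid/stream", "token": "abc"}
    ))
```

To go further, a caller would send the request with `urllib.request.urlopen`, decode the JSON response, and hand the resulting URL to a WebSocket client of its choice. Because the token is ephemeral, a new session would presumably be requested for each stream rather than reusing a stored URL.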
Documentation
Specifications
OpenAPI
https://raw.githubusercontent.com/api-evangelist/deepl/refs/heads/main/openapi/deepl-voice-api-openapi.yml
AsyncAPI
https://raw.githubusercontent.com/api-evangelist/deepl/refs/heads/main/asyncapi/deepl-voice-api-asyncapi.yml
AsyncAPI
https://developers.deepl.com/api-reference/voice.asyncapi.yaml