Cartesia Sonic Text-to-Speech API
The Sonic text-to-speech API converts text into emotive speech at ultra-low latency, with a time to first byte under 100 ms. It supports REST, server-sent events (SSE), and WebSocket streaming for real-time voice agents and applications.
The Cartesia Sonic Text-to-Speech API is one of two APIs that Cartesia publishes on the APIs.io network.
Tagged areas include TTS, Streaming, SSE, WebSocket, and Real-Time. The published artifact set on APIs.io includes API documentation, a getting-started guide, an API reference, SDKs, a GitHub repository, and pricing.
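As a rough illustration of the REST option described above, the following Python sketch builds (but does not send) a single text-to-speech request. The endpoint path, header names, body fields, and default values are assumptions made for illustration and do not come from this listing; the API reference linked below is the authoritative source.

```python
import json
import urllib.request

# ASSUMPTION: base URL and path are illustrative; confirm them in the API reference.
API_BASE = "https://api.cartesia.ai"


def build_tts_request(api_key, transcript, voice_id,
                      model_id="sonic",              # placeholder model name (assumption)
                      api_version="YYYY-MM-DD"):     # placeholder version string (assumption)
    """Build a POST request for one-shot speech synthesis without sending it."""
    # ASSUMPTION: field names below are a plausible request shape, not a confirmed schema.
    body = {
        "model_id": model_id,
        "transcript": transcript,
        "voice": {"mode": "id", "id": voice_id},
        "output_format": {
            "container": "wav",
            "encoding": "pcm_s16le",
            "sample_rate": 44100,
        },
    }
    return urllib.request.Request(
        url=f"{API_BASE}/tts/bytes",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "X-API-Key": api_key,            # ASSUMPTION: header name for the API key
            "Cartesia-Version": api_version,  # ASSUMPTION: header name for API versioning
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    req = build_tts_request("YOUR_API_KEY", "Hello from Sonic.", "YOUR_VOICE_ID")
    print(req.get_method(), req.full_url)
```

If the assumed shape matches the published reference, passing the request to `urllib.request.urlopen` and writing the response bytes to a file would yield the audio. For real-time use, the SSE and WebSocket interfaces are the better fit, because they deliver audio incrementally instead of after synthesis completes.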
Documentation
Documentation
https://docs.cartesia.ai
Getting Started
https://docs.cartesia.ai/get-started
API Reference
https://docs.cartesia.ai/api-reference
Authentication
https://docs.cartesia.ai
SDKs
SDK
https://github.com/cartesia-ai/cartesia-python
SDK
https://github.com/cartesia-ai/cartesia-js
SDK
https://github.com/cartesia-ai/cartesia-go
GitHub Repository
https://github.com/cartesia-ai