Gemini Text-to-Speech API

Native audio generation text-to-speech capabilities through the Gemini API, supporting single and multi-speaker speech synthesis with natural language control over style, accent, pace, and tone.

API entry from apis.yml

apis.yml Raw ↑
name: Gemini Text-to-Speech API
description: Native audio generation text-to-speech capabilities through the Gemini API, supporting single
  and multi-speaker speech synthesis with natural language control over style, accent, pace, and tone.
humanURL: https://ai.google.dev/gemini-api/docs/speech-generation
baseURL: https://generativelanguage.googleapis.com
tags:
- Audio Generation
- Multi-Speaker
- Speech Synthesis
- Text-To-Speech
- Tts
properties:
- type: Documentation
  url: https://ai.google.dev/gemini-api/docs/speech-generation