ElevenLabs Sound Effects API

The ElevenLabs Sound Effects API generates cinematic sound effects from text descriptions. Developers can describe the desired sound in natural language and receive high-quality audio output. The API supports audio tags for controlling delivery, emotion, emphasis, pauses, and specific sound effects, making it suitable for game development, film production, and multimedia content creation.

OpenAPI Specification

elevenlabs-sound-effects-openapi.yml Raw ↑
openapi: 3.1.0
info:
  title: ElevenLabs Sound Effects API
  description: >-
    The ElevenLabs Sound Effects API generates cinematic sound effects from
    text descriptions. Developers can describe the desired sound in natural
    language and receive high-quality audio output. The API supports audio
    tags for controlling delivery, emotion, emphasis, pauses, and specific
    sound effects, making it suitable for game development, film production,
    and multimedia content creation.
  version: '1.0'
  contact:
    name: ElevenLabs Support
    url: https://help.elevenlabs.io
  termsOfService: https://elevenlabs.io/terms-of-service
externalDocs:
  description: ElevenLabs Sound Effects API Documentation
  url: https://elevenlabs.io/docs/api-reference/text-to-sound-effects/convert
servers:
  - url: https://api.elevenlabs.io
    description: Production Server
tags:
  - name: Sound Effects
    description: >-
      Endpoints for generating sound effects from text descriptions using
      AI models.
security:
  - apiKeyAuth: []
paths:
  /v1/sound-generation:
    post:
      operationId: createSoundEffect
      summary: Create sound effect
      description: >-
        Generates a sound effect from a text description. The description
        should detail the desired sound in natural language. Returns the
        generated audio in the specified format.
      tags:
        - Sound Effects
      requestBody:
        required: true
        content:
          application/json:
            schema:
              $ref: '#/components/schemas/SoundEffectRequest'
      responses:
        '200':
          description: Sound effect generated successfully
          content:
            audio/mpeg:
              schema:
                type: string
                format: binary
        '400':
          description: Bad request - invalid parameters
        '401':
          description: Unauthorized - invalid or missing API key
        '422':
          description: Unprocessable entity - validation error
components:
  securitySchemes:
    apiKeyAuth:
      type: apiKey
      in: header
      name: xi-api-key
      description: >-
        ElevenLabs API key passed in the xi-api-key header for authentication.
  schemas:
    SoundEffectRequest:
      type: object
      required:
        - text
      properties:
        text:
          type: string
          description: >-
            A natural language description of the sound effect to generate.
            Be specific about the type of sound, its characteristics, and
            any environmental context.
          maxLength: 1000
        duration_seconds:
          type: number
          description: >-
            The desired duration of the generated sound effect in seconds.
            If not provided, the model determines an appropriate length.
          minimum: 0.5
          maximum: 22
        prompt_influence:
          type: number
          description: >-
            Controls how closely the generation follows the text prompt.
            Higher values produce output more faithful to the description,
            while lower values allow more creative interpretation.
          minimum: 0
          maximum: 1
          default: 0.3