Azure OpenAI Inference REST API

Data-plane REST API for running inference against deployed Azure OpenAI models, including chat completions, completions, embeddings, image generation, and audio transcription/translation. Authenticate with API key (api-key header) or Microsoft Entra ID bearer token.

API entry from apis.yml

apis.yml Raw ↑
aid: azure-openai:inference-api
name: Azure OpenAI Inference REST API
description: Data-plane REST API for running inference against deployed Azure OpenAI models, including
  chat completions, completions, embeddings, image generation, and audio transcription/translation. Authenticate
  with API key (api-key header) or Microsoft Entra ID bearer token.
humanURL: https://learn.microsoft.com/en-us/azure/ai-services/openai/reference
baseURL: https://{your-resource-name}.openai.azure.com/openai
tags:
- Inference
- Chat Completions
- Embeddings
- Image Generation
- Audio
properties:
- type: Documentation
  url: https://learn.microsoft.com/en-us/azure/ai-services/openai/reference
- type: API Reference (Latest)
  url: https://learn.microsoft.com/en-us/azure/foundry/openai/latest
- type: API Reference (Preview)
  url: https://learn.microsoft.com/en-us/azure/foundry/openai/reference-preview-latest
- type: OpenAPI Source
  url: https://github.com/Azure/azure-rest-api-specs/tree/main/specification/cognitiveservices/data-plane/AzureOpenAI