Cloudflare Workers AI API
The Cloudflare Workers AI API enables developers to run machine learning models on Cloudflare's global network via a REST API. The catalog offers 78+ open-source models spanning text generation (Llama 3.1, Mistral, Qwen3, Kimi K2.6), embeddings (BGE, EmbeddingGemma), text-to-image (Flux 2, Stable Diffusion XL), automatic speech recognition (Whisper, Deepgram Nova 3, Flux), text-to-speech (Aura 2, MeloTTS), image-to-text (LLaVA), translation (Indic Trans2, M2M100), classification, object detection (DETR), and voice activity detection. OpenAI-compatible endpoints are available.
Documentation
Documentation
https://developers.cloudflare.com/workers-ai/
GettingStarted
https://developers.cloudflare.com/workers-ai/get-started/rest-api/
APIReference
https://developers.cloudflare.com/workers-ai/configuration/open-ai-compatibility/
APIReference
https://developers.cloudflare.com/workers-ai/models/