OctoAI Text Gen Inference API
OpenAI-compatible chat and text-completion endpoints serving open-source LLMs including Llama 2, Llama 3, Mixtral 8x7B, Mistral 7B, Code Llama, and customer fine-tunes. Supported streaming, function calling, JSON mode, and a shared model catalog. The API was reachable at https://text.octoai.run/v1 and shut down on 31 October 2024.