OctoAI Text Gen Inference API

OpenAI-compatible chat and text-completion endpoints serving open-source LLMs including Llama 2, Llama 3, Mixtral 8x7B, Mistral 7B, Code Llama, and customer fine-tunes. Supported streaming, function calling, JSON mode, and a shared model catalog. The API was reachable at https://text.octoai.run/v1 and shut down on 31 October 2024.

OctoAI Text Gen Inference API is one of 5 APIs that OctoAI publishes on the APIs.io network.

Tagged areas include LLM, Chat, Completions, OpenAI Compatible, and Defunct.

API entry from apis.yml

apis.yml Raw ↑
aid: octoai:octoai-text-gen-api
name: OctoAI Text Gen Inference API
description: OpenAI-compatible chat and text-completion endpoints serving open-source LLMs including Llama
  2, Llama 3, Mixtral 8x7B, Mistral 7B, Code Llama, and customer fine-tunes. Supported streaming, function
  calling, JSON mode, and a shared model catalog. The API was reachable at https://text.octoai.run/v1
  and shut down on 31 October 2024.
humanURL: https://octo.ai
baseURL: https://text.octoai.run/v1
tags:
- LLM
- Chat
- Completions
- OpenAI Compatible
- Defunct
properties:
- type: StatusPage
  url: https://octo.ai
  description: Domain now 301-redirects to nvidia.com; service terminated 31 October 2024.