Parasail Inference API
OpenAI-compatible real-time and streaming inference API exposing serverless access to popular open-weight LLMs, embedding models, and the model catalog. Endpoints: /v1/chat/completions, /v1/completions, /v1/embeddings, /v1/models. Bearer-token authentication; pay-per-token billing; supports streaming, tool use, and structured outputs. Compatible with the OpenAI Python and TypeScript clients by overriding base_url.
Parasail Inference API is one of 3 APIs that Parasail publishes on the APIs.io network, described by a machine-readable OpenAPI specification.
This API exposes 3 machine-runnable capabilities that can be deployed as REST, MCP, or Agent Skill surfaces via Naftiko and 1 JSON Schema definition.
Tagged areas include AI, Artificial Intelligence, Inference, Chat, and Embeddings. The published artifact set on APIs.io includes API documentation, an OpenAPI specification, a JSON-LD context, 3 Naftiko capability specs, and 1 JSON Schema.