CoreWeave

CoreWeave Inference API

The CoreWeave Inference API manages Deployments, Gateways, and Capacity Claims for serverless and dedicated AI inference. It is used to create, update, and route to managed model deployments backed by CoreWeave's GPU fleet.

CoreWeave Inference API is one of 5 APIs that CoreWeave publishes on the APIs.io network.

Tagged areas include AI, Deployments, Gateways, Inference, and Models. The published artifact set on APIs.io includes API documentation and an API reference.

Documentation GitHub

Documentation

https://docs.coreweave.com/products/inference/reference/api-overview

Other Resources

https://docs.coreweave.com/products/inference/reference/deploymentservice/create-deployment

https://docs.coreweave.com/products/inference/reference/gatewayservice/create-gateway

https://docs.coreweave.com/products/inference/reference/capacityclaimservice/create-capacity-claim

API entry from apis.yml

aid: coreweave:inference-api
name: CoreWeave Inference API
description: The CoreWeave Inference API manages Deployments, Gateways, and Capacity Claims for serverless
  and dedicated AI inference. It is used to create, update, and route to managed model deployments backed
  by CoreWeave's GPU fleet.
humanURL: https://docs.coreweave.com/products/inference/reference/api-overview
tags:
- AI
- Deployments
- Gateways
- Inference
- Models
properties:
- type: Documentation
  url: https://docs.coreweave.com/products/inference/reference/api-overview
- type: Reference
  url: https://docs.coreweave.com/products/inference/reference/deploymentservice/create-deployment
- type: Reference
  url: https://docs.coreweave.com/products/inference/reference/gatewayservice/create-gateway
- type: Reference
  url: https://docs.coreweave.com/products/inference/reference/capacityclaimservice/create-capacity-claim