CoreWeave Inference API
The CoreWeave Inference API manages Deployments, Gateways, and Capacity Claims for serverless and dedicated AI inference. It is used to create, update, and route to managed model deployments backed by CoreWeave's GPU fleet.
Documentation
Other Resources
Reference
https://docs.coreweave.com/products/inference/reference/deploymentservice/create-deployment
Reference
https://docs.coreweave.com/products/inference/reference/gatewayservice/create-gateway
Reference
https://docs.coreweave.com/products/inference/reference/capacityclaimservice/create-capacity-claim