NVIDIA DGX Cloud Lepton API

Managed AI platform API exposing model endpoints (HTTP server with OpenAI-compatible chat surface), dev sessions with managed GPUs, distributed training jobs, and batch processing. Endpoints are deployed at unique URLs per workload via the Lepton dashboard at dashboard.dgxc-lepton.nvidia.com.

API entry from apis.yml

apis.yml Raw ↑
aid: lepton-ai:dgx-cloud-lepton
name: NVIDIA DGX Cloud Lepton API
description: Managed AI platform API exposing model endpoints (HTTP server with OpenAI-compatible chat
  surface), dev sessions with managed GPUs, distributed training jobs, and batch processing. Endpoints
  are deployed at unique URLs per workload via the Lepton dashboard at dashboard.dgxc-lepton.nvidia.com.
image: https://kinlane-productions.s3.amazonaws.com/apis-json/apis-json-logo.jpg
humanURL: https://docs.nvidia.com/dgx-cloud/lepton/get-started/
baseURL: https://dashboard.dgxc-lepton.nvidia.com
tags:
- AI
- ML
- Endpoints
- Training
- Batch
- GPU
- OpenAI Compatible
properties:
- type: Documentation
  url: https://docs.nvidia.com/dgx-cloud/lepton/get-started/
- type: ProductPage
  url: https://www.nvidia.com/en-us/data-center/dgx-cloud-lepton/
- type: Examples
  url: https://docs.nvidia.com/dgx-cloud/lepton/examples/
- type: AcquisitionNote
  url: NVIDIA acquired Lepton AI; lepton.ai now redirects to NVIDIA DGX Cloud Lepton.