Triton Metrics API
Prometheus-compatible metrics API for monitoring server and model performance including inference request counts, latencies, GPU utilization, and memory usage.
Documentation
Documentation
https://github.com/triton-inference-server/server/blob/main/docs/user_guide/metrics.md