KServe Inference API

KServe (formerly KFServing) provides a serverless model inference API on Kubernetes, supporting standardized prediction protocols, autoscaling, and multi-framework model serving.

OpenAPI Specification

rest_predict_v2.yaml Raw ↑