BentoML Service REST API

Auto-generated REST API endpoints produced when BentoML services are deployed. Each decorated service method becomes an HTTP POST endpoint. Supports custom routes, path prefixes, adaptive batching, async task queues, and context-aware request/response handling for model inference workloads.

API entry from apis.yml

apis.yml Raw ↑
aid: bentoml:bentoml-service-api
name: BentoML Service REST API
description: 'Auto-generated REST API endpoints produced when BentoML services are deployed. Each decorated
  service method becomes an HTTP POST endpoint. Supports custom routes, path prefixes, adaptive batching,
  async task queues, and context-aware request/response handling for model inference workloads.

  '
humanURL: https://docs.bentoml.com/en/latest/build-with-bentoml/services.html
baseURL: https://localhost:3000
tags:
- inference
- REST API
- model serving
properties:
- type: Documentation
  url: https://docs.bentoml.com/en/latest/build-with-bentoml/services.html