Triton Inference Server Triton GRPC API High-performance gRPC API for model inference with support for streaming and binary tensor data. Documentation GitHub Documentation 📖 Documentation https://github.com/triton-inference-server/server/blob/main/docs/protocol/README.md Other Resources 🔗 Protocol Buffers https://github.com/triton-inference-server/common/blob/main/protobuf/grpc_service.proto 🔗 Examples https://github.com/triton-inference-server/client/tree/main/src/python/examples