AI Rate Limiting Advanced Plugin
Token-aware rate limiting tailored for LLM traffic, with per-consumer and per-model budgets rather than just request counts.
AI Rate Limiting Advanced Plugin is one of 21 APIs that Kong AI Gateway publishes on the APIs.io network.
Tagged areas include Plugin, Rate Limiting, and Token Budget. The published artifact set on APIs.io includes API documentation.