AI Rate Limiting Advanced Plugin

Token-aware rate limiting tailored for LLM traffic, with per-consumer and per-model budgets rather than just request counts.

AI Rate Limiting Advanced Plugin is one of 21 APIs that Kong AI Gateway publishes on the APIs.io network.

Tagged areas include Plugin, Rate Limiting, and Token Budget. The published artifact set on APIs.io includes API documentation.

API entry from apis.yml

apis.yml Raw ↑
aid: kong-ai-gateway:ai-rate-limiting-advanced-plugin
name: AI Rate Limiting Advanced Plugin
description: Token-aware rate limiting tailored for LLM traffic, with per-consumer and per-model budgets
  rather than just request counts.
humanURL: https://developer.konghq.com/plugins/ai-rate-limiting-advanced/
tags:
- Plugin
- Rate Limiting
- Token Budget
properties:
- type: Documentation
  url: https://developer.konghq.com/plugins/ai-rate-limiting-advanced/