AI Semantic Cache Plugin
Caches LLM responses by prompt similarity so semantically equivalent requests can be served from cache, reducing latency and provider spend.
The AI Semantic Cache Plugin is one of 21 APIs that Kong AI Gateway publishes on the APIs.io network.
It is tagged Plugin, Caching, and Semantic. Its published artifact set on APIs.io includes API documentation.
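To make the idea of caching by prompt similarity concrete, here is a minimal sketch in Python. It is not Kong's implementation and does not use the plugin's API. The names `embed`, `SemanticCache`, `cached_completion`, and the `threshold` parameter are hypothetical, and the bag-of-words "embedding" is only a stand-in for a real embedding model so the example runs without dependencies.

```python
import math
import re
from collections import Counter
from typing import Callable, List, Optional, Tuple


def embed(text: str) -> Counter:
    """Toy embedding (assumption): lowercase word counts instead of a real model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class SemanticCache:
    """Hypothetical in-memory cache keyed by prompt similarity."""

    def __init__(self, threshold: float = 0.8) -> None:
        self.threshold = threshold  # minimum similarity that counts as a hit
        self.entries: List[Tuple[Counter, str]] = []

    def lookup(self, prompt: str) -> Optional[str]:
        """Return the response of the most similar stored prompt, or None on a miss."""
        query = embed(prompt)
        best_score, best_response = 0.0, None
        for vector, response in self.entries:
            score = cosine(query, vector)
            if score > best_score:
                best_score, best_response = score, response
        return best_response if best_score >= self.threshold else None

    def store(self, prompt: str, response: str) -> None:
        self.entries.append((embed(prompt), response))


def cached_completion(
    cache: SemanticCache, prompt: str, call_llm: Callable[[str], str]
) -> str:
    """Serve from cache when a similar prompt exists; otherwise call the provider."""
    hit = cache.lookup(prompt)
    if hit is not None:
        return hit
    response = call_llm(prompt)  # the only place a provider call is made
    cache.store(prompt, response)
    return response
```

In this sketch, a second request that differs only in casing or punctuation maps to the same vector and is answered from the cache, while an unrelated prompt falls below the threshold and goes to the provider. The threshold trades hit rate against the risk of returning an answer to a prompt that is merely similar, so it would need tuning for any real workload.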