AI Semantic Cache Plugin

Caches LLM responses by prompt similarity so semantically equivalent requests can be served from cache, reducing latency and provider spend.
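
The idea of serving a response based on prompt similarity can be sketched in a few lines of Python. This is an illustrative toy, not Kong's implementation: the bag-of-words `embed` function stands in for a real embedding model, the in-memory list stands in for a vector store, and the names `SemanticCache`, `answer`, `call_llm`, and the 0.8 threshold are all assumptions made for the example.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy bag-of-words vector (assumption: a real setup would call an embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


class SemanticCache:
    """Hypothetical in-memory cache keyed by prompt similarity rather than exact text."""

    def __init__(self, threshold=0.8):
        self.threshold = threshold  # minimum similarity that counts as a hit (assumed value)
        self.entries = []           # list of (prompt vector, cached response)

    def get(self, prompt):
        query = embed(prompt)
        best_score, best_response = 0.0, None
        for vector, response in self.entries:
            score = cosine(query, vector)
            if score > best_score:
                best_score, best_response = score, response
        # Only return the closest entry if it is similar enough.
        return best_response if best_score >= self.threshold else None

    def put(self, prompt, response):
        self.entries.append((embed(prompt), response))


def answer(cache, prompt, call_llm):
    """Return (response, was_cache_hit); call_llm is only invoked on a miss."""
    cached = cache.get(prompt)
    if cached is not None:
        return cached, True
    response = call_llm(prompt)
    cache.put(prompt, response)
    return response, False
```

With this sketch, a second request that differs from the first only in casing or punctuation would be answered from the cache without calling the provider, while an unrelated prompt would fall through to `call_llm`. The threshold controls the trade-off: a lower value yields more hits but raises the risk of returning a response to a prompt that only looks similar.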

AI Semantic Cache Plugin is one of 21 APIs that Kong AI Gateway publishes on the APIs.io network.

The entry is tagged Plugin, Caching, and Semantic. Its published artifacts on APIs.io consist of a link to the API documentation.

API entry from apis.yml

aid: kong-ai-gateway:ai-semantic-cache-plugin
name: AI Semantic Cache Plugin
description: Caches LLM responses by prompt similarity so semantically equivalent requests can be served
  from cache, reducing latency and provider spend.
humanURL: https://developer.konghq.com/plugins/ai-semantic-cache/
tags:
- Plugin
- Caching
- Semantic
properties:
- type: Documentation
  url: https://developer.konghq.com/plugins/ai-semantic-cache/