Kong AI Gateway

AI Semantic Cache Plugin

Caches LLM responses by prompt similarity so semantically equivalent requests can be served from cache, reducing latency and provider spend.

AI Semantic Cache Plugin is one of 21 APIs that Kong AI Gateway publishes on the APIs.io network.

Tagged areas include Plugin, Caching, and Semantic. The published artifact set on APIs.io includes API documentation.

Documentation GitHub

Documentation

https://developer.konghq.com/plugins/ai-semantic-cache/

API entry from apis.yml

aid: kong-ai-gateway:ai-semantic-cache-plugin
name: AI Semantic Cache Plugin
description: Caches LLM responses by prompt similarity so semantically equivalent requests can be served
  from cache, reducing latency and provider spend.
humanURL: https://developer.konghq.com/plugins/ai-semantic-cache/
tags:
- Plugin
- Caching
- Semantic
properties:
- type: Documentation
  url: https://developer.konghq.com/plugins/ai-semantic-cache/