Gemini Context Caching API

Cache input tokens for repeated use across multiple requests to reduce costs and improve latency for large context workloads.

API entry from apis.yml

apis.yml Raw ↑
name: Gemini Context Caching API
description: Cache input tokens for repeated use across multiple requests to reduce costs and improve
  latency for large context workloads.
humanURL: https://ai.google.dev/gemini-api/docs/caching
baseURL: https://generativelanguage.googleapis.com
tags:
- Caching
- Cost Optimization
- Performance
properties:
- type: Documentation
  url: https://ai.google.dev/gemini-api/docs/caching
- type: APIReference
  url: https://ai.google.dev/api/caching