Patronus AI

Patronus AI is a frontier lab building evaluation infrastructure and Digital World Models for human-aligned AGI. Its evaluator models include Lynx (a hallucination-detection model reported to outperform GPT-4 on hallucination tasks) and GLIDER (an evaluation model producing reasoning chains with explainable judgments). Coverage spans research science, software development, customer service, product applications, finance, and multi-turn dialogue / long-horizon task planning.

API entry from apis.yml

apis.yml Raw ↑
name: Patronus AI
description: Patronus AI is a frontier lab building evaluation infrastructure and Digital World Models
  for human-aligned AGI. Its evaluator models include Lynx (a hallucination-detection model reported to
  outperform GPT-4 on hallucination tasks) and GLIDER (an evaluation model producing reasoning chains
  with explainable judgments). Coverage spans research science, software development, customer service,
  product applications, finance, and multi-turn dialogue / long-horizon task planning.
humanURL: https://www.patronus.ai/
baseURL: https://api.patronus.ai
tags:
- Commercial
- Hallucination Detection
- Judge Models
- Lynx
- GLIDER
properties:
- type: Documentation
  url: https://docs.patronus.ai/
- type: Portal
  url: https://app.patronus.ai