Weights and Biases Weave

W&B Weave is a platform for evaluating, monitoring, and iterating on AI agents and applications, started with "one line of code." Weave Evaluations enable visual comparison of runs, automatic versioning of datasets and scorers, an interactive playground, and leaderboards. Scorers include pre-built ones (toxicity, hallucination), custom Python scoring functions, human feedback collection, and third-party scorers from providers such as RAGAS and LangChain.

API entry from apis.yml

apis.yml Raw ↑
name: Weights and Biases Weave
description: W&B Weave is a platform for evaluating, monitoring, and iterating on AI agents and applications,
  started with "one line of code." Weave Evaluations enable visual comparison of runs, automatic versioning
  of datasets and scorers, an interactive playground, and leaderboards. Scorers include pre-built ones
  (toxicity, hallucination), custom Python scoring functions, human feedback collection, and third-party
  scorers from providers such as RAGAS and LangChain.
humanURL: https://wandb.ai/site/weave
baseURL: https://api.wandb.ai
tags:
- Weights and Biases
- Commercial
- Scorers
- Leaderboards
- Human Feedback
properties:
- type: Documentation
  url: https://weave-docs.wandb.ai/
- type: GitHubRepository
  url: https://github.com/wandb/weave
- type: Portal
  url: https://wandb.ai