LLMs

info

Introducing Weave: Track, Debug, Evaluate & Monitor Generative AI Applications. To learn more, find the docs for Weave here: wandb.me/weave.

Evaluating the performance of Large Language Models (LLMs) can be difficult. Use W&B Prompts and LLM Monitoring to streamline the evaluation process, providing a visual way to analyze your generative models.

Visualize

W&B Prompts is a suite of LLMOps tools built for developing LLM-powered applications. Use W&B Prompts to visualize and inspect the execution flow of your LLMs, analyze their inputs and outputs, view intermediate results, and securely store and manage your prompts and LLM chain configurations.

W&B Prompts provides several solutions for building and monitoring LLM-based apps. Software developers, prompt engineers, ML practitioners, data scientists, and other stakeholders working with LLMs need cutting-edge tools to:

  • Explore and debug LLM chains and prompts with greater granularity.
  • Monitor and observe LLMs to better understand and evaluate performance, usage, and budgets.

Integrations

W&B also offers lightweight integrations with popular LLM libraries and frameworks.

Next Steps

  • Check out more detailed documentation on Prompts.