W&B Inference

Access open-source foundation models through W&B Weave and an OpenAI-compatible API

W&B Inference gives you access to leading open-source foundation models through W&B Weave and an OpenAI-compatible API. You can:

  • Build AI applications and agents without signing up for a hosting provider or self-hosting a model
  • Try supported models in the W&B Weave Playground

With Weave, you can trace, evaluate, monitor, and improve your W&B Inference-powered applications.

Quickstart

Here’s a simple example using Python:

import openai

client = openai.OpenAI(
    # The custom base URL points to W&B Inference
    base_url="https://api.inference.wandb.ai/v1",

    # Get your API key from https://wandb.ai/authorize
    api_key="<your-api-key>"
)

response = client.chat.completions.create(
    model="meta-llama/Llama-3.1-8B-Instruct",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Tell me a joke."}
    ],
)

print(response.choices[0].message.content)
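Because the endpoint is OpenAI-compatible, responses can also be consumed incrementally. Below is a minimal sketch of a streaming helper, assuming the service honors the standard `stream=True` parameter of the Chat Completions API; the `stream_chat` helper itself is illustrative, not part of W&B Inference:

```python
def stream_chat(client, model, messages):
    """Yield content fragments from a streaming chat completion.

    `client` is any OpenAI-compatible client, e.g. openai.OpenAI
    pointed at https://api.inference.wandb.ai/v1 as shown above.
    """
    stream = client.chat.completions.create(
        model=model,
        messages=messages,
        stream=True,  # assumes standard OpenAI streaming is supported
    )
    for chunk in stream:
        # Each chunk carries an incremental delta; its content may be
        # None (e.g. a chunk that only sets the role), so skip those.
        delta = chunk.choices[0].delta.content
        if delta:
            yield delta
```

Printing each fragment as it arrives (`print(piece, end="")`) produces typewriter-style output instead of waiting for the full completion.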

Next steps

  1. Review the available models, along with usage information and limits
  2. Set up your account using the prerequisites
  3. Use the service through the API or UI
  4. Try the usage examples

Usage details
Prerequisites

Set up your environment to use W&B Inference

Available Models

Browse the foundation models available through W&B Inference

Usage Information and Limits

Understand pricing, usage limits, and account restrictions for W&B Inference

API Reference

Complete API reference for W&B Inference service

Usage Examples

Learn how to use W&B Inference with practical code examples

UI Guide

Access W&B Inference models through the web interface

Support

Find answers to common W&B Inference questions