API error code 404 - Model not found - Weights & Biases Documentation

A 404 response from the W&B Inference API means the server could not find the model or resource you asked for. The response body often includes a message such as “The requested resource was not found”, which usually indicates the model identifier in your request does not match any model available through the API.
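As a rough illustration of spotting this case in client code, the sketch below turns an HTTP error into a readable hint. It uses only the Python standard library. The helper name `explain_http_error` is hypothetical, and the sketch assumes the error body is plain text or JSON that can be shown as-is, since the exact body format is not specified here.

```python
import io
import urllib.error


def explain_http_error(error):
    """Return a short, human-readable hint for an HTTPError (hypothetical helper)."""
    # Read whatever the server sent back; the body format is an assumption.
    body = error.read().decode("utf-8", errors="replace")
    if error.code == 404:
        return (
            "404 from the API: check the model ID and the base URL. "
            f"Server message: {body}"
        )
    return f"HTTP {error.code}: {body}"


# Simulated 404 so the example runs without a network call.
simulated = urllib.error.HTTPError(
    "https://api.inference.wandb.ai/v1",
    404,
    "Not Found",
    None,
    io.BytesIO(b"The requested resource was not found"),
)
print(explain_http_error(simulated))
```

In a real client you would call the helper from an `except urllib.error.HTTPError as error:` block around the request.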

Why this happens

- Incorrect model ID: The model name or path in your request is misspelled or formatted incorrectly.
- Model is not available on W&B Inference: Not all models hosted on other providers are available through the W&B Inference API. The model you are requesting may not have been onboarded.
- Using the wrong base URL: If you are pointing at a different provider’s endpoint but using a W&B API key, the model ID will not resolve correctly.
- Model was removed: A model that was previously available may have been deprecated or removed.

What you can do

Check the model ID
- Verify the exact model identifier against Available models.
- Model IDs are case-sensitive and typically use the format provider/model-name (for example, meta-llama/Llama-3.3-70B-Instruct).
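The check above can be sketched in a few lines of standard-library Python. This is a minimal sketch under several assumptions: the list models endpoint is assumed to live at `/models` under the base URL and to return an OpenAI-style body of the form `{"data": [{"id": ...}]}`, authentication is assumed to use a Bearer token, and the helper names `fetch_model_ids` and `check_model_id` are hypothetical. Confirm the details against the list models API reference.

```python
import difflib
import json
import urllib.request

BASE_URL = "https://api.inference.wandb.ai/v1"


def fetch_model_ids(api_key, base_url=BASE_URL):
    """Return the model IDs reported by the list models endpoint.

    Assumptions: the endpoint path is /models, it accepts a Bearer token,
    and the response looks like {"data": [{"id": "..."}, ...]}.
    """
    request = urllib.request.Request(
        f"{base_url}/models",
        headers={"Authorization": f"Bearer {api_key}"},
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        payload = json.load(response)
    return [entry["id"] for entry in payload["data"]]


def check_model_id(model_id, available_ids):
    """Return (is_valid, suggestions) using an exact, case-sensitive match."""
    if model_id in available_ids:
        return True, []
    # Compare lowercased names so case-only mistakes show up as suggestions.
    lowered = {candidate.lower(): candidate for candidate in available_ids}
    close = difflib.get_close_matches(
        model_id.lower(), list(lowered), n=3, cutoff=0.6
    )
    return False, [lowered[name] for name in close]
```

A typical use would be `check_model_id(requested_id, fetch_model_ids(api_key))`. If the first element of the result is `False`, the second element lists the closest available IDs, which helps catch capitalization and spelling mistakes.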
Confirm the base URL
- The W&B Inference base URL is https://api.inference.wandb.ai/v1. Ensure your client uses this host and not the Multi-tenant Cloud API host (https://api.wandb.ai) or another provider’s endpoint. For usage tracking, some clients also send the optional OpenAI-Project header set to team/project as shown in the list models API reference.
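One way to guard against a misconfigured host is to validate the base URL before sending any request. The sketch below is illustrative only: the helper names `is_inference_base_url` and `build_headers` are hypothetical, and the Bearer authorization scheme is an assumption based on OpenAI-compatible conventions. The `OpenAI-Project` header is included only when a `team/project` value is supplied.

```python
from urllib.parse import urlparse

WANDB_INFERENCE_BASE_URL = "https://api.inference.wandb.ai/v1"


def is_inference_base_url(url):
    """Return True if url points at the W&B Inference base URL."""
    expected = urlparse(WANDB_INFERENCE_BASE_URL)
    actual = urlparse(url)
    return (
        actual.scheme == expected.scheme
        and actual.hostname == expected.hostname
        # Tolerate a trailing slash on the configured URL.
        and actual.path.rstrip("/") == expected.path
    )


def build_headers(api_key, project=None):
    """Build request headers; Bearer auth is an assumption."""
    headers = {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }
    if project:
        # Optional usage-tracking header, set to team/project.
        headers["OpenAI-Project"] = project
    return headers


print(is_inference_base_url("https://api.inference.wandb.ai/v1"))
print(is_inference_base_url("https://api.wandb.ai"))
```

Running the check at client start-up makes a wrong host fail fast, before it has a chance to surface as a 404 on a model lookup.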
Request access to a new model
- If the model you need is not currently supported, contact W&B support to request it be added.

For more information, see Serverless Inference and the list models API reference.
