Skip to main content
A 404 response from the W&B Inference API means the server could not find the model or resource you asked for. The response body often includes a message such as “The requested resource was not found”, which usually indicates the model identifier in your request does not match any model available through the API.

Why this happens

  • Incorrect model ID: The model name or path in your request is misspelled or formatted incorrectly.
  • Model is not available on W&B Inference: Not all models hosted on other providers are available through the W&B Inference API. The model you are requesting may not have been onboarded.
  • Using the wrong base URL: If you are pointing at a different provider’s endpoint but using a W&B API key, the model ID will not resolve correctly.
  • Model was removed: A model that was previously available may have been deprecated or removed.

What you can do

  1. Check the model ID
    • Verify the exact model identifier against Available models.
    • Model IDs are case-sensitive and typically use the format provider/model-name (for example, meta-llama/Llama-3.3-70B-Instruct).
  2. Confirm the base URL
    • The W&B Inference base URL is https://api.inference.wandb.ai/v1. Ensure your client uses this host and not the Multi-tenant Cloud API host (https://api.wandb.ai) or another provider’s endpoint. For usage tracking, some clients also send the optional OpenAI-Project header set to team/project as shown in the list models API reference.
  3. Request access to a new model
    • If the model you need is not currently supported, contact W&B support to request it be added.
For more information, see Serverless Inference and the list models API reference.
Server Errors