Prerequisites
Set up your environment to use W&B Training.
Now in public preview, W&B Training offers serverless reinforcement learning (RL) for post-training large language models (LLMs). Post-training with RL improves a model's reliability on multi-turn, agentic tasks while also increasing speed and reducing costs. RL is a training technique in which models learn to improve their behavior through feedback on their outputs.
W&B Training includes integration with:
To get started, complete the prerequisites for the service, then see OpenPipe’s Serverless RL quickstart to learn how to post-train your models.
Learn how to post-train your models more efficiently using reinforcement learning.
Complete API documentation for W&B Training.