Available models

See the models you can train with Serverless RL.

Serverless RL currently only supports a single open-source foundation model for training.

To express interest in a particular model, contact support.

Model catalog

Model Model ID (for API usage) Type Context Window Parameters Description
Qwen2.5 14B Qwen/Qwen2.5-14B-Instruct Text 32K 14B (Active-Total) Dense model optimized for throughput and quality