TRL

🌍API

• Model Classes
• Trainer Classes
• Reward Model Training
• Supervised Fine-Tuning
• PPO Trainer
• Best of N Sampling
• DPO Trainer
• Denoising Diffusion Policy Optimization
• Text Environments