TRL

🌍EXAMPLES

  • Example Overview
  • Sentiment Tuning
  • Training with PEFT
  • Detoxifying a Language Model
  • Training StackLlama
  • Learning to Use Tools
  • Multi Adapter RLHF