TRL
Search...
Ctrl + K
🌍
EXAMPLES
Example Overview
Sentiment Tuning
Training with PEFT
Detoxifying a Language Model
Training StackLlama
Learning to Use Tools
Multi Adapter RLHF
Previous
Text Environments
Next
Example Overview