Sentiment Tuning Examples
Usage
# 1. run directly
python examples/scripts/ppo.py
# 2. run via `accelerate` (recommended), enabling more features (e.g., multiple GPUs, deepspeed)
accelerate config # will prompt you to define the training configuration
accelerate launch examples/scripts/ppo.py # launches training
# 3. get help text and documentation
python examples/scripts/ppo.py --help
# 4. configure logging with wandb and, say, mini_batch_size=1 and gradient_accumulation_steps=16
python examples/scripts/ppo.py --ppo_config.log_with wandb --ppo_config.mini_batch_size 1 --ppo_config.gradient_accumulation_steps 16

Few notes on multi-GPU
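As a rough illustration of how the settings in example 4 interact with a multi-GPU launch, the sketch below works out how many samples contribute to one optimizer step. It assumes the usual data-parallel convention that batch settings apply per process; `NUM_PROCESSES` is a hypothetical value standing in for whatever was chosen during `accelerate config`, and the variable names are illustrative rather than flags of the script.

```shell
#!/bin/sh
# Values taken from usage example 4 above.
MINI_BATCH_SIZE=1
GRAD_ACCUM_STEPS=16

# Assumption: number of processes (GPUs) chosen in `accelerate config`.
NUM_PROCESSES=2

# Samples accumulated on one device before each optimizer step.
PER_DEVICE=$((MINI_BATCH_SIZE * GRAD_ACCUM_STEPS))

# Samples across all devices per optimizer step, assuming data parallelism.
GLOBAL=$((PER_DEVICE * NUM_PROCESSES))

echo "per-device samples per optimizer step: $PER_DEVICE"
echo "global samples per optimizer step: $GLOBAL"
```

Under these assumptions, doubling the number of processes doubles the global figure, so the per-device values may need lowering to keep the overall batch comparable to a single-GPU run.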
Benchmarks

With and without gradient accumulation

Comparing different models (gpt2, gpt2-xl, falcon, llama2)

With and without PEFT
