core.builders.rl
core.builders.rl
Builder for RLHF trainers
Classes
| Name | Description |
|---|---|
| HFPPOTrainerBuilder | HF Factory class for PPO Trainer |
| HFRLTrainerBuilder | Trainer factory class for TRL-based RLHF trainers (e.g. DPO) |
HFPPOTrainerBuilder
core.builders.rl.HFPPOTrainerBuilder(cfg, model, tokenizer, processor=None)HF Factory class for PPO Trainer
HFRLTrainerBuilder
core.builders.rl.HFRLTrainerBuilder(cfg, model, tokenizer, processor=None)Trainer factory class for TRL-based RLHF trainers (e.g. DPO)