core.trainers.grpo.trainer
core.trainers.grpo.trainer
Axolotl GRPO trainers (with and without sequence parallelism handling)
Classes
| Name | Description |
|---|---|
| AxolotlGRPOSequenceParallelTrainer | Extend the base GRPOTrainer for sequence parallelism handling |
| AxolotlGRPOTrainer | Extend the base GRPOTrainer for axolotl helpers |
AxolotlGRPOSequenceParallelTrainer
core.trainers.grpo.trainer.AxolotlGRPOSequenceParallelTrainer(
self,
model,
reward_funcs,
args=None,
train_dataset=None,
eval_dataset=None,
processing_class=None,
reward_processing_classes=None,
callbacks=None,
optimizers=(None, None),
peft_config=None,
)Extend the base GRPOTrainer for sequence parallelism handling
Methods
| Name | Description |
|---|---|
| get_train_dataloader | Get dataloader for training |
get_train_dataloader
core.trainers.grpo.trainer.AxolotlGRPOSequenceParallelTrainer.get_train_dataloader(
)Get dataloader for training
AxolotlGRPOTrainer
core.trainers.grpo.trainer.AxolotlGRPOTrainer()Extend the base GRPOTrainer for axolotl helpers