axolotl/docs
Motoki Wu 98c25e15cb Add ORPO example and e2e test (#1572)
* add example for mistral orpo

* sample_packing: false for orpo

* go to load_dataset (since load_rl_datasets requires a transform_fn, which only dpo uses currently)
2024-04-27 12:07:06 -04:00