Files
axolotl/tests
Wing Lian 7d1d22f72f ORPO Trainer replacement (#1551)
* WIP use trl ORPOTrainer

* fixes to make orpo work with trl

* fix the chat template laoding

* make sure to handle the special tokens and add_generation for assistant turn too
2024-04-19 17:25:36 -04:00
..
2024-04-19 17:25:36 -04:00
2024-04-19 01:03:04 -04:00
2024-02-26 12:24:14 -05:00