Files
axolotl/tests/e2e
Ben Redmond 22ae21a6c2 Add KTO support (#1640)
* add kto support

* test cleanup

* fix outdated comment

* fix llama3 ultra

* chore: lint

* update to use rl_beta instead of dpo_beta

---------

Co-authored-by: Wing Lian <wing.lian@gmail.com>
2024-05-20 16:05:16 -04:00
..
2024-04-19 01:03:04 -04:00
2023-11-06 18:33:01 -05:00
2024-05-20 16:05:16 -04:00
2023-11-06 18:33:01 -05:00
2024-04-19 01:03:04 -04:00