Files
axolotl/tests/e2e/test_dpo.py
Wing Lian c996881ec2 add support for rpo_alpha (#1681)
* add support for rpo_alpha

* Add smoke test for dpo + nll loss
2024-06-04 16:09:51 -04:00

11 KiB