Files
axolotl/docs
Andrew Wu 90090fa9e8 DPO support loss types (#3566)
* Support loss_type/loss_weights DPO

* Validate dpo loss type/weights only set for dpo

* Tests: Update ipo tests to use new path

* Docs: Update docs for new ipo path

* PR fixes - typo/validation

* PR nit - warning

* chore: fix warnings arg

---------

Co-authored-by: NanoCode012 <nano@axolotl.ai>
2026-04-23 00:25:28 -04:00
..
2026-04-23 00:25:28 -04:00
2026-01-27 17:08:24 -05:00
2026-03-16 00:13:18 -04:00
2025-06-18 15:36:53 -04:00
2026-04-21 10:16:03 -04:00
2026-04-21 10:16:03 -04:00
2025-09-17 10:38:15 +01:00
2026-04-23 00:25:28 -04:00
2025-09-02 12:08:44 -04:00