Files
axolotl/tests/e2e
Andrew Wu a81feabbd9 DPO transformers v0.29 fixes (#3560) [skip ci]
* Deperecate dpo_norm_loss

* Rename chosen/rejected_input_ids to chosen/rejected_ids to match TRL https://github.com/huggingface/trl/pull/5179

* Remove deprecated rpo_alpha

* Remove dead_code tokenize_row

* Add _tokenize override to prevent double bos token on Llama DPO

* Fix DPO loss type now list not string

* Linting fix

* PR fixes

* update _tokenize override for DPO for multimodal
2026-03-31 19:04:53 -04:00
..
2023-11-06 18:33:01 -05:00
2026-01-27 17:08:24 -05:00
2026-01-27 17:08:24 -05:00
2026-03-05 13:40:45 -05:00