VED
d0d26d5064
feat: Add GDPO Support (#3353)
* gdpo support - test left
* lint
* fixxes for vllm serv
* test advantages
* docss
* lint
* lint =
* gdpo simple + lint
* lint nit
* example
* lint
* trl 0.27.0
* blocklist
* test assert rmv
* add validation check for GDPO + sum_then_normalize
---------
Co-authored-by: Wing Lian <wing@axolotl.ai>
2026-01-21 17:22:45 -05:00
..
2025-07-31 15:25:02 -04:00
2026-01-21 17:22:45 -05:00
2024-08-09 11:50:13 -04:00
2025-12-19 10:43:47 -05:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-10-14 15:54:05 -04:00
2025-06-10 19:53:07 -04:00
2025-10-08 10:43:41 -04:00
2025-08-23 23:37:33 -04:00