Wing Lian
7d1d22f72f
ORPO Trainer replacement (#1551)
* WIP use trl ORPOTrainer
* fixes to make orpo work with trl
* fix the chat template loading
* make sure to handle the special tokens and add_generation for assistant turn too
2024-04-19 17:25:36 -04:00