5 Commits

Author | SHA1 | Message | Date
Dan Saunders | c9e37496cb | Fix | 2025-01-13 17:19:06 +00:00
Dan Saunders | 6e72baf287 | continued cleanup and documentation | 2025-01-13 17:19:02 +00:00
Dan Saunders | 6f80d1d670 | fix | 2025-01-13 17:05:38 +00:00
Wing Lian | dd26cc3c0f | add helper to verify the correct model output file exists (#2245) | 2025-01-13 10:43:29 -05:00
    * add helper to verify the correct model output file exists
    * more checks using helper
    * chore: lint
    * fix import and relora model check
    * workaround for trl trainer saves
    * remove stray print
Wing Lian | 68b1369de9 | Reward model (#1879) | 2024-10-13 15:11:13 -04:00