NanoCode012
5c7e89105d
Fix: modelloader handling of model_kwargs load_in*bit (#1999)
* fix: load_in_*bit not properly read
* fix: load_*bit check
* fix: typo
* refactor: load * bit handling
* feat: add test dpo lora multi-gpu
* fix: turn off sample packing for dpo
* fix: missing warmup_steps
* fix: test to load in 8bit for lora
* skip 8bit lora on h100, add 4bit lora on h100 to multi gpu tests
* chore: reduce max_steps
---------
Co-authored-by: Wing Lian <wing.lian@gmail.com>
2024-10-30 14:41:34 -04:00
..
2024-10-13 12:15:18 -04:00
2024-10-30 14:41:34 -04:00
2023-12-12 09:39:22 -08:00
2024-02-01 10:18:42 -05:00
2024-10-29 10:14:51 +07:00
2024-10-25 09:06:56 -04:00
2024-08-21 13:36:51 -04:00
2024-10-30 12:27:04 -04:00
2024-02-26 12:24:14 -05:00
2023-08-12 15:14:56 -04:00
2024-03-14 11:05:42 -04:00
2024-02-28 12:57:45 -05:00
2024-05-23 17:32:14 -04:00
2023-08-12 15:14:56 -04:00
2024-07-17 10:58:38 -04:00
2024-06-04 16:11:56 -04:00
2024-04-19 01:03:04 -04:00
2024-08-22 11:46:57 -04:00
2024-02-12 21:23:28 -08:00
2024-01-31 18:13:13 -05:00
2024-10-29 10:14:51 +07:00
2024-10-22 08:52:21 -04:00