NanoCode012
5c7e89105d
Fix: modelloader handling of model_kwargs load_in*bit (#1999)
* fix: load_in_*bit not properly read
* fix: load_*bit check
* fix: typo
* refactor: load * bit handling
* feat: add test dpo lora multi-gpu
* fix: turn off sample packing for dpo
* fix: missing warmup_steps
* fix: test to load in 8bit for lora
* skip 8bit lora on h100, add 4bit lora on h100 to multi gpu tests
* chore: reduce max_steps
---------
Co-authored-by: Wing Lian <wing.lian@gmail.com>
2024-10-30 14:41:34 -04:00
..
2024-09-01 19:29:37 -04:00
2024-10-30 14:41:34 -04:00
2024-10-30 14:41:34 -04:00
2023-11-06 18:33:01 -05:00
2023-09-15 15:46:54 -04:00
2024-07-10 11:15:44 -04:00
2024-01-22 21:01:42 -05:00
2024-07-27 10:24:11 -04:00
2024-07-17 10:58:38 -04:00
2024-10-25 09:06:56 -04:00
2024-07-14 19:12:57 -04:00
2024-01-09 21:23:23 -05:00
2023-11-06 18:33:01 -05:00
2024-04-19 01:03:04 -04:00
2024-07-14 19:12:57 -04:00
2024-10-25 11:28:23 -04:00
2024-05-29 15:41:46 -04:00
2024-02-06 00:35:30 -05:00
2024-10-13 15:11:13 -04:00
2024-10-30 14:41:34 -04:00