NanoCode012
5c7e89105d
Fix: modelloader handling of model_kwargs load_in*bit (#1999)
* fix: load_in_*bit not properly read
* fix: load_*bit check
* fix: typo
* refactor: load * bit handling
* feat: add test dpo lora multi-gpu
* fix: turn off sample packing for dpo
* fix: missing warmup_steps
* fix: test to load in 8bit for lora
* skip 8bit lora on h100, add 4bit lora on h100 to multi gpu tests
* chore: reduce max_steps
---------
Co-authored-by: Wing Lian <wing.lian@gmail.com>
2024-10-30 14:41:34 -04:00