Wing Lian
|
54d2ac155b
|
Mixtral fixes 20240124 (#1192) [skip ci]
* mixtral nccl fixes
* make sure to patch for z3
|
2024-01-24 14:59:57 -05:00 |
|
Wing Lian
|
62eaee7649
|
make phi training work with Loras (#588)
* valdiation for phi loras
* fix model config class check
* update readme for phi traiing
|
2023-09-15 20:51:55 -04:00 |
|
Wing Lian
|
228420972e
|
Phi examples (#569)
* add phi full ft example
* Add readme to point out that deepspeed should be used
* zero1 is better than zero2 for phi
|
2023-09-14 11:17:47 -04:00 |
|