* add phi full ft example * Add readme to point out that deepspeed should be used * zero1 is better than zero2 for phi
1.3 KiB
1.3 KiB
* add phi full ft example * Add readme to point out that deepspeed should be used * zero1 is better than zero2 for phi