* add phi full ft example
* Add readme to point out that deepspeed should be used
* zero1 is better than zero2 for phi
Phi
Due to some nuances in the Phi model code, please use DeepSpeed when training Phi. ZeRO stage 1 (deepspeed/zero1.json) works better than ZeRO stage 2 for this model.
accelerate launch scripts/finetune.py examples/phi/phi-ft.yml --deepspeed deepspeed/zero1.json
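
For readers unfamiliar with DeepSpeed configs, the following is a minimal sketch of what a ZeRO stage 1 config could look like. It is an assumption for illustration only: the actual contents of deepspeed/zero1.json in this repository may differ, and the helper names `make_zero1_config` and `write_config` are hypothetical. The "auto" values rely on the Hugging Face Trainer DeepSpeed integration filling them in from the training arguments.

```python
import json


def make_zero1_config():
    """Return an assumed minimal ZeRO stage 1 config (not the repo's actual file)."""
    return {
        # Stage 1 partitions only the optimizer states across GPUs.
        "zero_optimization": {"stage": 1},
        # "auto" defers these values to the Hugging Face Trainer integration.
        "bf16": {"enabled": "auto"},
        "train_micro_batch_size_per_gpu": "auto",
        "gradient_accumulation_steps": "auto",
    }


def write_config(path):
    """Write the sketch config to `path` as JSON."""
    with open(path, "w") as f:
        json.dump(make_zero1_config(), f, indent=2)
```

Calling `write_config` with a path of your choice produces a file that could be passed to the --deepspeed flag in place of deepspeed/zero1.json, though the file shipped with the repository is the safer choice.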