* add phi full ft example * Add readme to point out that deepspeed should be used * zero1 is better than zero2 for phi