* simplify the example configs to be more minimal and less daunting * drop empty s2_attention from example yamls
Qwen
TODO
Qwen2 MoE
✅ multipack ✅ qwen2_moe 4-bit QLoRA ✅ qwen2_moe 16-bit LoRA ❓ qwen2_moe 8-bit LoRA
* simplify the example configs to be more minimal and less daunting * drop empty s2_attention from example yamls
TODO
✅ multipack ✅ qwen2_moe 4-bit QLoRA ✅ qwen2_moe 16-bit LoRA ❓ qwen2_moe 8-bit LoRA