Files
axolotl/examples/qwen3-next/qwen3-next-80b-a3b-qlora.yaml
miketung 33975ce4bc feat(qwen3-next): Adds targeting of shared expert and attention modules (#3183)
* Adds targetting of shared expert and attention modules in each layer

* Update VRAM usage

---------

Co-authored-by: Mike Tung <mike@diffbot.com>
2025-09-25 17:06:16 +07:00

1.3 KiB