VED
|
9e64c76326
|
qwen3.5 configs (#3554) [skip ci]
* qwen3.5 configs
* update shared experts readme
|
2026-04-01 09:19:31 -04:00 |
|
Owen Arliawan
|
c57acef2c7
|
Qwen3.5-MoE example config with lora_target_modules regex (#3515) [skip ci]
* lora target modules with regex
* updates
* fsdp for non moe
* update wording
* chore: cleanup and lint
* chore: cleanup docs from merge
---------
Co-authored-by: NanoCode012 <nano@axolotl.ai>
|
2026-03-20 16:52:46 +07:00 |
|
VED
|
113d275bd9
|
qwen docs + new config (#3499) [skip ci]
* qwen docs + new config
* docss lint
* simplify comments
* read me
* lint comments
* Update docs/multimodal.qmd
* Update docs/multimodal.qmd
* Update examples/qwen3.5/9b-fft-vision.yaml
* chore: fix link and incorrect points
---------
Co-authored-by: NanoCode012 <kevinvong@rocketmail.com>
Co-authored-by: NanoCode012 <nano@axolotl.ai>
|
2026-03-20 16:13:34 +07:00 |
|
VED
|
c119382337
|
add: qwen 3.5 (#3442)
* add: qwen 3.5
* test for qwen , patch
* lint
* qwen3 fix on main
* Apply suggestions from code review
Co-authored-by: NanoCode012 <kevinvong@rocketmail.com>
* moe config
* config moe
* configs and chore
* Update examples/qwen3.5/122b-a10b-moe-qlora.yaml
Co-authored-by: NanoCode012 <kevinvong@rocketmail.com>
* Update examples/qwen3.5/35b-a3b-moe-qlora.yaml
Co-authored-by: NanoCode012 <kevinvong@rocketmail.com>
* chore for qwen + vlm patch
* chore lint
* qwen lint
* 3_5_moe
* Update examples/qwen3.5/README.md
---------
Co-authored-by: NanoCode012 <kevinvong@rocketmail.com>
|
2026-03-06 09:31:00 -05:00 |
|