NanoCode012
6a8baf8fa7
feat: add sonicmoe (#3411)
* feat: add sonicmoe
* feat: add torch compile for routing
* feat: add routing smoke test
* feat: add qwen3_5_moe, qwen3_vl_moe, qwen3_omni_moe
* fix: disable mlp kernel for sonicmoe too
* feat: update to sonicmoe release
* chore: update import following new sonicmoe changes
* feat: update handling for blackwell
* feat: add sonicmoe e2e test
* fix: installation for updated sonicmoe
* fix: git commit
* fix: ignore py req and fix metadata
* fix: increase min hidden size to match sonicmoe kernel min
* fix: attempt properly interleave and handle unpatch mid-test
* chore: refactor teardown better
* chore: refactor to re-use rearrange
* fix: add idempotency guard
* fix: address comments on CI memory and interleave
* fix: tests grad, param doublewrapped
2026-03-05 13:43:31 -05:00