Commit Graph

2 Commits

Author SHA1 Message Date
Wing Lian
66fea258c7 add correctness unit tests and benchmarks for scattermoe + lora
2026-03-19 06:40:04 +00:00
Wing Lian
163bd4dd5a use custom triton kernels for entropy from logits and selective softmax (#3510)
* use custom triton kernels for entropy from logits and selective softmax

* PR comments fixes

* fix out of bounds, include tests, include benchmarks

* chore: lint
2026-03-19 02:02:43 -04:00