Tensor parallel w DeepSpeed AutoTP (#2574)

* support for deepspeed autotup

* bump to latest deepspeed that supports deepcompile too

* add deepcompile support too

* fix total steps calculation for TP

* setup fixture for tp

* update ds config to ensure weights are gathered for checkpoint

* fix duplicate validation names

* chore: lint
This commit is contained in:
Wing Lian
2025-07-14 21:33:48 -04:00
committed by GitHub
parent 5cc16040a8
commit cd079b5536
5 changed files with 85 additions and 4 deletions

View File

@@ -121,7 +121,7 @@ extras_require = {
"yunchang==0.6.0",
],
"deepspeed": [
"deepspeed==0.17.1",
"deepspeed==0.17.2",
"deepspeed-kernels",
],
"mamba-ssm": [