Tensor parallel with DeepSpeed AutoTP (#2574)

* support for deepspeed autotp

* bump to latest deepspeed that supports deepcompile too

* add deepcompile support too

* fix total steps calculation for TP

* setup fixture for tp

* update ds config to ensure weights are gathered for checkpoint

* fix duplicate validation names

* chore: lint
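
The "fix total steps calculation for TP" item can be illustrated with a minimal sketch. It assumes that ranks in the same tensor-parallel group consume the same batch, so only `world_size // tensor_parallel_size` ranks contribute distinct samples per step. The function name and parameters below are hypothetical and are not taken from this commit's code.

```python
import math


def total_training_steps(
    num_samples: int,
    micro_batch_size: int,
    grad_accum_steps: int,
    num_epochs: int,
    world_size: int,
    tensor_parallel_size: int = 1,
) -> int:
    """Estimate optimizer steps when some ranks are used for tensor parallelism.

    Hypothetical helper: assumes every rank in a TP group sees the same
    data, so the data-parallel degree is world_size // tensor_parallel_size.
    """
    if world_size % tensor_parallel_size != 0:
        raise ValueError("world_size must be divisible by tensor_parallel_size")

    # Only the data-parallel replicas process distinct samples.
    data_parallel_size = world_size // tensor_parallel_size

    # Samples consumed by one optimizer step across all replicas.
    samples_per_step = micro_batch_size * grad_accum_steps * data_parallel_size

    # Round up so a trailing partial batch still counts as a step.
    steps_per_epoch = math.ceil(num_samples / samples_per_step)
    return steps_per_epoch * num_epochs
```

Under these assumptions, raising `tensor_parallel_size` on a fixed number of GPUs lowers the data-parallel degree and therefore increases the step count; dividing by the full `world_size` would underestimate it.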
Wing Lian
2025-07-14 21:33:48 -04:00
committed by GitHub
parent 5cc16040a8
commit cd079b5536
5 changed files with 85 additions and 4 deletions

@@ -65,6 +65,7 @@ def fixture_base_cfg():
         "dataloader_pin_memory": True,
         "dataloader_prefetch_factor": 2,
         "sequence_parallel_degree": 1,
+        "tensor_parallel_size": 1,
         # Dtype
         "fp16": False,
         "bf16": False,