Gilles Turpin
|
4b8bc52424
|
fix: correct total_num_steps and batch_size calculation with context parallelism (#3444)
* fix: correct total_num_steps and batch_size calculation with context parallelism
* feat: add test for CP batch size
---------
Co-authored-by: NanoCode012 <nano@axolotl.ai>
|
2026-03-05 12:33:28 -05:00 |
|