Dan Saunders
59cd472504
SP cu_seqlens fix, refactor (#2495)
* working on masking fix
* refactor and fix multipack seqlens
* pre-commit fix
* adding smoke test
* using existing packed seqlens util
* log warning re: logged losses / gradient scaling per rank
2025-04-07 14:47:57 -04:00