Dan Saunders
59cd472504
SP cu_seqlens fix, refactor (#2495)
* working on masking fix
* refactor and fix multipack seqlens
* pre-commit fix
* adding smoke test
* using existing packed seqlens util
* log warning re: logged losses / gradient scaling per rank
2025-04-07 14:47:57 -04:00