Dan Saunders
23f0c51d88
Sequence parallelism (#2412)
* adding easy_context as integration for now
* progress on ring attn impl
* progress on ring attn impl
* cleanup
* remove errant file
* fix req
* removing unused code
* updates
* pytest
* update
* updates
* fixes
* precommit fixes
* working multi-group SP
* fixing sample packing
* remove debug logs and simplify
* eval dataloader and sampler changes
* removing some obvious comments
* update config.qmd and rename option
* scoping down problematic import
* another import scoping change
* pernicious Fire CLI bugfix
* isolate cli tests
* actually isolate CLI tests
* gracefully handle no ring-flash-attn
* fix
* fix
* move ring flash attn to extras with flash-attn (#2414)
* removing flash-attn from requirements.txt (in setup.py extras already)
* rename file, delete another
* using field validator instead of model validator
* test fix
* sampler / dataloader refactor
* non-seq2seq collator fix
* removing print statement
* bugfix
* add SP doc, review comments
* small changes
* review comments, docstrings
* refactors, SP mixin
* small updates
* fix tests
* precommit
* precommit
---------
Co-authored-by: Wing Lian <wing.lian@gmail.com>
Co-authored-by: Dan Saunders <dan@axolotl.ai>
2025-03-21 12:43:55 -04:00