axolotl

Files

Wing Lian 69cd49a7aa update transformers to 4.53.1 (#2844 ) [skip ci]

* update transformers to 4.53.0

* remove attention_mask from signature columns if using packing

* remove attention_mask column from dataloader

* update signature of flash attn forward for ring attn patch

* fix FSDP

* patch ring-flash-attn with upstream signature fix

* fix patch indentation level

* fix the patch

* add batch flattening smoke test with loss check that works in older transformers

* fix patch

* don't drop attention mask for flex

* more fixes

* patch create_causal_mask for packing w flex

* global torch manual_seed fixture

* tweak loss checks

* fix patch and use single batch for flex

* don't need to reload

* fix causal mask patch

* use transformers patch releasE

* make sure env var is string

* make sure to drop attention mask for flex w packing for latest transformers patch release

* tweak loss

* guard on signature columns before removing attention mask

* bump loss

* set remove isn't chainable

* skip slow mistral test in 2.5.1

2025-07-07 09:35:22 -04:00

__init__.py

Various fixes for CI, save_only_model for RL, prevent packing multiprocessing deadlocks (#2661 )

2025-05-12 10:51:18 -04:00

cicd.sh

Various fixes for CI, save_only_model for RL, prevent packing multiprocessing deadlocks (#2661 )

2025-05-12 10:51:18 -04:00

cleanup.py

Various fixes for CI, save_only_model for RL, prevent packing multiprocessing deadlocks (#2661 )

2025-05-12 10:51:18 -04:00

cleanup.sh

Various fixes for CI, save_only_model for RL, prevent packing multiprocessing deadlocks (#2661 )