axolotl/docs/images/4d-mask.png at 05f70342883fdb75fb36edd09f5e8a72a25be3e9

Wing Lian 00568c1539 support for true batches with multipack (#1230)

* support for true batches with multipack

* patch the map dataset fetcher to handle batches with packed indexes

* patch 4d mask creation for sdp attention

* better handling for BetterTransformer

* patch general case for 4d mask

* setup forward patch. WIP

* fix patch file

* support for multipack w/o flash attention for llama

* cleanup

* add warning about bf16 vs fp16 for multipack with sdpa

* bugfixes

* add 4d multipack tests, refactor patches

* update tests and add warnings

* fix e2e file check

* skip sdpa test if not at least torch 2.1.1, update docs

2024-02-01 10:18:42 -05:00

239 KiB

1200x523px

/tocmo0nlord/axolotl/raw/commit/05f70342883fdb75fb36edd09f5e8a72a25be3e9/docs/images/4d-mask.png
