axolotl/examples/llama-2/fft_optimized.yml at afedc470bd77bbd3713d480ac9d5e3d3625d522a

Files

Casper 15d3a654bf Implement fused modules (#747 )

* MLP: Memory saving

* Remove RMSNorm restrictions

* Map packed weights to original

* FusedAttention module

* Simplify code

* Move fused modules

* Fix critical typo

* Split inplace

* Add FFT config

* Add validation of fused arguments

* Add fused arguments to config

* Update docs

* Fix validation logic

* Add fused modules to flash attn

* Only fuse during training

* Remove timing

* Formatting

* Formatting

* Formatting

* chore: lint

* chore: lint

* add e2e tests for fused llama

* no lora for tests

---------

Co-authored-by: Wing Lian <wing.lian@gmail.com>

2023-10-21 16:08:25 -04:00

1.3 KiB

Raw Blame History

View Raw

1.3 KiB Raw Blame History

1.3 KiB

Raw Blame History