Wing Lian
|
6dc68a653f
|
use temp_dir kwarg instead
|
2023-11-06 18:33:01 -05:00 |
|
Wing Lian
|
7de6a5639c
|
missing dunder-init
|
2023-11-06 18:33:01 -05:00 |
|
Wing Lian
|
c74f045ba7
|
chore: lint
|
2023-11-06 18:33:01 -05:00 |
|
Wing Lian
|
0402d19759
|
make sure to cleanup tmp output_dir for e2e tests
|
2023-11-06 18:33:01 -05:00 |
|
Wing Lian
|
2d8def68dc
|
simplify by removing duplicate base_model_config (#772)
|
2023-10-23 01:42:38 -04:00 |
|
Wing Lian
|
21cf09b608
|
remove lora fused packing test (#758)
|
2023-10-21 22:59:35 -04:00 |
|
Casper
|
15d3a654bf
|
Implement fused modules (#747)
* MLP: Memory saving
* Remove RMSNorm restrictions
* Map packed weights to original
* FusedAttention module
* Simplify code
* Move fused modules
* Fix critical typo
* Split inplace
* Add FFT config
* Add validation of fused arguments
* Add fused arguments to config
* Update docs
* Fix validation logic
* Add fused modules to flash attn
* Only fuse during training
* Remove timing
* Formatting
* Formatting
* Formatting
* chore: lint
* chore: lint
* add e2e tests for fused llama
* no lora for tests
---------
Co-authored-by: Wing Lian <wing.lian@gmail.com>
|
2023-10-21 16:08:25 -04:00 |
|