Files
axolotl/docs
Wing Lian e4063d60a7 bump transformers and set roundup_power2_divisions for more VRAM improvements, low bit ao optimizers (#1769)
* bump transformers and set roundup_power2_divisions for more VRAM improvements

* support for low bit optimizers from torch ao

* fix check for alternate optimizers and use nous models on hf for llama3

* add missing check for ao_adamw_fp8

* fix check when using custom optimizers w adamw
2024-07-19 00:47:07 -04:00
..
2024-07-11 09:19:29 -04:00
2024-07-11 09:19:29 -04:00
2024-07-11 09:19:29 -04:00
2024-04-27 12:07:06 -04:00
2024-07-18 14:54:41 -04:00