axolotl/docker/Dockerfile-cloud at 1f09f48d8fa5f244aef05461b0b2d557860875da

Wing Lian 3ebf22464b qlora-fsdp ram efficient loading with hf trainer (#1791)

* fix 405b with lower cpu ram requirements

* make sure to use double quant and only skip output embeddings

* set model attributes

* more fixes for sharded fsdp loading

* update the base model in example to use pre-quantized nf4-bf16 weights

* upstream fixes for qlora+fsdp

2024-07-30 19:21:38 -04:00

970 B
