axolotl

Files

Wing Lian 3ebf22464b qlora-fsdp ram efficient loading with hf trainer (#1791)

* fix 405b with lower cpu ram requirements

* make sure to use double quant and only skip output embeddings

* set model attributes

* more fixes for sharded fsdp loading

* update the base model in example to use pre-quantized nf4-bf16 weights

* upstream fixes for qlora+fsdp

2024-07-30 19:21:38 -04:00

Dockerfile

update test and main/nightly builds (#1797)

2024-07-30 12:37:40 -04:00

Dockerfile-base

fix dockerfile and base builder (#1795) [skip-ci]

2024-07-30 08:34:37 -04:00

Dockerfile-cloud

qlora-fsdp ram efficient loading with hf trainer (#1791)

2024-07-30 19:21:38 -04:00

Dockerfile-cloud-no-tmux

qlora-fsdp ram efficient loading with hf trainer (#1791)

2024-07-30 19:21:38 -04:00

Dockerfile-tests

github urls (#1734)

2024-07-11 09:19:29 -04:00