axolotl

Files

NanoCode012 669f1d052c Fix: Higher vram usage for mistral and sample_packing (#691 )

* Fix: Higher vram usage for mistral and sample_packing

* chore: update comment

* chore: lint

2023-10-06 12:33:43 -04:00

config.yml

2023-10-02 21:07:24 -04:00

qlora.yml

2023-10-06 12:33:43 -04:00

README.md

2023-09-28 10:24:56 -04:00

Mistral 7B is a language model with a total of 7.3 billion parameters, showcasing a notable performance across a variety of benchmarks.

Fine Tune:

accelerate launch -m axolotl.cli.train examples/mistral/config.yml

If you run into CUDA OOM, use deepspeed with config zero2.json:

accelerate launch -m axolotl.cli.train examples/mistral/config.yml --deepspeed deepspeed/zero2.json