Mistral 7B is a language model with 7.3 billion parameters, showing strong performance across a variety of benchmarks.
Fine Tune:
accelerate launch -m axolotl.cli.train examples/mistral/config.yml
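The config file passed above drives the whole run. As a rough illustration only (the exact keys and values should be taken from the shipped `examples/mistral/config.yml`, not from this sketch), a minimal axolotl config for Mistral might look like:

```yaml
# Illustrative sketch — consult examples/mistral/config.yml for the real values.
base_model: mistralai/Mistral-7B-v0.1   # assumed base checkpoint
model_type: MistralForCausalLM
tokenizer_type: LlamaTokenizer

datasets:
  - path: mhenrichsen/alpaca_2k_test    # hypothetical dataset path
    type: alpaca

sequence_len: 8192
micro_batch_size: 2
gradient_accumulation_steps: 4
num_epochs: 4
optimizer: adamw_bnb_8bit
lr_scheduler: cosine
learning_rate: 0.000005
bf16: auto
flash_attention: true
```

Any field here can be overridden to suit your dataset and hardware; the values shown are placeholders, not recommendations.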
If you run into CUDA OOM errors, use DeepSpeed with the zero2.json config:
accelerate launch -m axolotl.cli.train examples/mistral/config.yml --deepspeed deepspeed_configs/zero2.json
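The zero2.json config enables DeepSpeed ZeRO stage 2, which shards optimizer states and gradients across GPUs to cut per-device memory. A minimal sketch of what such a config typically contains (the repository's `deepspeed_configs/zero2.json` is the authoritative version; field values below are illustrative):

```json
{
  "zero_optimization": {
    "stage": 2,
    "overlap_comm": true,
    "contiguous_gradients": true
  },
  "bf16": {
    "enabled": "auto"
  },
  "train_micro_batch_size_per_gpu": "auto",
  "gradient_accumulation_steps": "auto"
}
```

The `"auto"` values let the training framework fill in settings from the axolotl config, so the two files do not need to be kept in sync by hand.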