axolotl

Files

Wing Lian c67910fa6f bump hf deps (#2735 ) [skip ci]

* bump hf deps

* upgrade liger-kernel too

* install cce from fork for transformers fix

* fix reference to vocab size in gemma3 patch

* use padding_idx instead of pad_token_id

* remove fixed gemma3 patch

* use updated cce fork

* fix local mllama cce patches w docstring

* add test for multipack with trainer setup and fix trainer for trainer refactor upstream

* bump modal version

* guard for iterable datasetS

* mllama model arch layout changed in latest transformers

* fix batch sampler with drop_last

* fix: address upstream vlm changes for lora

* fix: update references to old lora target path

* fix: remove mllama fa2 patch due to upstream fix

* fix: lora kernel patch path for multimodal models

* fix: removed mllama from quarto

* run test for came optim on 2.6.0+

* fix fsdp2 patch and remove deprecated patch

* make sure to set sequence_parallel_degree for grpo

* Add SP test for GRPO

* add sp to grpo config for trainer

* use reward_funcs as kwarg to grpo trainer

* fix the comprehension for reward funcs

* reward funcs already passed in as args

* init sp_group right before training

* fix check for adding models to SP context

* make sure to pass args to super

* upgrade deepspeed

* use updated trl and add reasoning flags for vllm

* patch the worker

---------

Co-authored-by: NanoCode012 <nano@axolotl.ai>

2025-06-05 07:20:33 -07:00

bigstral-ds-zero3.yaml

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

config.yml

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

lora-mps.yml

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

lora.yml

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

mistral-dpo-qlora.yml

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

mistral-qlora-fsdp.yml

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

mistral-qlora-orpo.yml

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

mistral-small-3.1-24B-lora.yml

bump hf deps (#2735 ) [skip ci]

2025-06-05 07:20:33 -07:00

mixtral_22.yml

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

mixtral-8x22b-qlora-fsdp.yml

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

mixtral-qlora-fsdp.yml

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

mixtral.yml

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

qlora.yml

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

README.md

Mixtral fixes 20240124 (#1192 ) [skip ci]

2024-01-24 14:59:57 -05:00

README.md

Mistral 7B is a language model with a total of 7.3 billion parameters, showcasing a notable performance across a variety of benchmarks.

Fine Tune:

accelerate launch -m axolotl.cli.train examples/mistral/config.yml

If you run into CUDA OOM, use deepspeed with config zero2.json:

accelerate launch -m axolotl.cli.train examples/mistral/config.yml --deepspeed deepspeed_configs/zero2.json