axolotl

Go to file

NanoCode012 dfba881e99 Feat: add gemma3n support (#2852 )

* feat: add gemma3n cce

* feat: add sample config

* feat: add gemma3n multimodal mode

* feat: add audio example

* feat: support audio and return pixel values in collator

* feat: support unmask only assistant region (gemma3n for now)

* feat(doc): add notes for audio loading

* feat: add audio support for gemma3n

* feat: update examples

* feat: add gemma3n to the docs

* fix: add link at top

* feat(doc): clarify additional requirements

* fix: mllama missing aspect ratio

* fix: mllama need attention fixes for fa2

* Partially Revert "fix: mllama need attention fixes for fa2"

This reverts commit a0bfdd1777.

* fix: disable FA2 for mllama in vision mode

* feat: update configs to use proper attention

* fix: support other vision features

* feat(doc): clarify requirements for gemma3n

2025-07-22 16:52:15 +07:00

.github

include torchvision in build for upstream changes requiring it now (#2953 ) [skip ci]

2025-07-22 04:19:16 -04:00

.runpod

[doc] Fix docs for text field mapping for completion datasets (#2890 )

2025-07-09 14:52:44 -04:00

.vscode

feat: enable trl's autounwrap (#1060 )

2024-01-11 08:43:41 -05:00

cicd

add additional packages via apt for better multi-node support (#2949 )

2025-07-20 21:19:23 -04:00

deepspeed_configs

fix deprecate deepspeed stage3_gather_16bit_weights_on_model_save arg (#2956 ) [skip ci]

2025-07-21 11:41:31 -04:00

devtools

remove fastchat and sharegpt (#2021 )

2024-11-08 13:45:49 -05:00

docker

Fix cloud docker image build and remove apt files for optim (#2961 )

2025-07-21 11:05:00 -04:00

docs

Feat: add gemma3n support (#2852 )

2025-07-22 16:52:15 +07:00

examples

Feat: add gemma3n support (#2852 )

2025-07-22 16:52:15 +07:00

image

Readme updates v2 (#2078 )

2024-11-18 14:58:03 -05:00

scripts

upstream fixes in cce for dora and tensor paralel support (#2960 ) [skip ci]

2025-07-21 11:41:53 -04:00

src

Feat: add gemma3n support (#2852 )

2025-07-22 16:52:15 +07:00

tests

make pad_to_sequence_len default to the same value as sample_packing (#2941 ) [skip ci]

2025-07-21 11:40:56 -04:00

_quarto.yml

Activation Offloading w CUDA Streams (#2900 ) [skip ci]

2025-07-14 20:10:20 -04:00

.bandit

chore: update pre-commit hooks (#2870 ) [skip ci]

2025-07-07 15:26:15 -04:00

.coderabbit.yaml

coderabbit manual settings (#2940 ) [skip ci]

2025-07-17 15:32:16 -04:00

.coveragerc

adding codecov reporting (#2372 ) [skip ci]

2025-04-16 15:02:17 -07:00

.editorconfig

WIP for axolotl trainer

2023-04-14 00:20:05 -04:00

.flake8

Update ignores

2023-05-31 02:53:22 +09:00

.gitattributes

make it work with pythia in the cloud

2023-04-14 07:24:55 -04:00

.gitignore

Autodoc generation with quartodoc (#2419 )

2025-03-21 12:26:47 -04:00

.isort.cfg

fix: minor patches for multimodal (#2441 )

2025-03-31 13:40:12 +07:00

.mypy.ini

Liger Kernel integration (#1861 )

2024-08-23 12:21:51 -04:00

.pre-commit-config.yaml

chore: update pre-commit hooks (#2870 ) [skip ci]

2025-07-07 15:26:15 -04:00

.pylintrc

Fixing OSX installation (#2231 )

2025-01-07 13:42:01 +00:00

CNAME

feat: add CNAME (#2513 )

2025-04-10 12:34:25 +07:00

codecov.yml

misc fixes 202507 (#2937 ) [skip ci]

2025-07-17 09:47:45 -04:00

docker-compose.yaml

add git environment variables to compose: avoid checkout failure error 128 on build (#534 )

2023-09-08 15:59:49 -04:00

FAQS.md

Update FAQS.md

2023-06-10 23:36:14 -07:00

favicon.jpg

update favicon (#2801 )

2025-06-17 18:09:24 -04:00

index.qmd

Feat(doc): Reorganize documentation, fix broken syntax, update notes (#2348 )

2025-02-25 16:09:37 +07:00

LICENSE

add apache 2.0 license

2023-07-21 09:49:29 -04:00

MANIFEST.in

manage jinja templates as nicely formatted files (#2795 )

2025-07-07 10:11:48 -04:00

pyproject.toml

chore: update doc links (#2509 )

2025-04-11 09:53:18 -04:00

README.md

release v0.11.0 (#2875 )

2025-07-09 09:22:35 -04:00

requirements-dev.txt

adding codecov reporting (#2372 ) [skip ci]

2025-04-16 15:02:17 -07:00

requirements-tests.txt

Codecov fixes / improvements (#2549 )

2025-04-23 10:33:30 -04:00

requirements.txt

bump accelerate to 1.9.0 (#2936 ) [skip ci]

2025-07-17 09:46:43 -04:00

setup.py

Tensor parallel w DeepSpeed AutoTP (#2574 )

2025-07-14 21:33:48 -04:00

styles.css

Autodoc generation with quartodoc (#2419 )

2025-03-21 12:26:47 -04:00

TODO.md

fdsp config dict fix, todo list, add torchdistx support

2023-04-30 13:32:07 -04:00

README.md

🎉 Latest Updates

2025/06: Magistral with mistral-common tokenizer support has been added to Axolotl. See examples to start training your own Magistral models with Axolotl!
2025/05: Quantization Aware Training (QAT) support has been added to Axolotl. Explore the docs to learn more!
2025/04: Llama 4 support has been added in Axolotl. See examples to start training your own Llama 4 models with Axolotl's linearized version!
2025/03: Axolotl has implemented Sequence Parallelism (SP) support. Read the blog and docs to learn how to scale your context length when fine-tuning.
2025/03: (Beta) Fine-tuning Multimodal models is now supported in Axolotl. Check out the docs to fine-tune your own!
2025/02: Axolotl has added LoRA optimizations to reduce memory usage and improve training speed for LoRA and QLoRA in single GPU and multi-GPU training (DDP and DeepSpeed). Jump into the docs to give it a try.
2025/02: Axolotl has added GRPO support. Dive into our blog and GRPO example and have some fun!
2025/01: Axolotl has added Reward Modelling / Process Reward Modelling fine-tuning support. See docs.

✨ Overview

Axolotl is a tool designed to streamline post-training for various AI models.

Features:

Multiple Model Support: Train various models like LLaMA, Mistral, Mixtral, Pythia, and more. We are compatible with HuggingFace transformers causal language models.
Training Methods: Full fine-tuning, LoRA, QLoRA, GPTQ, QAT, Preference Tuning (DPO, IPO, KTO, ORPO), RL (GRPO), Multimodal, and Reward Modelling (RM) / Process Reward Modelling (PRM).
Easy Configuration: Re-use a single YAML file between dataset preprocess, training, evaluation, quantization, and inference.
Performance Optimizations: Multipacking, Flash Attention, Xformers, Flex Attention, Liger Kernel, Cut Cross Entropy, Sequence Parallelism (SP), LoRA optimizations, Multi-GPU training (FSDP1, FSDP2, DeepSpeed), Multi-node training (Torchrun, Ray), and many more!
Flexible Dataset Handling: Load from local, HuggingFace, and cloud (S3, Azure, GCP, OCI) datasets.
Cloud Ready: We ship Docker images and also PyPI packages for use on cloud platforms and local hardware.

🚀 Quick Start

Requirements:

NVIDIA GPU (Ampere or newer for bf16 and Flash Attention) or AMD GPU
Python 3.11
PyTorch ≥2.6.0

Installation

Using pip

pip3 install -U packaging==23.2 setuptools==75.8.0 wheel ninja
pip3 install --no-build-isolation axolotl[flash-attn,deepspeed]

# Download example axolotl configs, deepspeed configs
axolotl fetch examples
axolotl fetch deepspeed_configs  # OPTIONAL

Using Docker

Installing with Docker can be less error prone than installing in your own environment.

docker run --gpus '"all"' --rm -it axolotlai/axolotl:main-latest

Other installation approaches are described here.

Your First Fine-tune

# Fetch axolotl examples
axolotl fetch examples

# Or, specify a custom path
axolotl fetch examples --dest path/to/folder

# Train a model using LoRA
axolotl train examples/llama-3/lora-1b.yml

That's it! Check out our Getting Started Guide for a more detailed walkthrough.

📚 Documentation

Installation Options - Detailed setup instructions for different environments
Configuration Guide - Full configuration options and examples
Dataset Loading - Loading datasets from various sources
Dataset Guide - Supported formats and how to use them
Multi-GPU Training
Multi-Node Training
Multipacking
API Reference - Auto-generated code documentation
FAQ - Frequently asked questions

🤝 Getting Help

Join our Discord community for support
Check out our Examples directory
Read our Debugging Guide
Need dedicated support? Please contact ✉️wing@axolotl.ai for options

🌟 Contributing

Contributions are welcome! Please see our Contributing Guide for details.

❤️ Sponsors

Thank you to our sponsors who help make Axolotl possible:

Modal - Modal lets you run jobs in the cloud, by just writing a few lines of Python. Customers use Modal to deploy Gen AI models at large scale, fine-tune large language models, run protein folding simulations, and much more.

Interested in sponsoring? Contact us at wing@axolotl.ai

📜 License

This project is licensed under the Apache 2.0 License - see the LICENSE file for details.