axolotl

Go to file

Carsten Kragelund Jørgensen eb3a57eb17 Ignore generation/endgeneration tags when analyzing Jinja chat template (#2787 )

* ignore generation/endgeneration tags

Axolotl handles calculating the mask for assistant turns on its own, and as such these tags are not needed, however currently the analyzer does not recognize them at all and throws an error.

* feat: add phi4 tokenizer test and unblock gemma2

* fix: improve template

* chore: refactor

* chore: lint

---------

Co-authored-by: NanoCode012 <nano@axolotl.ai>
Co-authored-by: Wing Lian <wing@axolotl.ai>

2025-06-18 15:59:07 -04:00

.github

Config doc autogen: follow-up fix docs build (#2806 )

2025-06-18 15:42:54 -04:00

.runpod

Config doc autogen (#2718 )

2025-06-18 15:36:53 -04:00

.vscode

feat: enable trl's autounwrap (#1060 )

2024-01-11 08:43:41 -05:00

cicd

bump transformers==4.52.4 (#2800 ) [skip ci]

2025-06-18 15:46:14 -04:00

deepspeed_configs

KD fix w/ online distillation (#2700 ) [skip ci]

2025-06-17 12:09:13 -04:00

devtools

remove fastchat and sharegpt (#2021 )

2024-11-08 13:45:49 -05:00

docker

build base images for torch 2.7.1 (#2764 )

2025-06-11 17:11:06 -04:00

docs

Fix: logging on py310 (#2802 )

2025-06-18 15:46:27 -04:00

examples

fixed the lora_target_modules syntax (#2793 )

2025-06-15 16:47:02 -04:00

image

Readme updates v2 (#2078 )

2024-11-18 14:58:03 -05:00

scripts

add uv tooling for e2e gpu tests (#2750 )

2025-06-05 07:25:06 -07:00

src

Ignore generation/endgeneration tags when analyzing Jinja chat template (#2787 )

2025-06-18 15:59:07 -04:00

tests

Ignore generation/endgeneration tags when analyzing Jinja chat template (#2787 )

2025-06-18 15:59:07 -04:00

_quarto.yml

Config doc autogen (#2718 )

2025-06-18 15:36:53 -04:00

.bandit

Add bandit

2023-05-31 02:53:53 +09:00

.coveragerc

adding codecov reporting (#2372 ) [skip ci]

2025-04-16 15:02:17 -07:00

.editorconfig

WIP for axolotl trainer

2023-04-14 00:20:05 -04:00

.flake8

Update ignores

2023-05-31 02:53:22 +09:00

.gitattributes

make it work with pythia in the cloud

2023-04-14 07:24:55 -04:00

.gitignore

Autodoc generation with quartodoc (#2419 )

2025-03-21 12:26:47 -04:00

.isort.cfg

fix: minor patches for multimodal (#2441 )

2025-03-31 13:40:12 +07:00

.mypy.ini

Liger Kernel integration (#1861 )

2024-08-23 12:21:51 -04:00

.pre-commit-config.yaml

chore: update pre-commit hooks (#2745 )

2025-06-02 15:54:29 -07:00

.pylintrc

Fixing OSX installation (#2231 )

2025-01-07 13:42:01 +00:00

CNAME

feat: add CNAME (#2513 )

2025-04-10 12:34:25 +07:00

codecov.yml

update doc and use P2P=LOC for brittle grpo test (#2649 )

2025-05-12 14:17:25 -04:00

docker-compose.yaml

add git environment variables to compose: avoid checkout failure error 128 on build (#534 )

2023-09-08 15:59:49 -04:00

FAQS.md

Update FAQS.md

2023-06-10 23:36:14 -07:00

favicon.jpg

update favicon (#2801 )

2025-06-17 18:09:24 -04:00

index.qmd

Feat(doc): Reorganize documentation, fix broken syntax, update notes (#2348 )

2025-02-25 16:09:37 +07:00

LICENSE

add apache 2.0 license

2023-07-21 09:49:29 -04:00

MANIFEST.in

fix build w pyproject to respect insalled torch version (#2168 )

2024-12-10 16:25:25 -05:00

pyproject.toml

chore: update doc links (#2509 )

2025-04-11 09:53:18 -04:00

README.md

Config doc autogen (#2718 )

2025-06-18 15:36:53 -04:00

requirements-dev.txt

adding codecov reporting (#2372 ) [skip ci]

2025-04-16 15:02:17 -07:00

requirements-tests.txt

Codecov fixes / improvements (#2549 )

2025-04-23 10:33:30 -04:00

requirements.txt

bump transformers==4.52.4 (#2800 ) [skip ci]

2025-06-18 15:46:14 -04:00

setup.py

bump transformers==4.52.4 (#2800 ) [skip ci]

2025-06-18 15:46:14 -04:00

styles.css

Autodoc generation with quartodoc (#2419 )

2025-03-21 12:26:47 -04:00

TODO.md

fdsp config dict fix, todo list, add torchdistx support

2023-04-30 13:32:07 -04:00

README.md

🎉 Latest Updates

2025/06: Magistral with mistral-common tokenizer support has been added to Axolotl. See examples to start training your own Magistral models with Axolotl!
2025/05: Quantization Aware Training (QAT) support has been added to Axolotl. Explore the docs to learn more!
2025/04: Llama 4 support has been added in Axolotl. See examples to start training your own Llama 4 models with Axolotl's linearized version!
2025/03: Axolotl has implemented Sequence Parallelism (SP) support. Read the blog and docs to learn how to scale your context length when fine-tuning.
2025/03: (Beta) Fine-tuning Multimodal models is now supported in Axolotl. Check out the docs to fine-tune your own!
2025/02: Axolotl has added LoRA optimizations to reduce memory usage and improve training speed for LoRA and QLoRA in single GPU and multi-GPU training (DDP and DeepSpeed). Jump into the docs to give it a try.
2025/02: Axolotl has added GRPO support. Dive into our blog and GRPO example and have some fun!
2025/01: Axolotl has added Reward Modelling / Process Reward Modelling fine-tuning support. See docs.

✨ Overview

Axolotl is a tool designed to streamline post-training for various AI models.

Features:

Multiple Model Support: Train various models like LLaMA, Mistral, Mixtral, Pythia, and more. We are compatible with HuggingFace transformers causal language models.
Training Methods: Full fine-tuning, LoRA, QLoRA, GPTQ, QAT, Preference Tuning (DPO, IPO, KTO, ORPO), RL (GRPO), Multimodal, and Reward Modelling (RM) / Process Reward Modelling (PRM).
Easy Configuration: Re-use a single YAML file between dataset preprocess, training, evaluation, quantization, and inference.
Performance Optimizations: Multipacking, Flash Attention, Xformers, Flex Attention, Liger Kernel, Cut Cross Entropy, Sequence Parallelism (SP), LoRA optimizations, Multi-GPU training (FSDP1, FSDP2, DeepSpeed), Multi-node training (Torchrun, Ray), and many more!
Flexible Dataset Handling: Load from local, HuggingFace, and cloud (S3, Azure, GCP, OCI) datasets.
Cloud Ready: We ship Docker images and also PyPI packages for use on cloud platforms and local hardware.

🚀 Quick Start

Requirements:

NVIDIA GPU (Ampere or newer for bf16 and Flash Attention) or AMD GPU
Python 3.11
PyTorch ≥2.5.1

Installation

pip3 install -U packaging==23.2 setuptools==75.8.0 wheel ninja
pip3 install --no-build-isolation axolotl[flash-attn,deepspeed]

# Download example axolotl configs, deepspeed configs
axolotl fetch examples
axolotl fetch deepspeed_configs  # OPTIONAL

Other installation approaches are described here.

Your First Fine-tune

# Fetch axolotl examples
axolotl fetch examples

# Or, specify a custom path
axolotl fetch examples --dest path/to/folder

# Train a model using LoRA
axolotl train examples/llama-3/lora-1b.yml

That's it! Check out our Getting Started Guide for a more detailed walkthrough.

📚 Documentation

Installation Options - Detailed setup instructions for different environments
Configuration Guide - Full configuration options and examples
Dataset Loading - Loading datasets from various sources
Dataset Guide - Supported formats and how to use them
Multi-GPU Training
Multi-Node Training
Multipacking
API Reference - Auto-generated code documentation
FAQ - Frequently asked questions

🤝 Getting Help

Join our Discord community for support
Check out our Examples directory
Read our Debugging Guide
Need dedicated support? Please contact ✉️wing@axolotl.ai for options

🌟 Contributing

Contributions are welcome! Please see our Contributing Guide for details.

❤️ Sponsors

Thank you to our sponsors who help make Axolotl possible:

Modal - Modal lets you run jobs in the cloud, by just writing a few lines of Python. Customers use Modal to deploy Gen AI models at large scale, fine-tune large language models, run protein folding simulations, and much more.

Interested in sponsoring? Contact us at wing@axolotl.ai

📜 License

This project is licensed under the Apache 2.0 License - see the LICENSE file for details.