Axolotl

Axolotl is a tool designed to streamline post-training for various AI models. Post-training refers to any modifications or additional training performed on pre-trained models - including full model fine-tuning, parameter-efficient tuning (like LoRA and QLoRA), supervised fine-tuning (SFT), instruction tuning, and alignment techniques. With support for multiple model architectures and training configurations, Axolotl makes it easy to get started with these techniques.

Axolotl is designed to work with YAML config files that contain everything you need to preprocess a dataset, train or fine-tune a model, run model inference or evaluation, and much more.
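
For illustration, a training run is described entirely by such a file. The sketch below shows roughly what a minimal LoRA config can look like; the field names follow Axolotl's config schema, but the model id, dataset, and hyperparameter values are placeholders rather than a recommended recipe.

base_model: NousResearch/Meta-Llama-3.1-8B   # placeholder: any supported Hugging Face model id
load_in_8bit: true

datasets:
  - path: mhenrichsen/alpaca_2k_test          # placeholder: small alpaca-format dataset
    type: alpaca
val_set_size: 0.05

adapter: lora
lora_r: 16
lora_alpha: 32
lora_dropout: 0.05
lora_target_linear: true

sequence_len: 2048
micro_batch_size: 2
gradient_accumulation_steps: 4
num_epochs: 1
learning_rate: 0.0002
optimizer: adamw_torch
lr_scheduler: cosine

output_dir: ./outputs/lora-out

Anything not set in the file generally falls back to Axolotl's defaults, so configs can stay short.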

Features:

  • Train various Hugging Face models such as llama, pythia, falcon, and mpt
  • Supports full fine-tuning, lora, qlora, relora, and gptq
  • Customize configurations using a simple yaml file or CLI overrides
  • Load different dataset formats, use custom formats, or bring your own tokenized datasets (see the dataset sketch after this list)
  • Integrated with xformers, flash attention, liger kernel, rope scaling, and multipacking
  • Works with single GPU or multiple GPUs via FSDP or Deepspeed
  • Easily run with Docker locally or on the cloud
  • Log results and optionally checkpoints to wandb, mlflow, or Comet
  • And more!
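
As a rough sketch of the dataset handling mentioned above, the datasets section of a config can mix several sources and prompt formats; the paths below are placeholders, while format names such as alpaca and completion refer to built-in dataset types.

datasets:
  # instruction-style dataset hosted on the Hugging Face Hub (placeholder path)
  - path: mhenrichsen/alpaca_2k_test
    type: alpaca
  # local JSONL corpus treated as raw text completions (placeholder path)
  - path: ./data/corpus.jsonl
    ds_type: json
    type: completion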

🚀 Quick Start

Requirements:

  • NVIDIA GPU (Ampere or newer for bf16 and Flash Attention) or AMD GPU
  • Python 3.11
  • PyTorch ≥2.4.1

Installation

pip3 install -U packaging==23.2 setuptools==75.8.0 wheel ninja
pip3 install --no-build-isolation axolotl[flash-attn,deepspeed]

# Download example axolotl configs, deepspeed configs
axolotl fetch examples
axolotl fetch deepspeed_configs  # OPTIONAL

Other installation approaches are described here.

Your First Fine-tune

# Fetch axolotl examples
axolotl fetch examples

# Or, specify a custom path
axolotl fetch examples --dest path/to/folder

# Train a model using LoRA
axolotl train examples/llama-3/lora-1b.yml

That's it! Check out our Getting Started Guide for a more detailed walkthrough.
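
If you would rather start with QLoRA than LoRA, the switch is mostly a config change. As a hedged sketch (exact settings vary between the shipped examples), replacing roughly these fields in the example YAML moves the run to a 4-bit quantized adapter:

adapter: qlora             # quantized LoRA instead of plain LoRA
load_in_4bit: true         # load the base model in 4-bit via bitsandbytes
load_in_8bit: false
optimizer: adamw_bnb_8bit  # 8-bit optimizer commonly paired with QLoRA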

Key Features

  • Multiple Model Support: Train various models like LLaMA, Mistral, Mixtral, Pythia, and more
  • Training Methods: Full fine-tuning, LoRA, QLoRA, and more
  • Easy Configuration: Simple YAML files to control your training setup
  • Performance Optimizations: Flash Attention, xformers, multi-GPU training (see the config sketch after this list)
  • Flexible Dataset Handling: Use various formats and custom datasets
  • Cloud Ready: Run on cloud platforms or local hardware
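
As a non-authoritative sketch of how the performance, multi-GPU, and logging features above are switched on, the following keys exist in Axolotl's config schema; which combination is sensible depends on your hardware and model:

flash_attention: true                    # use Flash Attention kernels for attention
sample_packing: true                     # pack multiple short samples into each sequence
pad_to_sequence_len: true

deepspeed: deepspeed_configs/zero1.json  # multi-GPU training via DeepSpeed (configs fetched in Quick Start)

wandb_project: my-finetune               # placeholder project name for Weights & Biases logging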

📚 Documentation

🤝 Getting Help

🌟 Contributing

Contributions are welcome! Please see our Contributing Guide for details.

Supported Models

Support for each model is tracked across fp16/fp32, lora, qlora, gptq, gptq w/ flash attn, flash attn, and xformers attn, with each combination marked as supported, not supported, or untested. The covered model families are:

  • llama
  • Mistral
  • Mixtral-MoE
  • Mixtral8X22
  • Pythia
  • cerebras
  • btlm
  • mpt
  • falcon
  • gpt-j
  • XGen
  • phi
  • RWKV
  • Qwen
  • Gemma
  • Jamba

❤️ Sponsors

Thank you to our sponsors who help make Axolotl possible:

  • Modal - Modal lets you run jobs in the cloud, by just writing a few lines of Python. Customers use Modal to deploy Gen AI models at large scale, fine-tune large language models, run protein folding simulations, and much more.

Interested in sponsoring? Contact us at wing@axolotl.ai

📜 License

This project is licensed under the Apache 2.0 License - see the LICENSE file for details.
