Axolotl

🧠 Streamlined AI Model Post-Training

A powerful, flexible tool designed to streamline post-training for various AI models with enterprise-grade features and optimizations.

GitHub License GitHub Repo stars
tests codecov
contributors Releases
discord twitter
---

🎉 Latest Updates

📅 2025/06: Magistral Support Added

Magistral with mistral-common tokenizer support has been added to Axolotl. See examples →

📅 2025/05: Quantization Aware Training (QAT)

QAT support has been added to Axolotl. Explore the docs →

📅 2025/04: Llama 4 Support

Llama 4 support has been added in Axolotl. See examples →

📅 2025/03: Sequence Parallelism & Multimodal Support

• Sequence Parallelism (SP) for scaling context length - Blog | Docs

• (Beta) Multimodal models fine-tuning - Check docs →

📅 2025/02: LoRA Optimizations & GRPO Support

• LoRA optimizations for better memory usage and speed - Docs →

• GRPO support added - Blog | Example

📅 2025/01: Reward Modelling Support

Reward Modelling / Process Reward Modelling fine-tuning support added. See docs →

---

What Makes Axolotl Special

🚀 Multiple Model Support

Train LLaMA, Mistral, Mixtral, Pythia, and more. Full compatibility with HuggingFace transformers causal language models.

🎯 Advanced Training Methods

Full fine-tuning, LoRA, QLoRA, GPTQ, QAT, Preference Tuning (DPO, IPO, KTO, ORPO), RL (GRPO), Multimodal, and Reward Modelling.

⚙️ Easy Configuration

Reuse a single YAML file across dataset preprocessing, training, evaluation, quantization, and inference.

⚡ Performance Optimizations

Multipacking, Flash Attention, Xformers, Flex Attention, Liger Kernel, Sequence Parallelism, and Multi-GPU training.

📊 Flexible Dataset Handling

Load from local files, HuggingFace datasets, and cloud storage (S3, Azure, GCP, OCI).

☁️ Cloud Ready

Pre-built Docker images and PyPI packages for seamless deployment on cloud platforms and local hardware.

---

🚀 Quick Start

📋 Requirements

💾 Installation

# Install dependencies
pip3 install -U packaging==23.2 setuptools==75.8.0 wheel ninja

# Install Axolotl with Flash Attention and DeepSpeed
pip3 install --no-build-isolation axolotl[flash-attn,deepspeed]

# Download examples and configs
axolotl fetch examples
axolotl fetch deepspeed_configs  # OPTIONAL

Other installation methods available in our documentation

🎯 Your First Fine-tune

# Fetch examples
axolotl fetch examples

# Or specify custom path
axolotl fetch examples --dest path/to/folder

# Start training with LoRA
axolotl train examples/llama-3/lora-1b.yml

That's it! Check our Getting Started Guide for detailed walkthrough

---

📚 Documentation Hub

🔧 Installation Options

Detailed setup instructions for different environments

⚙️ Configuration Guide

Full configuration options and examples

📊 Dataset Loading

Loading datasets from various sources

📋 Dataset Guide

Supported formats and usage instructions

🖥️ Multi-GPU Training

Scale your training across multiple GPUs

🌐 Multi-Node Training

Distributed training across multiple machines

📦 Multipacking

Efficient batch packing for training

🔍 API Reference

Auto-generated code documentation

❓ FAQ

Frequently asked questions

---

🤝 Getting Help

💬

Community Support

Join thousands of developers in our Discord

Join Discord
📖

Examples

Browse our comprehensive examples

View Examples
🔧

Debugging

Troubleshooting and debugging guide

Debug Guide
✉️

Enterprise Support

Need dedicated support? Contact us

Contact Us
---

🌟 Contributing

We welcome contributions from the community! Whether it's bug fixes,