🎉 Latest Updates

📅 2025/06: Magistral Support Added

Magistral with mistral-common tokenizer support has been added to Axolotl. See examples →

📅 2025/05: Quantization Aware Training (QAT)

QAT support has been added to Axolotl. Explore the docs →

📅 2025/04: Llama 4 Support

Llama 4 support has been added in Axolotl. See examples →

📅 2025/03: Sequence Parallelism & Multimodal Support

• Sequence Parallelism (SP) for scaling context length - Blog | Docs

• (Beta) Multimodal models fine-tuning - Check docs →

📅 2025/02: LoRA Optimizations & GRPO Support

• LoRA optimizations for better memory usage and speed - Docs →

• GRPO support added - Blog | Example

📅 2025/01: Reward Modelling Support

Reward Modelling / Process Reward Modelling fine-tuning support added. See docs →

✨ What Makes Axolotl Special

🚀 Multiple Model Support

Train LLaMA, Mistral, Mixtral, Pythia, and more. Full compatibility with HuggingFace transformers causal language models.

🎯 Advanced Training Methods

Full fine-tuning, LoRA, QLoRA, GPTQ, QAT, Preference Tuning (DPO, IPO, KTO, ORPO), RL (GRPO), Multimodal, and Reward Modelling.

⚙️ Easy Configuration

Reuse a single YAML file across dataset preprocessing, training, evaluation, quantization, and inference.

⚡ Performance Optimizations

Multipacking, Flash Attention, Xformers, Flex Attention, Liger Kernel, Sequence Parallelism, and Multi-GPU training.

📊 Flexible Dataset Handling

Load from local files, HuggingFace datasets, and cloud storage (S3, Azure, GCP, OCI).

☁️ Cloud Ready

Pre-built Docker images and PyPI packages for seamless deployment on cloud platforms and local hardware.

🚀 Quick Start

📋 Requirements

GPU: NVIDIA GPU (Ampere or newer for bf16 and Flash Attention) or AMD GPU
Python: 3.11+
PyTorch: ≥2.5.1

💾 Installation

# Install dependencies
pip3 install -U packaging==23.2 setuptools==75.8.0 wheel ninja

# Install Axolotl with Flash Attention and DeepSpeed
pip3 install --no-build-isolation axolotl[flash-attn,deepspeed]

# Download examples and configs
axolotl fetch examples
axolotl fetch deepspeed_configs  # OPTIONAL

Other installation methods available in our documentation

🎯 Your First Fine-tune

# Fetch examples
axolotl fetch examples

# Or specify custom path
axolotl fetch examples --dest path/to/folder

# Start training with LoRA
axolotl train examples/llama-3/lora-1b.yml

That's it! Check our Getting Started Guide for detailed walkthrough

🤝 Getting Help

💬

Community Support

Join thousands of developers in our Discord

Join Discord

📖

Examples

Browse our comprehensive examples

View Examples

🔧

Debugging

Troubleshooting and debugging guide

Debug Guide

✉️

Enterprise Support

Need dedicated support? Contact us

🧠 Streamlined AI Model Post-Training

🎉 Latest Updates

✨ What Makes Axolotl Special

🚀 Multiple Model Support

🎯 Advanced Training Methods

⚙️ Easy Configuration

⚡ Performance Optimizations

📊 Flexible Dataset Handling

☁️ Cloud Ready

🚀 Quick Start

📋 Requirements

💾 Installation

🎯 Your First Fine-tune

📚 Documentation Hub

🔧 Installation Options

⚙️ Configuration Guide

📊 Dataset Loading

📋 Dataset Guide

🖥️ Multi-GPU Training

🌐 Multi-Node Training

📦 Multipacking

🔍 API Reference

❓ FAQ

🤝 Getting Help

Community Support

Examples

Debugging

Enterprise Support

🌟 Contributing