Remove null from a dictionary-like obj or list.
-These can appear due to Dataset loading causing schema merge.
-See https://github.com/axolotl-ai-cloud/axolotl/pull/2909
%%capture# This step can take ~5-10 minutes to install dependencies!pip install --no-build-isolation axolotl[flash-attn]>=0.9.1
-!pip install "cut-cross-entropy[transformers] @ git+https://github.com/axolotl-ai-cloud/ml-cross-entropy.git@010c3ac3f1e725098961832830303eeb4142dd88"
2025/07: Voxtral with mistral-common tokenizer support has been integrated in Axolotl. Read the docs!
2025/07: TiledMLP support for single-GPU to multi-GPU training with DDP, DeepSpeed and FSDP support has been added to support Arctic Long Sequence Training. (ALST). See examples for using ALST with Axolotl!
2025/06: Magistral with mistral-common tokenizer support has been added to Axolotl. See examples to start training your own Magistral models with Axolotl!
2025/05: Quantization Aware Training (QAT) support has been added to Axolotl. Explore the docs to learn more!
diff --git a/search.json b/search.json
index 212ce1243..d3f20c606 100644
--- a/search.json
+++ b/search.json
@@ -18,7 +18,7 @@
"href": "index.html#latest-updates",
"title": "Axolotl",
"section": "🎉 Latest Updates",
- "text": "🎉 Latest Updates\n\n2025/07: Voxtral with mistral-common tokenizer support has been integrated in Axolotl. Read the docs!\n2025/07: TiledMLP support for single-GPU to multi-GPU training with DDP, DeepSpeed and FSDP support has been added to support Arctic Long Sequence Training. (ALST). See examples for using ALST with Axolotl!\n2025/06: Magistral with mistral-common tokenizer support has been added to Axolotl. See examples to start training your own Magistral models with Axolotl!\n2025/05: Quantization Aware Training (QAT) support has been added to Axolotl. Explore the docs to learn more!\n2025/04: Llama 4 support has been added in Axolotl. See examples to start training your own Llama 4 models with Axolotl’s linearized version!\n2025/03: Axolotl has implemented Sequence Parallelism (SP) support. Read the blog and docs to learn how to scale your context length when fine-tuning.\n2025/03: (Beta) Fine-tuning Multimodal models is now supported in Axolotl. Check out the docs to fine-tune your own!\n2025/02: Axolotl has added LoRA optimizations to reduce memory usage and improve training speed for LoRA and QLoRA in single GPU and multi-GPU training (DDP and DeepSpeed). Jump into the docs to give it a try.\n2025/02: Axolotl has added GRPO support. Dive into our blog and GRPO example and have some fun!\n2025/01: Axolotl has added Reward Modelling / Process Reward Modelling fine-tuning support. See docs.",
+ "text": "🎉 Latest Updates\n\n2025/07: TiledMLP support for single-GPU to multi-GPU training with DDP, DeepSpeed and FSDP support has been added to support Arctic Long Sequence Training. (ALST). See examples for using ALST with Axolotl!\n2025/06: Magistral with mistral-common tokenizer support has been added to Axolotl. See examples to start training your own Magistral models with Axolotl!\n2025/05: Quantization Aware Training (QAT) support has been added to Axolotl. Explore the docs to learn more!\n2025/04: Llama 4 support has been added in Axolotl. See examples to start training your own Llama 4 models with Axolotl’s linearized version!\n2025/03: Axolotl has implemented Sequence Parallelism (SP) support. Read the blog and docs to learn how to scale your context length when fine-tuning.\n2025/03: (Beta) Fine-tuning Multimodal models is now supported in Axolotl. Check out the docs to fine-tune your own!\n2025/02: Axolotl has added LoRA optimizations to reduce memory usage and improve training speed for LoRA and QLoRA in single GPU and multi-GPU training (DDP and DeepSpeed). Jump into the docs to give it a try.\n2025/02: Axolotl has added GRPO support. Dive into our blog and GRPO example and have some fun!\n2025/01: Axolotl has added Reward Modelling / Process Reward Modelling fine-tuning support. See docs.",
"crumbs": [
"Home"
]
@@ -1153,7 +1153,7 @@
"href": "docs/custom_integrations.html#cut-cross-entropy",
"title": "Custom Integrations",
"section": "Cut Cross Entropy",
- "text": "Cut Cross Entropy\nCut Cross Entropy (CCE) reduces VRAM usage through optimization on the cross-entropy operation during loss calculation.\nSee https://github.com/apple/ml-cross-entropy\n\nRequirements\n\nPyTorch 2.4.0 or higher\n\n\n\nInstallation\nRun the following command to install cut_cross_entropy[transformers] if you don’t have it already.\n\nIf you are in dev environment\n\npython scripts/cutcrossentropy_install.py | sh\n\nIf you are installing from pip\n\npip3 uninstall -y cut-cross-entropy && pip3 install \"cut-cross-entropy[transformers] @ git+https://github.com/axolotl-ai-cloud/ml-cross-entropy.git@010c3ac3f1e725098961832830303eeb4142dd88\"\n\n\nUsage\nplugins:\n - axolotl.integrations.cut_cross_entropy.CutCrossEntropyPlugin\n\n\nSupported Models\n\ncohere\ncohere2\ngemma\ngemma2\ngemma3\ngemma3_text\ngemma3n\ngemma3n_text\nglm\nglm4\nllama\nllama4\nllama4_text\nmistral\nmistral3\nmllama\nphi\nphi3\nphi4_multimodal\nqwen2\nqwen2_vl\nqwen2_moe\nqwen2_5_vl\nqwen3\nqwen3_moe\n\n\n\nCitation\n@article{wijmans2024cut,\n author = {Erik Wijmans and\n Brody Huval and\n Alexander Hertzberg and\n Vladlen Koltun and\n Philipp Kr\\\"ahenb\\\"uhl},\n title = {Cut Your Losses in Large-Vocabulary Language Models},\n journal = {arXiv},\n year = {2024},\n url = {https://arxiv.org/abs/2411.09009},\n}\nPlease see reference here",
+ "text": "Cut Cross Entropy\nCut Cross Entropy (CCE) reduces VRAM usage through optimization on the cross-entropy operation during loss calculation.\nSee https://github.com/apple/ml-cross-entropy\n\nRequirements\n\nPyTorch 2.4.0 or higher\n\n\n\nInstallation\nRun the following command to install cut_cross_entropy[transformers] if you don’t have it already.\n\nIf you are in dev environment\n\npython scripts/cutcrossentropy_install.py | sh\n\nIf you are installing from pip\n\npip3 uninstall -y cut-cross-entropy && pip3 install \"cut-cross-entropy[transformers] @ git+https://github.com/axolotl-ai-cloud/ml-cross-entropy.git@631d646\"\n\n\nUsage\nplugins:\n - axolotl.integrations.cut_cross_entropy.CutCrossEntropyPlugin\n\n\nSupported Models\n\ncohere\ncohere2\ngemma\ngemma2\ngemma3\ngemma3_text\ngemma3n\ngemma3n_text\nglm\nglm4\nllama\nllama4\nllama4_text\nmistral\nmistral3\nmllama\nphi\nphi3\nphi4_multimodal\nqwen2\nqwen2_vl\nqwen2_moe\nqwen2_5_vl\nqwen3\nqwen3_moe\n\n\n\nCitation\n@article{wijmans2024cut,\n author = {Erik Wijmans and\n Brody Huval and\n Alexander Hertzberg and\n Vladlen Koltun and\n Philipp Kr\\\"ahenb\\\"uhl},\n title = {Cut Your Losses in Large-Vocabulary Language Models},\n journal = {arXiv},\n year = {2024},\n url = {https://arxiv.org/abs/2411.09009},\n}\nPlease see reference here",
"crumbs": [
"Advanced Features",
"Custom Integrations"
@@ -3539,7 +3539,7 @@
"href": "docs/api/utils.dict.html",
"title": "utils.dict",
"section": "",
- "text": "utils.dict\nModule containing the DictDefault class\n\n\n\n\n\nName\nDescription\n\n\n\n\nDictDefault\nA Dict that returns None instead of returning empty Dict for missing keys.\n\n\n\n\n\nutils.dict.DictDefault()\nA Dict that returns None instead of returning empty Dict for missing keys.\n\n\n\n\n\n\n\nName\nDescription\n\n\n\n\nremove_none_values\nRemove null from a dictionary-like obj or list.\n\n\n\n\n\nutils.dict.remove_none_values(obj)\nRemove null from a dictionary-like obj or list.\nThese can appear due to Dataset loading causing schema merge.\nSee https://github.com/axolotl-ai-cloud/axolotl/pull/2909"
+ "text": "utils.dict\nModule containing the DictDefault class\n\n\n\n\n\nName\nDescription\n\n\n\n\nDictDefault\nA Dict that returns None instead of returning empty Dict for missing keys.\n\n\n\n\n\nutils.dict.DictDefault()\nA Dict that returns None instead of returning empty Dict for missing keys."
},
{
"objectID": "docs/api/utils.dict.html#classes",
@@ -3548,13 +3548,6 @@
"section": "",
"text": "Name\nDescription\n\n\n\n\nDictDefault\nA Dict that returns None instead of returning empty Dict for missing keys.\n\n\n\n\n\nutils.dict.DictDefault()\nA Dict that returns None instead of returning empty Dict for missing keys."
},
- {
- "objectID": "docs/api/utils.dict.html#functions",
- "href": "docs/api/utils.dict.html#functions",
- "title": "utils.dict",
- "section": "",
- "text": "Name\nDescription\n\n\n\n\nremove_none_values\nRemove null from a dictionary-like obj or list.\n\n\n\n\n\nutils.dict.remove_none_values(obj)\nRemove null from a dictionary-like obj or list.\nThese can appear due to Dataset loading causing schema merge.\nSee https://github.com/axolotl-ai-cloud/axolotl/pull/2909"
- },
{
"objectID": "docs/api/core.builders.rl.html",
"href": "docs/api/core.builders.rl.html",
diff --git a/sitemap.xml b/sitemap.xml
index 08d5a7736..06669a5a6 100644
--- a/sitemap.xml
+++ b/sitemap.xml
@@ -2,766 +2,766 @@
https://docs.axolotl.ai/TODO.html
- 2025-07-30T10:21:25.037Z
+ 2025-07-30T10:29:12.470Zhttps://docs.axolotl.ai/index.html
- 2025-07-30T10:21:25.058Z
+ 2025-07-30T10:29:12.491Zhttps://docs.axolotl.ai/docs/debugging.html
- 2025-07-30T10:21:25.039Z
+ 2025-07-30T10:29:12.472Zhttps://docs.axolotl.ai/docs/amd_hpc.html
- 2025-07-30T10:21:25.038Z
+ 2025-07-30T10:29:12.471Zhttps://docs.axolotl.ai/docs/api/utils.callbacks.mlflow_.html
- 2025-07-30T10:24:39.969Z
+ 2025-07-30T10:32:25.500Zhttps://docs.axolotl.ai/docs/api/monkeypatch.llama_expand_mask.html
- 2025-07-30T10:24:39.393Z
+ 2025-07-30T10:32:24.931Zhttps://docs.axolotl.ai/docs/api/loaders.patch_manager.html
- 2025-07-30T10:24:38.994Z
+ 2025-07-30T10:32:24.540Zhttps://docs.axolotl.ai/docs/api/core.chat.format.llama3x.html
- 2025-07-30T10:24:38.682Z
+ 2025-07-30T10:32:24.231Zhttps://docs.axolotl.ai/docs/api/cli.train.html
- 2025-07-30T10:24:38.736Z
+ 2025-07-30T10:32:24.285Zhttps://docs.axolotl.ai/docs/api/utils.callbacks.perplexity.html
- 2025-07-30T10:24:39.960Z
+ 2025-07-30T10:32:25.491Zhttps://docs.axolotl.ai/docs/api/core.chat.messages.html
- 2025-07-30T10:24:38.679Z
+ 2025-07-30T10:32:24.228Zhttps://docs.axolotl.ai/docs/api/utils.callbacks.lisa.html
- 2025-07-30T10:24:39.966Z
+ 2025-07-30T10:32:25.496Zhttps://docs.axolotl.ai/docs/api/cli.merge_sharded_fsdp_weights.html
- 2025-07-30T10:24:38.823Z
+ 2025-07-30T10:32:24.371Zhttps://docs.axolotl.ai/docs/api/cli.sweeps.html
- 2025-07-30T10:24:38.838Z
+ 2025-07-30T10:32:24.385Zhttps://docs.axolotl.ai/docs/api/utils.chat_templates.html
- 2025-07-30T10:24:39.491Z
+ 2025-07-30T10:32:25.028Zhttps://docs.axolotl.ai/docs/api/core.chat.format.shared.html
- 2025-07-30T10:24:38.684Z
+ 2025-07-30T10:32:24.233Zhttps://docs.axolotl.ai/docs/api/core.trainers.mixins.optimizer.html
- 2025-07-30T10:24:39.001Z
+ 2025-07-30T10:32:24.547Zhttps://docs.axolotl.ai/docs/api/utils.collators.mamba.html
- 2025-07-30T10:24:39.908Z
+ 2025-07-30T10:32:25.437Zhttps://docs.axolotl.ai/docs/api/logging_config.html
- 2025-07-30T10:24:38.628Z
+ 2025-07-30T10:32:24.177Zhttps://docs.axolotl.ai/docs/api/utils.collators.mm_chat.html
- 2025-07-30T10:24:39.913Z
+ 2025-07-30T10:32:25.441Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.completion.html
- 2025-07-30T10:24:39.124Z
+ 2025-07-30T10:32:24.667Zhttps://docs.axolotl.ai/docs/api/kernels.utils.html
- 2025-07-30T10:24:39.341Z
+ 2025-07-30T10:32:24.881Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.dpo.chat_template.html
- 2025-07-30T10:24:39.159Z
+ 2025-07-30T10:32:24.700Zhttps://docs.axolotl.ai/docs/api/kernels.swiglu.html
- 2025-07-30T10:24:39.332Z
+ 2025-07-30T10:32:24.872Zhttps://docs.axolotl.ai/docs/api/common.const.html
- 2025-07-30T10:24:39.868Z
+ 2025-07-30T10:32:25.397Zhttps://docs.axolotl.ai/docs/api/cli.cloud.base.html
- 2025-07-30T10:24:38.880Z
+ 2025-07-30T10:32:24.427Zhttps://docs.axolotl.ai/docs/api/utils.callbacks.comet_.html
- 2025-07-30T10:24:39.973Z
+ 2025-07-30T10:32:25.503Zhttps://docs.axolotl.ai/docs/api/monkeypatch.gradient_checkpointing.offload_cpu.html
- 2025-07-30T10:24:39.457Z
+ 2025-07-30T10:32:24.994Zhttps://docs.axolotl.ai/docs/api/common.architectures.html
- 2025-07-30T10:24:39.867Z
+ 2025-07-30T10:32:25.396Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.pygmalion.html
- 2025-07-30T10:24:39.153Z
+ 2025-07-30T10:32:24.695Zhttps://docs.axolotl.ai/docs/api/utils.schemas.peft.html
- 2025-07-30T10:24:39.670Z
+ 2025-07-30T10:32:25.202Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.dpo.user_defined.html
- 2025-07-30T10:24:39.182Z
+ 2025-07-30T10:32:24.724Zhttps://docs.axolotl.ai/docs/api/utils.schemas.datasets.html
- 2025-07-30T10:24:39.662Z
+ 2025-07-30T10:32:25.194Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.alpaca_w_system.html
- 2025-07-30T10:24:39.098Z
+ 2025-07-30T10:32:24.640Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.base.html
- 2025-07-30T10:24:39.037Z
+ 2025-07-30T10:32:24.581Zhttps://docs.axolotl.ai/docs/api/utils.lora.html
- 2025-07-30T10:24:39.496Z
+ 2025-07-30T10:32:25.033Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.input_output.html
- 2025-07-30T10:24:39.130Z
+ 2025-07-30T10:32:24.673Zhttps://docs.axolotl.ai/docs/api/utils.schemas.trl.html
- 2025-07-30T10:24:39.674Z
+ 2025-07-30T10:32:25.205Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.dpo.zephyr.html
- 2025-07-30T10:24:39.181Z
+ 2025-07-30T10:32:24.722Zhttps://docs.axolotl.ai/docs/api/integrations.kd.trainer.html
- 2025-07-30T10:24:39.855Z
+ 2025-07-30T10:32:25.384Zhttps://docs.axolotl.ai/docs/api/monkeypatch.gradient_checkpointing.offload_disk.html
- 2025-07-30T10:24:39.482Z
+ 2025-07-30T10:32:25.019Zhttps://docs.axolotl.ai/docs/api/utils.optimizers.adopt.html
- 2025-07-30T10:24:39.587Z
+ 2025-07-30T10:32:25.122Zhttps://docs.axolotl.ai/docs/api/monkeypatch.data.batch_dataset_fetcher.html
- 2025-07-30T10:24:39.452Z
+ 2025-07-30T10:32:24.989Zhttps://docs.axolotl.ai/docs/api/cli.cloud.modal_.html
- 2025-07-30T10:24:38.886Z
+ 2025-07-30T10:32:24.433Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.alpaca_chat.html
- 2025-07-30T10:24:39.084Z
+ 2025-07-30T10:32:24.627Zhttps://docs.axolotl.ai/docs/api/utils.freeze.html
- 2025-07-30T10:24:39.513Z
+ 2025-07-30T10:32:25.049Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.bradley_terry.llama3.html
- 2025-07-30T10:24:39.226Z
+ 2025-07-30T10:32:24.767Zhttps://docs.axolotl.ai/docs/api/integrations.base.html
- 2025-07-30T10:24:39.843Z
+ 2025-07-30T10:32:25.372Zhttps://docs.axolotl.ai/docs/api/monkeypatch.unsloth_.html
- 2025-07-30T10:24:39.450Z
+ 2025-07-30T10:32:24.987Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.kto.chatml.html
- 2025-07-30T10:24:39.200Z
+ 2025-07-30T10:32:24.741Zhttps://docs.axolotl.ai/docs/api/cli.main.html
- 2025-07-30T10:24:38.728Z
+ 2025-07-30T10:32:24.277Zhttps://docs.axolotl.ai/docs/api/common.datasets.html
- 2025-07-30T10:24:39.883Z
+ 2025-07-30T10:32:25.412Zhttps://docs.axolotl.ai/docs/api/train.html
- 2025-07-30T10:24:38.541Z
+ 2025-07-30T10:32:24.092Zhttps://docs.axolotl.ai/docs/api/core.trainers.base.html
- 2025-07-30T10:24:38.902Z
+ 2025-07-30T10:32:24.449Zhttps://docs.axolotl.ai/docs/api/core.trainers.mixins.scheduler.html
- 2025-07-30T10:24:39.011Z
+ 2025-07-30T10:32:24.556Zhttps://docs.axolotl.ai/docs/api/utils.ctx_managers.sequence_parallel.html
- 2025-07-30T10:24:39.035Z
+ 2025-07-30T10:32:24.580Zhttps://docs.axolotl.ai/docs/api/utils.schemas.config.html
- 2025-07-30T10:24:39.631Z
+ 2025-07-30T10:32:25.164Zhttps://docs.axolotl.ai/docs/api/loaders.tokenizer.html
- 2025-07-30T10:24:38.977Z
+ 2025-07-30T10:32:24.522Zhttps://docs.axolotl.ai/docs/api/integrations.liger.args.html
- 2025-07-30T10:24:39.859Z
+ 2025-07-30T10:32:25.388Zhttps://docs.axolotl.ai/docs/api/cli.config.html
- 2025-07-30T10:24:38.788Z
+ 2025-07-30T10:32:24.337Zhttps://docs.axolotl.ai/docs/api/loaders.processor.html
- 2025-07-30T10:24:38.978Z
+ 2025-07-30T10:32:24.524Zhttps://docs.axolotl.ai/docs/api/monkeypatch.utils.html
- 2025-07-30T10:24:39.430Z
+ 2025-07-30T10:32:24.967Zhttps://docs.axolotl.ai/docs/api/integrations.cut_cross_entropy.args.html
- 2025-07-30T10:24:39.847Z
+ 2025-07-30T10:32:25.376Zhttps://docs.axolotl.ai/docs/api/core.trainers.dpo.trainer.html
- 2025-07-30T10:24:38.933Z
+ 2025-07-30T10:32:24.480Zhttps://docs.axolotl.ai/docs/api/loaders.adapter.html
- 2025-07-30T10:24:38.984Z
+ 2025-07-30T10:32:24.529Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.dpo.llama3.html
- 2025-07-30T10:24:39.169Z
+ 2025-07-30T10:32:24.711Zhttps://docs.axolotl.ai/docs/api/core.datasets.transforms.chat_builder.html
- 2025-07-30T10:24:38.697Z
+ 2025-07-30T10:32:24.245Zhttps://docs.axolotl.ai/docs/api/monkeypatch.transformers_fa_utils.html
- 2025-07-30T10:24:39.449Z
+ 2025-07-30T10:32:24.986Zhttps://docs.axolotl.ai/docs/api/datasets.html
- 2025-07-30T10:24:38.562Z
+ 2025-07-30T10:32:24.113Zhttps://docs.axolotl.ai/docs/api/monkeypatch.btlm_attn_hijack_flash.html
- 2025-07-30T10:24:39.432Z
+ 2025-07-30T10:32:24.969Zhttps://docs.axolotl.ai/docs/api/cli.inference.html
- 2025-07-30T10:24:38.802Z
+ 2025-07-30T10:32:24.351Zhttps://docs.axolotl.ai/docs/api/cli.utils.html
- 2025-07-30T10:24:38.870Z
+ 2025-07-30T10:32:24.417Zhttps://docs.axolotl.ai/docs/api/cli.preprocess.html
- 2025-07-30T10:24:38.831Z
+ 2025-07-30T10:32:24.379Zhttps://docs.axolotl.ai/docs/api/loaders.model.html
- 2025-07-30T10:24:38.968Z
+ 2025-07-30T10:32:24.514Zhttps://docs.axolotl.ai/docs/api/core.trainers.grpo.sampler.html
- 2025-07-30T10:24:38.957Z
+ 2025-07-30T10:32:24.503Zhttps://docs.axolotl.ai/docs/api/core.trainers.grpo.trainer.html
- 2025-07-30T10:24:38.944Z
+ 2025-07-30T10:32:24.491Zhttps://docs.axolotl.ai/docs/batch_vs_grad.html
- 2025-07-30T10:21:25.038Z
+ 2025-07-30T10:29:12.471Zhttps://docs.axolotl.ai/docs/custom_integrations.html
- 2025-07-30T10:21:25.038Z
+ 2025-07-30T10:29:12.471Zhttps://docs.axolotl.ai/docs/quantize.html
- 2025-07-30T10:21:25.042Z
+ 2025-07-30T10:29:12.475Zhttps://docs.axolotl.ai/docs/installation.html
- 2025-07-30T10:21:25.042Z
+ 2025-07-30T10:29:12.475Zhttps://docs.axolotl.ai/docs/docker.html
- 2025-07-30T10:21:25.039Z
+ 2025-07-30T10:29:12.472Zhttps://docs.axolotl.ai/docs/reward_modelling.html
- 2025-07-30T10:21:25.042Z
+ 2025-07-30T10:29:12.476Zhttps://docs.axolotl.ai/docs/dataset-formats/inst_tune.html
- 2025-07-30T10:21:25.038Z
+ 2025-07-30T10:29:12.472Zhttps://docs.axolotl.ai/docs/dataset-formats/conversation.html
- 2025-07-30T10:21:25.038Z
+ 2025-07-30T10:29:12.471Zhttps://docs.axolotl.ai/docs/dataset-formats/tokenized.html
- 2025-07-30T10:21:25.039Z
+ 2025-07-30T10:29:12.472Zhttps://docs.axolotl.ai/docs/rlhf.html
- 2025-07-30T10:21:25.042Z
+ 2025-07-30T10:29:12.476Zhttps://docs.axolotl.ai/docs/sequence_parallelism.html
- 2025-07-30T10:21:25.043Z
+ 2025-07-30T10:29:12.476Zhttps://docs.axolotl.ai/docs/inference.html
- 2025-07-30T10:21:25.042Z
+ 2025-07-30T10:29:12.475Zhttps://docs.axolotl.ai/docs/fsdp_qlora.html
- 2025-07-30T10:21:25.039Z
+ 2025-07-30T10:29:12.472Zhttps://docs.axolotl.ai/docs/multi-node.html
- 2025-07-30T10:21:25.042Z
+ 2025-07-30T10:29:12.475Zhttps://docs.axolotl.ai/docs/lora_optims.html
- 2025-07-30T10:21:25.042Z
+ 2025-07-30T10:29:12.475Zhttps://docs.axolotl.ai/docs/getting-started.html
- 2025-07-30T10:21:25.039Z
+ 2025-07-30T10:29:12.472Zhttps://docs.axolotl.ai/docs/dataset_loading.html
- 2025-07-30T10:21:25.039Z
+ 2025-07-30T10:29:12.472Zhttps://docs.axolotl.ai/docs/lr_groups.html
- 2025-07-30T10:21:25.042Z
+ 2025-07-30T10:29:12.475Zhttps://docs.axolotl.ai/docs/mac.html
- 2025-07-30T10:21:25.042Z
+ 2025-07-30T10:29:12.475Zhttps://docs.axolotl.ai/src/axolotl/integrations/cut_cross_entropy/ACKNOWLEDGEMENTS.html
- 2025-07-30T10:21:25.062Z
+ 2025-07-30T10:29:12.495Zhttps://docs.axolotl.ai/src/axolotl/integrations/LICENSE.html
- 2025-07-30T10:21:25.062Z
+ 2025-07-30T10:29:12.495Zhttps://docs.axolotl.ai/docs/input_output.html
- 2025-07-30T10:21:25.042Z
+ 2025-07-30T10:29:12.475Zhttps://docs.axolotl.ai/docs/gradient_checkpointing.html
- 2025-07-30T10:21:25.039Z
+ 2025-07-30T10:29:12.472Zhttps://docs.axolotl.ai/docs/qat.html
- 2025-07-30T10:21:25.042Z
+ 2025-07-30T10:29:12.475Zhttps://docs.axolotl.ai/docs/faq.html
- 2025-07-30T10:21:25.039Z
+ 2025-07-30T10:29:12.472Zhttps://docs.axolotl.ai/docs/dataset_preprocessing.html
- 2025-07-30T10:21:25.039Z
+ 2025-07-30T10:29:12.472Zhttps://docs.axolotl.ai/docs/nccl.html
- 2025-07-30T10:21:25.042Z
+ 2025-07-30T10:29:12.475Zhttps://docs.axolotl.ai/docs/cli.html
- 2025-07-30T10:21:25.038Z
+ 2025-07-30T10:29:12.471Zhttps://docs.axolotl.ai/docs/torchao.html
- 2025-07-30T10:21:25.043Z
+ 2025-07-30T10:29:12.476Zhttps://docs.axolotl.ai/docs/multi-gpu.html
- 2025-07-30T10:21:25.042Z
+ 2025-07-30T10:29:12.475Zhttps://docs.axolotl.ai/docs/dataset-formats/pretraining.html
- 2025-07-30T10:21:25.038Z
+ 2025-07-30T10:29:12.472Zhttps://docs.axolotl.ai/docs/dataset-formats/index.html
- 2025-07-30T10:21:25.038Z
+ 2025-07-30T10:29:12.472Zhttps://docs.axolotl.ai/docs/dataset-formats/template_free.html
- 2025-07-30T10:21:25.039Z
+ 2025-07-30T10:29:12.472Zhttps://docs.axolotl.ai/docs/dataset-formats/stepwise_supervised.html
- 2025-07-30T10:21:25.038Z
+ 2025-07-30T10:29:12.472Zhttps://docs.axolotl.ai/docs/ray-integration.html
- 2025-07-30T10:21:25.042Z
+ 2025-07-30T10:29:12.476Zhttps://docs.axolotl.ai/docs/unsloth.html
- 2025-07-30T10:21:25.043Z
+ 2025-07-30T10:29:12.476Zhttps://docs.axolotl.ai/docs/mixed_precision.html
- 2025-07-30T10:21:25.042Z
+ 2025-07-30T10:29:12.475Zhttps://docs.axolotl.ai/docs/multimodal.html
- 2025-07-30T10:21:25.042Z
+ 2025-07-30T10:29:12.475Zhttps://docs.axolotl.ai/docs/config-reference.html
- 2025-07-30T10:24:53.939Z
+ 2025-07-30T10:32:38.090Zhttps://docs.axolotl.ai/docs/api/prompt_tokenizers.html
- 2025-07-30T10:24:38.618Z
+ 2025-07-30T10:32:24.168Zhttps://docs.axolotl.ai/docs/api/utils.schedulers.html
- 2025-07-30T10:24:39.554Z
+ 2025-07-30T10:32:25.091Zhttps://docs.axolotl.ai/docs/api/utils.samplers.multipack.html
- 2025-07-30T10:24:39.954Z
+ 2025-07-30T10:32:25.484Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.orcamini.html
- 2025-07-30T10:24:39.146Z
+ 2025-07-30T10:32:24.688Zhttps://docs.axolotl.ai/docs/api/core.trainers.trl.html
- 2025-07-30T10:24:38.917Z
+ 2025-07-30T10:32:24.463Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.metharme.html
- 2025-07-30T10:24:39.142Z
+ 2025-07-30T10:32:24.684Zhttps://docs.axolotl.ai/docs/api/utils.callbacks.profiler.html
- 2025-07-30T10:24:39.964Z
+ 2025-07-30T10:32:25.495Zhttps://docs.axolotl.ai/docs/api/utils.schemas.enums.html
- 2025-07-30T10:24:39.702Z
+ 2025-07-30T10:32:25.233Zhttps://docs.axolotl.ai/docs/api/core.trainers.mamba.html
- 2025-07-30T10:24:38.922Z
+ 2025-07-30T10:32:24.469Zhttps://docs.axolotl.ai/docs/api/monkeypatch.llama_attn_hijack_flash.html
- 2025-07-30T10:24:39.367Z
+ 2025-07-30T10:32:24.906Zhttps://docs.axolotl.ai/docs/api/monkeypatch.relora.html
- 2025-07-30T10:24:39.392Z
+ 2025-07-30T10:32:24.930Zhttps://docs.axolotl.ai/docs/api/monkeypatch.stablelm_attn_hijack_flash.html
- 2025-07-30T10:24:39.439Z
+ 2025-07-30T10:32:24.976Zhttps://docs.axolotl.ai/docs/api/loaders.constants.html
- 2025-07-30T10:24:38.995Z
+ 2025-07-30T10:32:24.541Zhttps://docs.axolotl.ai/docs/api/utils.callbacks.qat.html
- 2025-07-30T10:24:39.980Z
+ 2025-07-30T10:32:25.510Zhttps://docs.axolotl.ai/docs/api/utils.schemas.model.html
- 2025-07-30T10:24:39.638Z
+ 2025-07-30T10:32:25.171Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.stepwise_supervised.html
- 2025-07-30T10:24:39.135Z
+ 2025-07-30T10:32:24.677Zhttps://docs.axolotl.ai/docs/api/integrations.grokfast.optimizer.html
- 2025-07-30T10:24:39.848Z
+ 2025-07-30T10:32:25.377Zhttps://docs.axolotl.ai/docs/api/convert.html
- 2025-07-30T10:24:38.576Z
+ 2025-07-30T10:32:24.126Zhttps://docs.axolotl.ai/docs/api/kernels.quantize.html
- 2025-07-30T10:24:39.340Z
+ 2025-07-30T10:32:24.879Zhttps://docs.axolotl.ai/docs/api/core.training_args.html
- 2025-07-30T10:24:38.656Z
+ 2025-07-30T10:32:24.205Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.chat_template.html
- 2025-07-30T10:24:39.070Z
+ 2025-07-30T10:32:24.614Zhttps://docs.axolotl.ai/docs/api/index.html
- 2025-07-30T10:24:38.479Z
+ 2025-07-30T10:32:24.030Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.llama2_chat.html
- 2025-07-30T10:24:39.118Z
+ 2025-07-30T10:32:24.661Zhttps://docs.axolotl.ai/docs/api/utils.trainer.html
- 2025-07-30T10:24:39.530Z
+ 2025-07-30T10:32:25.067Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.messages.chat.html
- 2025-07-30T10:24:39.157Z
+ 2025-07-30T10:32:24.699Zhttps://docs.axolotl.ai/docs/api/monkeypatch.lora_kernels.html
- 2025-07-30T10:24:39.422Z
+ 2025-07-30T10:32:24.960Zhttps://docs.axolotl.ai/docs/api/kernels.lora.html
- 2025-07-30T10:24:39.311Z
+ 2025-07-30T10:32:24.851Zhttps://docs.axolotl.ai/docs/api/cli.vllm_serve.html
- 2025-07-30T10:24:38.877Z
+ 2025-07-30T10:32:24.424Zhttps://docs.axolotl.ai/docs/api/utils.schemas.multimodal.html
- 2025-07-30T10:24:39.679Z
+ 2025-07-30T10:32:25.211Zhttps://docs.axolotl.ai/docs/api/utils.schemas.utils.html
- 2025-07-30T10:24:39.707Z
+ 2025-07-30T10:32:25.239Zhttps://docs.axolotl.ai/docs/api/monkeypatch.llama_attn_hijack_xformers.html
- 2025-07-30T10:24:39.369Z
+ 2025-07-30T10:32:24.907Zhttps://docs.axolotl.ai/docs/api/integrations.lm_eval.args.html
- 2025-07-30T10:24:39.862Z
+ 2025-07-30T10:32:25.391Zhttps://docs.axolotl.ai/docs/api/monkeypatch.mistral_attn_hijack_flash.html
- 2025-07-30T10:24:39.383Z
+ 2025-07-30T10:32:24.922Zhttps://docs.axolotl.ai/docs/api/utils.collators.core.html
- 2025-07-30T10:24:39.886Z
+ 2025-07-30T10:32:25.414Zhttps://docs.axolotl.ai/docs/api/core.chat.format.chatml.html
- 2025-07-30T10:24:38.681Z
+ 2025-07-30T10:32:24.230Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.dpo.passthrough.html
- 2025-07-30T10:24:39.184Z
+ 2025-07-30T10:32:24.725Zhttps://docs.axolotl.ai/docs/api/core.datasets.chat.html
- 2025-07-30T10:24:38.689Z
+ 2025-07-30T10:32:24.238Zhttps://docs.axolotl.ai/docs/api/utils.bench.html
- 2025-07-30T10:24:39.505Z
+ 2025-07-30T10:32:25.042Zhttps://docs.axolotl.ai/docs/api/utils.schemas.training.html
- 2025-07-30T10:24:39.643Z
+ 2025-07-30T10:32:25.176Zhttps://docs.axolotl.ai/docs/api/utils.collators.batching.html
- 2025-07-30T10:24:39.905Z
+ 2025-07-30T10:32:25.433Zhttps://docs.axolotl.ai/docs/api/monkeypatch.llama_patch_multipack.html
- 2025-07-30T10:24:39.433Z
+ 2025-07-30T10:32:24.970Zhttps://docs.axolotl.ai/docs/api/monkeypatch.multipack.html
- 2025-07-30T10:24:39.385Z
+ 2025-07-30T10:32:24.923Zhttps://docs.axolotl.ai/docs/api/core.builders.causal.html
- 2025-07-30T10:24:38.639Z
+ 2025-07-30T10:32:24.188Zhttps://docs.axolotl.ai/docs/api/cli.evaluate.html
- 2025-07-30T10:24:38.744Z
+ 2025-07-30T10:32:24.293Zhttps://docs.axolotl.ai/docs/api/monkeypatch.trainer_fsdp_optim.html
- 2025-07-30T10:24:39.442Z
+ 2025-07-30T10:32:24.979Zhttps://docs.axolotl.ai/docs/api/core.trainers.utils.html
- 2025-07-30T10:24:38.958Z
+ 2025-07-30T10:32:24.505Zhttps://docs.axolotl.ai/docs/api/utils.schemas.integrations.html
- 2025-07-30T10:24:39.691Z
+ 2025-07-30T10:32:25.223Zhttps://docs.axolotl.ai/docs/api/utils.dict.html
- 2025-07-30T10:24:39.580Z
+ 2025-07-30T10:32:25.114Zhttps://docs.axolotl.ai/docs/api/core.builders.rl.html
- 2025-07-30T10:24:38.643Z
+ 2025-07-30T10:32:24.193Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.orpo.chat_template.html
- 2025-07-30T10:24:39.222Z
+ 2025-07-30T10:32:24.763Zhttps://docs.axolotl.ai/docs/api/core.trainers.relora.html
- 2025-07-30T10:24:38.927Z
+ 2025-07-30T10:32:24.474Zhttps://docs.axolotl.ai/docs/api/integrations.spectrum.args.html
- 2025-07-30T10:24:39.865Z
+ 2025-07-30T10:32:25.394Zhttps://docs.axolotl.ai/docs/api/cli.quantize.html
- 2025-07-30T10:24:38.891Z
+ 2025-07-30T10:32:24.438Zhttps://docs.axolotl.ai/docs/api/cli.checks.html
- 2025-07-30T10:24:38.770Z
+ 2025-07-30T10:32:24.319Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.kto.llama3.html
- 2025-07-30T10:24:39.192Z
+ 2025-07-30T10:32:24.733Zhttps://docs.axolotl.ai/docs/api/utils.model_shard_quant.html
- 2025-07-30T10:24:39.501Z
+ 2025-07-30T10:32:25.038Zhttps://docs.axolotl.ai/docs/api/utils.quantization.html
- 2025-07-30T10:24:39.617Z
+ 2025-07-30T10:32:25.150Zhttps://docs.axolotl.ai/docs/api/core.trainers.mixins.rng_state_loader.html
- 2025-07-30T10:24:39.004Z
+ 2025-07-30T10:32:24.550Zhttps://docs.axolotl.ai/docs/api/kernels.geglu.html
- 2025-07-30T10:24:39.322Z
+ 2025-07-30T10:32:24.862Zhttps://docs.axolotl.ai/docs/api/utils.data.pretraining.html
- 2025-07-30T10:24:39.589Z
+ 2025-07-30T10:32:25.123Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.kto.user_defined.html
- 2025-07-30T10:24:39.202Z
+ 2025-07-30T10:32:24.743Zhttps://docs.axolotl.ai/docs/api/core.builders.base.html
- 2025-07-30T10:24:38.634Z
+ 2025-07-30T10:32:24.183Zhttps://docs.axolotl.ai/docs/api/cli.merge_lora.html
- 2025-07-30T10:24:38.811Z
+ 2025-07-30T10:32:24.359Zhttps://docs.axolotl.ai/docs/api/monkeypatch.mixtral.html
- 2025-07-30T10:24:39.453Z
+ 2025-07-30T10:32:24.990Zhttps://docs.axolotl.ai/docs/api/utils.data.sft.html
- 2025-07-30T10:24:39.596Z
+ 2025-07-30T10:32:25.130Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.user_defined.html
- 2025-07-30T10:24:39.106Z
+ 2025-07-30T10:32:24.648Zhttps://docs.axolotl.ai/docs/api/utils.tokenization.html
- 2025-07-30T10:24:39.489Z
+ 2025-07-30T10:32:25.026Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.dpo.chatml.html
- 2025-07-30T10:24:39.179Z
+ 2025-07-30T10:32:24.721Zhttps://docs.axolotl.ai/docs/api/models.mamba.modeling_mamba.html
- 2025-07-30T10:24:39.884Z
+ 2025-07-30T10:32:25.413Zhttps://docs.axolotl.ai/docs/api/cli.args.html
- 2025-07-30T10:24:38.764Z
+ 2025-07-30T10:32:24.313Zhttps://docs.axolotl.ai/docs/api/evaluate.html
- 2025-07-30T10:24:38.551Z
+ 2025-07-30T10:32:24.102Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.alpaca_instruct.html
- 2025-07-30T10:24:39.086Z
+ 2025-07-30T10:32:24.629Zhttps://docs.axolotl.ai/docs/api/utils.distributed.html
- 2025-07-30T10:24:39.574Z
+ 2025-07-30T10:32:25.111Zhttps://docs.axolotl.ai/docs/multipack.html
- 2025-07-30T10:21:25.042Z
+ 2025-07-30T10:29:12.475Zhttps://docs.axolotl.ai/examples/colab-notebooks/colab-axolotl-example.html
- 2025-07-30T10:21:25.046Z
+ 2025-07-30T10:29:12.480Zhttps://docs.axolotl.ai/FAQS.html
- 2025-07-30T10:21:25.037Z
+ 2025-07-30T10:29:12.470Z