Remove null from a dictionary-like obj or list.
+These can appear due to Dataset loading causing schema merge.
+See https://github.com/axolotl-ai-cloud/axolotl/pull/2909
%%capture# This step can take ~5-10 minutes to install dependencies!pip install --no-build-isolation axolotl[flash-attn]>=0.9.1
-!pip install "cut-cross-entropy[transformers] @ git+https://github.com/axolotl-ai-cloud/ml-cross-entropy.git@631d646"
2025/07: Voxtral with mistral-common tokenizer support has been integrated in Axolotl. Read the docs!
2025/07: TiledMLP support for single-GPU to multi-GPU training with DDP, DeepSpeed and FSDP support has been added to support Arctic Long Sequence Training. (ALST). See examples for using ALST with Axolotl!
2025/06: Magistral with mistral-common tokenizer support has been added to Axolotl. See examples to start training your own Magistral models with Axolotl!
2025/05: Quantization Aware Training (QAT) support has been added to Axolotl. Explore the docs to learn more!
diff --git a/search.json b/search.json
index d3f20c606..212ce1243 100644
--- a/search.json
+++ b/search.json
@@ -18,7 +18,7 @@
"href": "index.html#latest-updates",
"title": "Axolotl",
"section": "🎉 Latest Updates",
- "text": "🎉 Latest Updates\n\n2025/07: TiledMLP support for single-GPU to multi-GPU training with DDP, DeepSpeed and FSDP support has been added to support Arctic Long Sequence Training. (ALST). See examples for using ALST with Axolotl!\n2025/06: Magistral with mistral-common tokenizer support has been added to Axolotl. See examples to start training your own Magistral models with Axolotl!\n2025/05: Quantization Aware Training (QAT) support has been added to Axolotl. Explore the docs to learn more!\n2025/04: Llama 4 support has been added in Axolotl. See examples to start training your own Llama 4 models with Axolotl’s linearized version!\n2025/03: Axolotl has implemented Sequence Parallelism (SP) support. Read the blog and docs to learn how to scale your context length when fine-tuning.\n2025/03: (Beta) Fine-tuning Multimodal models is now supported in Axolotl. Check out the docs to fine-tune your own!\n2025/02: Axolotl has added LoRA optimizations to reduce memory usage and improve training speed for LoRA and QLoRA in single GPU and multi-GPU training (DDP and DeepSpeed). Jump into the docs to give it a try.\n2025/02: Axolotl has added GRPO support. Dive into our blog and GRPO example and have some fun!\n2025/01: Axolotl has added Reward Modelling / Process Reward Modelling fine-tuning support. See docs.",
+ "text": "🎉 Latest Updates\n\n2025/07: Voxtral with mistral-common tokenizer support has been integrated in Axolotl. Read the docs!\n2025/07: TiledMLP support for single-GPU to multi-GPU training with DDP, DeepSpeed and FSDP support has been added to support Arctic Long Sequence Training. (ALST). See examples for using ALST with Axolotl!\n2025/06: Magistral with mistral-common tokenizer support has been added to Axolotl. See examples to start training your own Magistral models with Axolotl!\n2025/05: Quantization Aware Training (QAT) support has been added to Axolotl. Explore the docs to learn more!\n2025/04: Llama 4 support has been added in Axolotl. See examples to start training your own Llama 4 models with Axolotl’s linearized version!\n2025/03: Axolotl has implemented Sequence Parallelism (SP) support. Read the blog and docs to learn how to scale your context length when fine-tuning.\n2025/03: (Beta) Fine-tuning Multimodal models is now supported in Axolotl. Check out the docs to fine-tune your own!\n2025/02: Axolotl has added LoRA optimizations to reduce memory usage and improve training speed for LoRA and QLoRA in single GPU and multi-GPU training (DDP and DeepSpeed). Jump into the docs to give it a try.\n2025/02: Axolotl has added GRPO support. Dive into our blog and GRPO example and have some fun!\n2025/01: Axolotl has added Reward Modelling / Process Reward Modelling fine-tuning support. See docs.",
"crumbs": [
"Home"
]
@@ -1153,7 +1153,7 @@
"href": "docs/custom_integrations.html#cut-cross-entropy",
"title": "Custom Integrations",
"section": "Cut Cross Entropy",
- "text": "Cut Cross Entropy\nCut Cross Entropy (CCE) reduces VRAM usage through optimization on the cross-entropy operation during loss calculation.\nSee https://github.com/apple/ml-cross-entropy\n\nRequirements\n\nPyTorch 2.4.0 or higher\n\n\n\nInstallation\nRun the following command to install cut_cross_entropy[transformers] if you don’t have it already.\n\nIf you are in dev environment\n\npython scripts/cutcrossentropy_install.py | sh\n\nIf you are installing from pip\n\npip3 uninstall -y cut-cross-entropy && pip3 install \"cut-cross-entropy[transformers] @ git+https://github.com/axolotl-ai-cloud/ml-cross-entropy.git@631d646\"\n\n\nUsage\nplugins:\n - axolotl.integrations.cut_cross_entropy.CutCrossEntropyPlugin\n\n\nSupported Models\n\ncohere\ncohere2\ngemma\ngemma2\ngemma3\ngemma3_text\ngemma3n\ngemma3n_text\nglm\nglm4\nllama\nllama4\nllama4_text\nmistral\nmistral3\nmllama\nphi\nphi3\nphi4_multimodal\nqwen2\nqwen2_vl\nqwen2_moe\nqwen2_5_vl\nqwen3\nqwen3_moe\n\n\n\nCitation\n@article{wijmans2024cut,\n author = {Erik Wijmans and\n Brody Huval and\n Alexander Hertzberg and\n Vladlen Koltun and\n Philipp Kr\\\"ahenb\\\"uhl},\n title = {Cut Your Losses in Large-Vocabulary Language Models},\n journal = {arXiv},\n year = {2024},\n url = {https://arxiv.org/abs/2411.09009},\n}\nPlease see reference here",
+ "text": "Cut Cross Entropy\nCut Cross Entropy (CCE) reduces VRAM usage through optimization on the cross-entropy operation during loss calculation.\nSee https://github.com/apple/ml-cross-entropy\n\nRequirements\n\nPyTorch 2.4.0 or higher\n\n\n\nInstallation\nRun the following command to install cut_cross_entropy[transformers] if you don’t have it already.\n\nIf you are in dev environment\n\npython scripts/cutcrossentropy_install.py | sh\n\nIf you are installing from pip\n\npip3 uninstall -y cut-cross-entropy && pip3 install \"cut-cross-entropy[transformers] @ git+https://github.com/axolotl-ai-cloud/ml-cross-entropy.git@010c3ac3f1e725098961832830303eeb4142dd88\"\n\n\nUsage\nplugins:\n - axolotl.integrations.cut_cross_entropy.CutCrossEntropyPlugin\n\n\nSupported Models\n\ncohere\ncohere2\ngemma\ngemma2\ngemma3\ngemma3_text\ngemma3n\ngemma3n_text\nglm\nglm4\nllama\nllama4\nllama4_text\nmistral\nmistral3\nmllama\nphi\nphi3\nphi4_multimodal\nqwen2\nqwen2_vl\nqwen2_moe\nqwen2_5_vl\nqwen3\nqwen3_moe\n\n\n\nCitation\n@article{wijmans2024cut,\n author = {Erik Wijmans and\n Brody Huval and\n Alexander Hertzberg and\n Vladlen Koltun and\n Philipp Kr\\\"ahenb\\\"uhl},\n title = {Cut Your Losses in Large-Vocabulary Language Models},\n journal = {arXiv},\n year = {2024},\n url = {https://arxiv.org/abs/2411.09009},\n}\nPlease see reference here",
"crumbs": [
"Advanced Features",
"Custom Integrations"
@@ -3539,7 +3539,7 @@
"href": "docs/api/utils.dict.html",
"title": "utils.dict",
"section": "",
- "text": "utils.dict\nModule containing the DictDefault class\n\n\n\n\n\nName\nDescription\n\n\n\n\nDictDefault\nA Dict that returns None instead of returning empty Dict for missing keys.\n\n\n\n\n\nutils.dict.DictDefault()\nA Dict that returns None instead of returning empty Dict for missing keys."
+ "text": "utils.dict\nModule containing the DictDefault class\n\n\n\n\n\nName\nDescription\n\n\n\n\nDictDefault\nA Dict that returns None instead of returning empty Dict for missing keys.\n\n\n\n\n\nutils.dict.DictDefault()\nA Dict that returns None instead of returning empty Dict for missing keys.\n\n\n\n\n\n\n\nName\nDescription\n\n\n\n\nremove_none_values\nRemove null from a dictionary-like obj or list.\n\n\n\n\n\nutils.dict.remove_none_values(obj)\nRemove null from a dictionary-like obj or list.\nThese can appear due to Dataset loading causing schema merge.\nSee https://github.com/axolotl-ai-cloud/axolotl/pull/2909"
},
{
"objectID": "docs/api/utils.dict.html#classes",
@@ -3548,6 +3548,13 @@
"section": "",
"text": "Name\nDescription\n\n\n\n\nDictDefault\nA Dict that returns None instead of returning empty Dict for missing keys.\n\n\n\n\n\nutils.dict.DictDefault()\nA Dict that returns None instead of returning empty Dict for missing keys."
},
+ {
+ "objectID": "docs/api/utils.dict.html#functions",
+ "href": "docs/api/utils.dict.html#functions",
+ "title": "utils.dict",
+ "section": "",
+ "text": "Name\nDescription\n\n\n\n\nremove_none_values\nRemove null from a dictionary-like obj or list.\n\n\n\n\n\nutils.dict.remove_none_values(obj)\nRemove null from a dictionary-like obj or list.\nThese can appear due to Dataset loading causing schema merge.\nSee https://github.com/axolotl-ai-cloud/axolotl/pull/2909"
+ },
{
"objectID": "docs/api/core.builders.rl.html",
"href": "docs/api/core.builders.rl.html",
diff --git a/sitemap.xml b/sitemap.xml
index 06669a5a6..9129d7daa 100644
--- a/sitemap.xml
+++ b/sitemap.xml
@@ -2,766 +2,766 @@
https://docs.axolotl.ai/TODO.html
- 2025-07-30T10:29:12.470Z
+ 2025-07-30T10:49:57.096Zhttps://docs.axolotl.ai/index.html
- 2025-07-30T10:29:12.491Z
+ 2025-07-30T10:49:57.116Zhttps://docs.axolotl.ai/docs/debugging.html
- 2025-07-30T10:29:12.472Z
+ 2025-07-30T10:49:57.098Zhttps://docs.axolotl.ai/docs/amd_hpc.html
- 2025-07-30T10:29:12.471Z
+ 2025-07-30T10:49:57.097Zhttps://docs.axolotl.ai/docs/api/utils.callbacks.mlflow_.html
- 2025-07-30T10:32:25.500Z
+ 2025-07-30T10:53:08.360Zhttps://docs.axolotl.ai/docs/api/monkeypatch.llama_expand_mask.html
- 2025-07-30T10:32:24.931Z
+ 2025-07-30T10:53:07.792Zhttps://docs.axolotl.ai/docs/api/loaders.patch_manager.html
- 2025-07-30T10:32:24.540Z
+ 2025-07-30T10:53:07.402Zhttps://docs.axolotl.ai/docs/api/core.chat.format.llama3x.html
- 2025-07-30T10:32:24.231Z
+ 2025-07-30T10:53:07.095Zhttps://docs.axolotl.ai/docs/api/cli.train.html
- 2025-07-30T10:32:24.285Z
+ 2025-07-30T10:53:07.148Zhttps://docs.axolotl.ai/docs/api/utils.callbacks.perplexity.html
- 2025-07-30T10:32:25.491Z
+ 2025-07-30T10:53:08.351Zhttps://docs.axolotl.ai/docs/api/core.chat.messages.html
- 2025-07-30T10:32:24.228Z
+ 2025-07-30T10:53:07.092Zhttps://docs.axolotl.ai/docs/api/utils.callbacks.lisa.html
- 2025-07-30T10:32:25.496Z
+ 2025-07-30T10:53:08.356Zhttps://docs.axolotl.ai/docs/api/cli.merge_sharded_fsdp_weights.html
- 2025-07-30T10:32:24.371Z
+ 2025-07-30T10:53:07.234Zhttps://docs.axolotl.ai/docs/api/cli.sweeps.html
- 2025-07-30T10:32:24.385Z
+ 2025-07-30T10:53:07.248Zhttps://docs.axolotl.ai/docs/api/utils.chat_templates.html
- 2025-07-30T10:32:25.028Z
+ 2025-07-30T10:53:07.888Zhttps://docs.axolotl.ai/docs/api/core.chat.format.shared.html
- 2025-07-30T10:32:24.233Z
+ 2025-07-30T10:53:07.096Zhttps://docs.axolotl.ai/docs/api/core.trainers.mixins.optimizer.html
- 2025-07-30T10:32:24.547Z
+ 2025-07-30T10:53:07.409Zhttps://docs.axolotl.ai/docs/api/utils.collators.mamba.html
- 2025-07-30T10:32:25.437Z
+ 2025-07-30T10:53:08.300Zhttps://docs.axolotl.ai/docs/api/logging_config.html
- 2025-07-30T10:32:24.177Z
+ 2025-07-30T10:53:07.041Zhttps://docs.axolotl.ai/docs/api/utils.collators.mm_chat.html
- 2025-07-30T10:32:25.441Z
+ 2025-07-30T10:53:08.304Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.completion.html
- 2025-07-30T10:32:24.667Z
+ 2025-07-30T10:53:07.529Zhttps://docs.axolotl.ai/docs/api/kernels.utils.html
- 2025-07-30T10:32:24.881Z
+ 2025-07-30T10:53:07.741Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.dpo.chat_template.html
- 2025-07-30T10:32:24.700Z
+ 2025-07-30T10:53:07.562Zhttps://docs.axolotl.ai/docs/api/kernels.swiglu.html
- 2025-07-30T10:32:24.872Z
+ 2025-07-30T10:53:07.732Zhttps://docs.axolotl.ai/docs/api/common.const.html
- 2025-07-30T10:32:25.397Z
+ 2025-07-30T10:53:08.260Zhttps://docs.axolotl.ai/docs/api/cli.cloud.base.html
- 2025-07-30T10:32:24.427Z
+ 2025-07-30T10:53:07.290Zhttps://docs.axolotl.ai/docs/api/utils.callbacks.comet_.html
- 2025-07-30T10:32:25.503Z
+ 2025-07-30T10:53:08.363Zhttps://docs.axolotl.ai/docs/api/monkeypatch.gradient_checkpointing.offload_cpu.html
- 2025-07-30T10:32:24.994Z
+ 2025-07-30T10:53:07.854Zhttps://docs.axolotl.ai/docs/api/common.architectures.html
- 2025-07-30T10:32:25.396Z
+ 2025-07-30T10:53:08.259Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.pygmalion.html
- 2025-07-30T10:32:24.695Z
+ 2025-07-30T10:53:07.557Zhttps://docs.axolotl.ai/docs/api/utils.schemas.peft.html
- 2025-07-30T10:32:25.202Z
+ 2025-07-30T10:53:08.065Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.dpo.user_defined.html
- 2025-07-30T10:32:24.724Z
+ 2025-07-30T10:53:07.586Zhttps://docs.axolotl.ai/docs/api/utils.schemas.datasets.html
- 2025-07-30T10:32:25.194Z
+ 2025-07-30T10:53:08.057Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.alpaca_w_system.html
- 2025-07-30T10:32:24.640Z
+ 2025-07-30T10:53:07.503Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.base.html
- 2025-07-30T10:32:24.581Z
+ 2025-07-30T10:53:07.444Zhttps://docs.axolotl.ai/docs/api/utils.lora.html
- 2025-07-30T10:32:25.033Z
+ 2025-07-30T10:53:07.893Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.input_output.html
- 2025-07-30T10:32:24.673Z
+ 2025-07-30T10:53:07.535Zhttps://docs.axolotl.ai/docs/api/utils.schemas.trl.html
- 2025-07-30T10:32:25.205Z
+ 2025-07-30T10:53:08.069Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.dpo.zephyr.html
- 2025-07-30T10:32:24.722Z
+ 2025-07-30T10:53:07.584Zhttps://docs.axolotl.ai/docs/api/integrations.kd.trainer.html
- 2025-07-30T10:32:25.384Z
+ 2025-07-30T10:53:08.248Zhttps://docs.axolotl.ai/docs/api/monkeypatch.gradient_checkpointing.offload_disk.html
- 2025-07-30T10:32:25.019Z
+ 2025-07-30T10:53:07.880Zhttps://docs.axolotl.ai/docs/api/utils.optimizers.adopt.html
- 2025-07-30T10:32:25.122Z
+ 2025-07-30T10:53:07.984Zhttps://docs.axolotl.ai/docs/api/monkeypatch.data.batch_dataset_fetcher.html
- 2025-07-30T10:32:24.989Z
+ 2025-07-30T10:53:07.850Zhttps://docs.axolotl.ai/docs/api/cli.cloud.modal_.html
- 2025-07-30T10:32:24.433Z
+ 2025-07-30T10:53:07.296Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.alpaca_chat.html
- 2025-07-30T10:32:24.627Z
+ 2025-07-30T10:53:07.490Zhttps://docs.axolotl.ai/docs/api/utils.freeze.html
- 2025-07-30T10:32:25.049Z
+ 2025-07-30T10:53:07.910Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.bradley_terry.llama3.html
- 2025-07-30T10:32:24.767Z
+ 2025-07-30T10:53:07.629Zhttps://docs.axolotl.ai/docs/api/integrations.base.html
- 2025-07-30T10:32:25.372Z
+ 2025-07-30T10:53:08.236Zhttps://docs.axolotl.ai/docs/api/monkeypatch.unsloth_.html
- 2025-07-30T10:32:24.987Z
+ 2025-07-30T10:53:07.848Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.kto.chatml.html
- 2025-07-30T10:32:24.741Z
+ 2025-07-30T10:53:07.604Zhttps://docs.axolotl.ai/docs/api/cli.main.html
- 2025-07-30T10:32:24.277Z
+ 2025-07-30T10:53:07.140Zhttps://docs.axolotl.ai/docs/api/common.datasets.html
- 2025-07-30T10:32:25.412Z
+ 2025-07-30T10:53:08.275Zhttps://docs.axolotl.ai/docs/api/train.html
- 2025-07-30T10:32:24.092Z
+ 2025-07-30T10:53:06.956Zhttps://docs.axolotl.ai/docs/api/core.trainers.base.html
- 2025-07-30T10:32:24.449Z
+ 2025-07-30T10:53:07.311Zhttps://docs.axolotl.ai/docs/api/core.trainers.mixins.scheduler.html
- 2025-07-30T10:32:24.556Z
+ 2025-07-30T10:53:07.419Zhttps://docs.axolotl.ai/docs/api/utils.ctx_managers.sequence_parallel.html
- 2025-07-30T10:32:24.580Z
+ 2025-07-30T10:53:07.442Zhttps://docs.axolotl.ai/docs/api/utils.schemas.config.html
- 2025-07-30T10:32:25.164Z
+ 2025-07-30T10:53:08.027Zhttps://docs.axolotl.ai/docs/api/loaders.tokenizer.html
- 2025-07-30T10:32:24.522Z
+ 2025-07-30T10:53:07.385Zhttps://docs.axolotl.ai/docs/api/integrations.liger.args.html
- 2025-07-30T10:32:25.388Z
+ 2025-07-30T10:53:08.251Zhttps://docs.axolotl.ai/docs/api/cli.config.html
- 2025-07-30T10:32:24.337Z
+ 2025-07-30T10:53:07.200Zhttps://docs.axolotl.ai/docs/api/loaders.processor.html
- 2025-07-30T10:32:24.524Z
+ 2025-07-30T10:53:07.387Zhttps://docs.axolotl.ai/docs/api/monkeypatch.utils.html
- 2025-07-30T10:32:24.967Z
+ 2025-07-30T10:53:07.828Zhttps://docs.axolotl.ai/docs/api/integrations.cut_cross_entropy.args.html
- 2025-07-30T10:32:25.376Z
+ 2025-07-30T10:53:08.239Zhttps://docs.axolotl.ai/docs/api/core.trainers.dpo.trainer.html
- 2025-07-30T10:32:24.480Z
+ 2025-07-30T10:53:07.343Zhttps://docs.axolotl.ai/docs/api/loaders.adapter.html
- 2025-07-30T10:32:24.529Z
+ 2025-07-30T10:53:07.392Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.dpo.llama3.html
- 2025-07-30T10:32:24.711Z
+ 2025-07-30T10:53:07.573Zhttps://docs.axolotl.ai/docs/api/core.datasets.transforms.chat_builder.html
- 2025-07-30T10:32:24.245Z
+ 2025-07-30T10:53:07.109Zhttps://docs.axolotl.ai/docs/api/monkeypatch.transformers_fa_utils.html
- 2025-07-30T10:32:24.986Z
+ 2025-07-30T10:53:07.847Zhttps://docs.axolotl.ai/docs/api/datasets.html
- 2025-07-30T10:32:24.113Z
+ 2025-07-30T10:53:06.977Zhttps://docs.axolotl.ai/docs/api/monkeypatch.btlm_attn_hijack_flash.html
- 2025-07-30T10:32:24.969Z
+ 2025-07-30T10:53:07.830Zhttps://docs.axolotl.ai/docs/api/cli.inference.html
- 2025-07-30T10:32:24.351Z
+ 2025-07-30T10:53:07.214Zhttps://docs.axolotl.ai/docs/api/cli.utils.html
- 2025-07-30T10:32:24.417Z
+ 2025-07-30T10:53:07.280Zhttps://docs.axolotl.ai/docs/api/cli.preprocess.html
- 2025-07-30T10:32:24.379Z
+ 2025-07-30T10:53:07.242Zhttps://docs.axolotl.ai/docs/api/loaders.model.html
- 2025-07-30T10:32:24.514Z
+ 2025-07-30T10:53:07.377Zhttps://docs.axolotl.ai/docs/api/core.trainers.grpo.sampler.html
- 2025-07-30T10:32:24.503Z
+ 2025-07-30T10:53:07.366Zhttps://docs.axolotl.ai/docs/api/core.trainers.grpo.trainer.html
- 2025-07-30T10:32:24.491Z
+ 2025-07-30T10:53:07.354Zhttps://docs.axolotl.ai/docs/batch_vs_grad.html
- 2025-07-30T10:29:12.471Z
+ 2025-07-30T10:49:57.097Zhttps://docs.axolotl.ai/docs/custom_integrations.html
- 2025-07-30T10:29:12.471Z
+ 2025-07-30T10:49:57.097Zhttps://docs.axolotl.ai/docs/quantize.html
- 2025-07-30T10:29:12.475Z
+ 2025-07-30T10:49:57.101Zhttps://docs.axolotl.ai/docs/installation.html
- 2025-07-30T10:29:12.475Z
+ 2025-07-30T10:49:57.100Zhttps://docs.axolotl.ai/docs/docker.html
- 2025-07-30T10:29:12.472Z
+ 2025-07-30T10:49:57.098Zhttps://docs.axolotl.ai/docs/reward_modelling.html
- 2025-07-30T10:29:12.476Z
+ 2025-07-30T10:49:57.101Zhttps://docs.axolotl.ai/docs/dataset-formats/inst_tune.html
- 2025-07-30T10:29:12.472Z
+ 2025-07-30T10:49:57.097Zhttps://docs.axolotl.ai/docs/dataset-formats/conversation.html
- 2025-07-30T10:29:12.471Z
+ 2025-07-30T10:49:57.097Zhttps://docs.axolotl.ai/docs/dataset-formats/tokenized.html
- 2025-07-30T10:29:12.472Z
+ 2025-07-30T10:49:57.097Zhttps://docs.axolotl.ai/docs/rlhf.html
- 2025-07-30T10:29:12.476Z
+ 2025-07-30T10:49:57.101Zhttps://docs.axolotl.ai/docs/sequence_parallelism.html
- 2025-07-30T10:29:12.476Z
+ 2025-07-30T10:49:57.101Zhttps://docs.axolotl.ai/docs/inference.html
- 2025-07-30T10:29:12.475Z
+ 2025-07-30T10:49:57.100Zhttps://docs.axolotl.ai/docs/fsdp_qlora.html
- 2025-07-30T10:29:12.472Z
+ 2025-07-30T10:49:57.098Zhttps://docs.axolotl.ai/docs/multi-node.html
- 2025-07-30T10:29:12.475Z
+ 2025-07-30T10:49:57.101Zhttps://docs.axolotl.ai/docs/lora_optims.html
- 2025-07-30T10:29:12.475Z
+ 2025-07-30T10:49:57.101Zhttps://docs.axolotl.ai/docs/getting-started.html
- 2025-07-30T10:29:12.472Z
+ 2025-07-30T10:49:57.098Zhttps://docs.axolotl.ai/docs/dataset_loading.html
- 2025-07-30T10:29:12.472Z
+ 2025-07-30T10:49:57.097Zhttps://docs.axolotl.ai/docs/lr_groups.html
- 2025-07-30T10:29:12.475Z
+ 2025-07-30T10:49:57.101Zhttps://docs.axolotl.ai/docs/mac.html
- 2025-07-30T10:29:12.475Z
+ 2025-07-30T10:49:57.101Zhttps://docs.axolotl.ai/src/axolotl/integrations/cut_cross_entropy/ACKNOWLEDGEMENTS.html
- 2025-07-30T10:29:12.495Z
+ 2025-07-30T10:49:57.120Zhttps://docs.axolotl.ai/src/axolotl/integrations/LICENSE.html
- 2025-07-30T10:29:12.495Z
+ 2025-07-30T10:49:57.120Zhttps://docs.axolotl.ai/docs/input_output.html
- 2025-07-30T10:29:12.475Z
+ 2025-07-30T10:49:57.100Zhttps://docs.axolotl.ai/docs/gradient_checkpointing.html
- 2025-07-30T10:29:12.472Z
+ 2025-07-30T10:49:57.098Zhttps://docs.axolotl.ai/docs/qat.html
- 2025-07-30T10:29:12.475Z
+ 2025-07-30T10:49:57.101Zhttps://docs.axolotl.ai/docs/faq.html
- 2025-07-30T10:29:12.472Z
+ 2025-07-30T10:49:57.098Zhttps://docs.axolotl.ai/docs/dataset_preprocessing.html
- 2025-07-30T10:29:12.472Z
+ 2025-07-30T10:49:57.097Zhttps://docs.axolotl.ai/docs/nccl.html
- 2025-07-30T10:29:12.475Z
+ 2025-07-30T10:49:57.101Zhttps://docs.axolotl.ai/docs/cli.html
- 2025-07-30T10:29:12.471Z
+ 2025-07-30T10:49:57.097Zhttps://docs.axolotl.ai/docs/torchao.html
- 2025-07-30T10:29:12.476Z
+ 2025-07-30T10:49:57.101Zhttps://docs.axolotl.ai/docs/multi-gpu.html
- 2025-07-30T10:29:12.475Z
+ 2025-07-30T10:49:57.101Zhttps://docs.axolotl.ai/docs/dataset-formats/pretraining.html
- 2025-07-30T10:29:12.472Z
+ 2025-07-30T10:49:57.097Zhttps://docs.axolotl.ai/docs/dataset-formats/index.html
- 2025-07-30T10:29:12.472Z
+ 2025-07-30T10:49:57.097Zhttps://docs.axolotl.ai/docs/dataset-formats/template_free.html
- 2025-07-30T10:29:12.472Z
+ 2025-07-30T10:49:57.097Zhttps://docs.axolotl.ai/docs/dataset-formats/stepwise_supervised.html
- 2025-07-30T10:29:12.472Z
+ 2025-07-30T10:49:57.097Zhttps://docs.axolotl.ai/docs/ray-integration.html
- 2025-07-30T10:29:12.476Z
+ 2025-07-30T10:49:57.101Zhttps://docs.axolotl.ai/docs/unsloth.html
- 2025-07-30T10:29:12.476Z
+ 2025-07-30T10:49:57.101Zhttps://docs.axolotl.ai/docs/mixed_precision.html
- 2025-07-30T10:29:12.475Z
+ 2025-07-30T10:49:57.101Zhttps://docs.axolotl.ai/docs/multimodal.html
- 2025-07-30T10:29:12.475Z
+ 2025-07-30T10:49:57.101Zhttps://docs.axolotl.ai/docs/config-reference.html
- 2025-07-30T10:32:38.090Z
+ 2025-07-30T10:53:21.071Zhttps://docs.axolotl.ai/docs/api/prompt_tokenizers.html
- 2025-07-30T10:32:24.168Z
+ 2025-07-30T10:53:07.032Zhttps://docs.axolotl.ai/docs/api/utils.schedulers.html
- 2025-07-30T10:32:25.091Z
+ 2025-07-30T10:53:07.952Zhttps://docs.axolotl.ai/docs/api/utils.samplers.multipack.html
- 2025-07-30T10:32:25.484Z
+ 2025-07-30T10:53:08.345Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.orcamini.html
- 2025-07-30T10:32:24.688Z
+ 2025-07-30T10:53:07.550Zhttps://docs.axolotl.ai/docs/api/core.trainers.trl.html
- 2025-07-30T10:32:24.463Z
+ 2025-07-30T10:53:07.326Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.metharme.html
- 2025-07-30T10:32:24.684Z
+ 2025-07-30T10:53:07.547Zhttps://docs.axolotl.ai/docs/api/utils.callbacks.profiler.html
- 2025-07-30T10:32:25.495Z
+ 2025-07-30T10:53:08.355Zhttps://docs.axolotl.ai/docs/api/utils.schemas.enums.html
- 2025-07-30T10:32:25.233Z
+ 2025-07-30T10:53:08.096Zhttps://docs.axolotl.ai/docs/api/core.trainers.mamba.html
- 2025-07-30T10:32:24.469Z
+ 2025-07-30T10:53:07.332Zhttps://docs.axolotl.ai/docs/api/monkeypatch.llama_attn_hijack_flash.html
- 2025-07-30T10:32:24.906Z
+ 2025-07-30T10:53:07.766Zhttps://docs.axolotl.ai/docs/api/monkeypatch.relora.html
- 2025-07-30T10:32:24.930Z
+ 2025-07-30T10:53:07.790Zhttps://docs.axolotl.ai/docs/api/monkeypatch.stablelm_attn_hijack_flash.html
- 2025-07-30T10:32:24.976Z
+ 2025-07-30T10:53:07.837Zhttps://docs.axolotl.ai/docs/api/loaders.constants.html
- 2025-07-30T10:32:24.541Z
+ 2025-07-30T10:53:07.404Zhttps://docs.axolotl.ai/docs/api/utils.callbacks.qat.html
- 2025-07-30T10:32:25.510Z
+ 2025-07-30T10:53:08.370Zhttps://docs.axolotl.ai/docs/api/utils.schemas.model.html
- 2025-07-30T10:32:25.171Z
+ 2025-07-30T10:53:08.034Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.stepwise_supervised.html
- 2025-07-30T10:32:24.677Z
+ 2025-07-30T10:53:07.540Zhttps://docs.axolotl.ai/docs/api/integrations.grokfast.optimizer.html
- 2025-07-30T10:32:25.377Z
+ 2025-07-30T10:53:08.240Zhttps://docs.axolotl.ai/docs/api/convert.html
- 2025-07-30T10:32:24.126Z
+ 2025-07-30T10:53:06.990Zhttps://docs.axolotl.ai/docs/api/kernels.quantize.html
- 2025-07-30T10:32:24.879Z
+ 2025-07-30T10:53:07.740Zhttps://docs.axolotl.ai/docs/api/core.training_args.html
- 2025-07-30T10:32:24.205Z
+ 2025-07-30T10:53:07.069Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.chat_template.html
- 2025-07-30T10:32:24.614Z
+ 2025-07-30T10:53:07.476Zhttps://docs.axolotl.ai/docs/api/index.html
- 2025-07-30T10:32:24.030Z
+ 2025-07-30T10:53:06.895Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.llama2_chat.html
- 2025-07-30T10:32:24.661Z
+ 2025-07-30T10:53:07.523Zhttps://docs.axolotl.ai/docs/api/utils.trainer.html
- 2025-07-30T10:32:25.067Z
+ 2025-07-30T10:53:07.927Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.messages.chat.html
- 2025-07-30T10:32:24.699Z
+ 2025-07-30T10:53:07.561Zhttps://docs.axolotl.ai/docs/api/monkeypatch.lora_kernels.html
- 2025-07-30T10:32:24.960Z
+ 2025-07-30T10:53:07.820Zhttps://docs.axolotl.ai/docs/api/kernels.lora.html
- 2025-07-30T10:32:24.851Z
+ 2025-07-30T10:53:07.712Zhttps://docs.axolotl.ai/docs/api/cli.vllm_serve.html
- 2025-07-30T10:32:24.424Z
+ 2025-07-30T10:53:07.287Zhttps://docs.axolotl.ai/docs/api/utils.schemas.multimodal.html
- 2025-07-30T10:32:25.211Z
+ 2025-07-30T10:53:08.074Zhttps://docs.axolotl.ai/docs/api/utils.schemas.utils.html
- 2025-07-30T10:32:25.239Z
+ 2025-07-30T10:53:08.102Zhttps://docs.axolotl.ai/docs/api/monkeypatch.llama_attn_hijack_xformers.html
- 2025-07-30T10:32:24.907Z
+ 2025-07-30T10:53:07.768Zhttps://docs.axolotl.ai/docs/api/integrations.lm_eval.args.html
- 2025-07-30T10:32:25.391Z
+ 2025-07-30T10:53:08.254Zhttps://docs.axolotl.ai/docs/api/monkeypatch.mistral_attn_hijack_flash.html
- 2025-07-30T10:32:24.922Z
+ 2025-07-30T10:53:07.782Zhttps://docs.axolotl.ai/docs/api/utils.collators.core.html
- 2025-07-30T10:32:25.414Z
+ 2025-07-30T10:53:08.278Zhttps://docs.axolotl.ai/docs/api/core.chat.format.chatml.html
- 2025-07-30T10:32:24.230Z
+ 2025-07-30T10:53:07.093Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.dpo.passthrough.html
- 2025-07-30T10:32:24.725Z
+ 2025-07-30T10:53:07.587Zhttps://docs.axolotl.ai/docs/api/core.datasets.chat.html
- 2025-07-30T10:32:24.238Z
+ 2025-07-30T10:53:07.101Zhttps://docs.axolotl.ai/docs/api/utils.bench.html
- 2025-07-30T10:32:25.042Z
+ 2025-07-30T10:53:07.903Zhttps://docs.axolotl.ai/docs/api/utils.schemas.training.html
- 2025-07-30T10:32:25.176Z
+ 2025-07-30T10:53:08.039Zhttps://docs.axolotl.ai/docs/api/utils.collators.batching.html
- 2025-07-30T10:32:25.433Z
+ 2025-07-30T10:53:08.296Zhttps://docs.axolotl.ai/docs/api/monkeypatch.llama_patch_multipack.html
- 2025-07-30T10:32:24.970Z
+ 2025-07-30T10:53:07.831Zhttps://docs.axolotl.ai/docs/api/monkeypatch.multipack.html
- 2025-07-30T10:32:24.923Z
+ 2025-07-30T10:53:07.784Zhttps://docs.axolotl.ai/docs/api/core.builders.causal.html
- 2025-07-30T10:32:24.188Z
+ 2025-07-30T10:53:07.052Zhttps://docs.axolotl.ai/docs/api/cli.evaluate.html
- 2025-07-30T10:32:24.293Z
+ 2025-07-30T10:53:07.156Zhttps://docs.axolotl.ai/docs/api/monkeypatch.trainer_fsdp_optim.html
- 2025-07-30T10:32:24.979Z
+ 2025-07-30T10:53:07.840Zhttps://docs.axolotl.ai/docs/api/core.trainers.utils.html
- 2025-07-30T10:32:24.505Z
+ 2025-07-30T10:53:07.367Zhttps://docs.axolotl.ai/docs/api/utils.schemas.integrations.html
- 2025-07-30T10:32:25.223Z
+ 2025-07-30T10:53:08.086Zhttps://docs.axolotl.ai/docs/api/utils.dict.html
- 2025-07-30T10:32:25.114Z
+ 2025-07-30T10:53:07.977Zhttps://docs.axolotl.ai/docs/api/core.builders.rl.html
- 2025-07-30T10:32:24.193Z
+ 2025-07-30T10:53:07.056Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.orpo.chat_template.html
- 2025-07-30T10:32:24.763Z
+ 2025-07-30T10:53:07.625Zhttps://docs.axolotl.ai/docs/api/core.trainers.relora.html
- 2025-07-30T10:32:24.474Z
+ 2025-07-30T10:53:07.336Zhttps://docs.axolotl.ai/docs/api/integrations.spectrum.args.html
- 2025-07-30T10:32:25.394Z
+ 2025-07-30T10:53:08.257Zhttps://docs.axolotl.ai/docs/api/cli.quantize.html
- 2025-07-30T10:32:24.438Z
+ 2025-07-30T10:53:07.301Zhttps://docs.axolotl.ai/docs/api/cli.checks.html
- 2025-07-30T10:32:24.319Z
+ 2025-07-30T10:53:07.182Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.kto.llama3.html
- 2025-07-30T10:32:24.733Z
+ 2025-07-30T10:53:07.595Zhttps://docs.axolotl.ai/docs/api/utils.model_shard_quant.html
- 2025-07-30T10:32:25.038Z
+ 2025-07-30T10:53:07.899Zhttps://docs.axolotl.ai/docs/api/utils.quantization.html
- 2025-07-30T10:32:25.150Z
+ 2025-07-30T10:53:08.013Zhttps://docs.axolotl.ai/docs/api/core.trainers.mixins.rng_state_loader.html
- 2025-07-30T10:32:24.550Z
+ 2025-07-30T10:53:07.412Zhttps://docs.axolotl.ai/docs/api/kernels.geglu.html
- 2025-07-30T10:32:24.862Z
+ 2025-07-30T10:53:07.722Zhttps://docs.axolotl.ai/docs/api/utils.data.pretraining.html
- 2025-07-30T10:32:25.123Z
+ 2025-07-30T10:53:07.986Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.kto.user_defined.html
- 2025-07-30T10:32:24.743Z
+ 2025-07-30T10:53:07.605Zhttps://docs.axolotl.ai/docs/api/core.builders.base.html
- 2025-07-30T10:32:24.183Z
+ 2025-07-30T10:53:07.047Zhttps://docs.axolotl.ai/docs/api/cli.merge_lora.html
- 2025-07-30T10:32:24.359Z
+ 2025-07-30T10:53:07.222Zhttps://docs.axolotl.ai/docs/api/monkeypatch.mixtral.html
- 2025-07-30T10:32:24.990Z
+ 2025-07-30T10:53:07.851Zhttps://docs.axolotl.ai/docs/api/utils.data.sft.html
- 2025-07-30T10:32:25.130Z
+ 2025-07-30T10:53:07.993Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.user_defined.html
- 2025-07-30T10:32:24.648Z
+ 2025-07-30T10:53:07.511Zhttps://docs.axolotl.ai/docs/api/utils.tokenization.html
- 2025-07-30T10:32:25.026Z
+ 2025-07-30T10:53:07.887Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.dpo.chatml.html
- 2025-07-30T10:32:24.721Z
+ 2025-07-30T10:53:07.583Zhttps://docs.axolotl.ai/docs/api/models.mamba.modeling_mamba.html
- 2025-07-30T10:32:25.413Z
+ 2025-07-30T10:53:08.276Zhttps://docs.axolotl.ai/docs/api/cli.args.html
- 2025-07-30T10:32:24.313Z
+ 2025-07-30T10:53:07.176Zhttps://docs.axolotl.ai/docs/api/evaluate.html
- 2025-07-30T10:32:24.102Z
+ 2025-07-30T10:53:06.966Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.alpaca_instruct.html
- 2025-07-30T10:32:24.629Z
+ 2025-07-30T10:53:07.491Zhttps://docs.axolotl.ai/docs/api/utils.distributed.html
- 2025-07-30T10:32:25.111Z
+ 2025-07-30T10:53:07.971Zhttps://docs.axolotl.ai/docs/multipack.html
- 2025-07-30T10:29:12.475Z
+ 2025-07-30T10:49:57.101Zhttps://docs.axolotl.ai/examples/colab-notebooks/colab-axolotl-example.html
- 2025-07-30T10:29:12.480Z
+ 2025-07-30T10:49:57.105Zhttps://docs.axolotl.ai/FAQS.html
- 2025-07-30T10:29:12.470Z
+ 2025-07-30T10:49:57.095Z