Remove null from a dictionary-like obj or list.
+These can appear due to Dataset loading causing schema merge.
+See https://github.com/axolotl-ai-cloud/axolotl/pull/2909
%%capture# This step can take ~5-10 minutes to install dependencies!pip install --no-build-isolation axolotl[flash-attn]>=0.9.1
-!pip install "cut-cross-entropy[transformers] @ git+https://github.com/axolotl-ai-cloud/ml-cross-entropy.git@631d646"
2025/07: Voxtral with mistral-common tokenizer support has been integrated in Axolotl. Read the docs!
2025/07: TiledMLP support for single-GPU to multi-GPU training with DDP, DeepSpeed and FSDP support has been added to support Arctic Long Sequence Training. (ALST). See examples for using ALST with Axolotl!
2025/06: Magistral with mistral-common tokenizer support has been added to Axolotl. See examples to start training your own Magistral models with Axolotl!
2025/05: Quantization Aware Training (QAT) support has been added to Axolotl. Explore the docs to learn more!
diff --git a/search.json b/search.json
index d3f20c606..212ce1243 100644
--- a/search.json
+++ b/search.json
@@ -18,7 +18,7 @@
"href": "index.html#latest-updates",
"title": "Axolotl",
"section": "🎉 Latest Updates",
- "text": "🎉 Latest Updates\n\n2025/07: TiledMLP support for single-GPU to multi-GPU training with DDP, DeepSpeed and FSDP support has been added to support Arctic Long Sequence Training. (ALST). See examples for using ALST with Axolotl!\n2025/06: Magistral with mistral-common tokenizer support has been added to Axolotl. See examples to start training your own Magistral models with Axolotl!\n2025/05: Quantization Aware Training (QAT) support has been added to Axolotl. Explore the docs to learn more!\n2025/04: Llama 4 support has been added in Axolotl. See examples to start training your own Llama 4 models with Axolotl’s linearized version!\n2025/03: Axolotl has implemented Sequence Parallelism (SP) support. Read the blog and docs to learn how to scale your context length when fine-tuning.\n2025/03: (Beta) Fine-tuning Multimodal models is now supported in Axolotl. Check out the docs to fine-tune your own!\n2025/02: Axolotl has added LoRA optimizations to reduce memory usage and improve training speed for LoRA and QLoRA in single GPU and multi-GPU training (DDP and DeepSpeed). Jump into the docs to give it a try.\n2025/02: Axolotl has added GRPO support. Dive into our blog and GRPO example and have some fun!\n2025/01: Axolotl has added Reward Modelling / Process Reward Modelling fine-tuning support. See docs.",
+ "text": "🎉 Latest Updates\n\n2025/07: Voxtral with mistral-common tokenizer support has been integrated in Axolotl. Read the docs!\n2025/07: TiledMLP support for single-GPU to multi-GPU training with DDP, DeepSpeed and FSDP support has been added to support Arctic Long Sequence Training. (ALST). See examples for using ALST with Axolotl!\n2025/06: Magistral with mistral-common tokenizer support has been added to Axolotl. See examples to start training your own Magistral models with Axolotl!\n2025/05: Quantization Aware Training (QAT) support has been added to Axolotl. Explore the docs to learn more!\n2025/04: Llama 4 support has been added in Axolotl. See examples to start training your own Llama 4 models with Axolotl’s linearized version!\n2025/03: Axolotl has implemented Sequence Parallelism (SP) support. Read the blog and docs to learn how to scale your context length when fine-tuning.\n2025/03: (Beta) Fine-tuning Multimodal models is now supported in Axolotl. Check out the docs to fine-tune your own!\n2025/02: Axolotl has added LoRA optimizations to reduce memory usage and improve training speed for LoRA and QLoRA in single GPU and multi-GPU training (DDP and DeepSpeed). Jump into the docs to give it a try.\n2025/02: Axolotl has added GRPO support. Dive into our blog and GRPO example and have some fun!\n2025/01: Axolotl has added Reward Modelling / Process Reward Modelling fine-tuning support. See docs.",
"crumbs": [
"Home"
]
@@ -1153,7 +1153,7 @@
"href": "docs/custom_integrations.html#cut-cross-entropy",
"title": "Custom Integrations",
"section": "Cut Cross Entropy",
- "text": "Cut Cross Entropy\nCut Cross Entropy (CCE) reduces VRAM usage through optimization on the cross-entropy operation during loss calculation.\nSee https://github.com/apple/ml-cross-entropy\n\nRequirements\n\nPyTorch 2.4.0 or higher\n\n\n\nInstallation\nRun the following command to install cut_cross_entropy[transformers] if you don’t have it already.\n\nIf you are in dev environment\n\npython scripts/cutcrossentropy_install.py | sh\n\nIf you are installing from pip\n\npip3 uninstall -y cut-cross-entropy && pip3 install \"cut-cross-entropy[transformers] @ git+https://github.com/axolotl-ai-cloud/ml-cross-entropy.git@631d646\"\n\n\nUsage\nplugins:\n - axolotl.integrations.cut_cross_entropy.CutCrossEntropyPlugin\n\n\nSupported Models\n\ncohere\ncohere2\ngemma\ngemma2\ngemma3\ngemma3_text\ngemma3n\ngemma3n_text\nglm\nglm4\nllama\nllama4\nllama4_text\nmistral\nmistral3\nmllama\nphi\nphi3\nphi4_multimodal\nqwen2\nqwen2_vl\nqwen2_moe\nqwen2_5_vl\nqwen3\nqwen3_moe\n\n\n\nCitation\n@article{wijmans2024cut,\n author = {Erik Wijmans and\n Brody Huval and\n Alexander Hertzberg and\n Vladlen Koltun and\n Philipp Kr\\\"ahenb\\\"uhl},\n title = {Cut Your Losses in Large-Vocabulary Language Models},\n journal = {arXiv},\n year = {2024},\n url = {https://arxiv.org/abs/2411.09009},\n}\nPlease see reference here",
+ "text": "Cut Cross Entropy\nCut Cross Entropy (CCE) reduces VRAM usage through optimization on the cross-entropy operation during loss calculation.\nSee https://github.com/apple/ml-cross-entropy\n\nRequirements\n\nPyTorch 2.4.0 or higher\n\n\n\nInstallation\nRun the following command to install cut_cross_entropy[transformers] if you don’t have it already.\n\nIf you are in dev environment\n\npython scripts/cutcrossentropy_install.py | sh\n\nIf you are installing from pip\n\npip3 uninstall -y cut-cross-entropy && pip3 install \"cut-cross-entropy[transformers] @ git+https://github.com/axolotl-ai-cloud/ml-cross-entropy.git@010c3ac3f1e725098961832830303eeb4142dd88\"\n\n\nUsage\nplugins:\n - axolotl.integrations.cut_cross_entropy.CutCrossEntropyPlugin\n\n\nSupported Models\n\ncohere\ncohere2\ngemma\ngemma2\ngemma3\ngemma3_text\ngemma3n\ngemma3n_text\nglm\nglm4\nllama\nllama4\nllama4_text\nmistral\nmistral3\nmllama\nphi\nphi3\nphi4_multimodal\nqwen2\nqwen2_vl\nqwen2_moe\nqwen2_5_vl\nqwen3\nqwen3_moe\n\n\n\nCitation\n@article{wijmans2024cut,\n author = {Erik Wijmans and\n Brody Huval and\n Alexander Hertzberg and\n Vladlen Koltun and\n Philipp Kr\\\"ahenb\\\"uhl},\n title = {Cut Your Losses in Large-Vocabulary Language Models},\n journal = {arXiv},\n year = {2024},\n url = {https://arxiv.org/abs/2411.09009},\n}\nPlease see reference here",
"crumbs": [
"Advanced Features",
"Custom Integrations"
@@ -3539,7 +3539,7 @@
"href": "docs/api/utils.dict.html",
"title": "utils.dict",
"section": "",
- "text": "utils.dict\nModule containing the DictDefault class\n\n\n\n\n\nName\nDescription\n\n\n\n\nDictDefault\nA Dict that returns None instead of returning empty Dict for missing keys.\n\n\n\n\n\nutils.dict.DictDefault()\nA Dict that returns None instead of returning empty Dict for missing keys."
+ "text": "utils.dict\nModule containing the DictDefault class\n\n\n\n\n\nName\nDescription\n\n\n\n\nDictDefault\nA Dict that returns None instead of returning empty Dict for missing keys.\n\n\n\n\n\nutils.dict.DictDefault()\nA Dict that returns None instead of returning empty Dict for missing keys.\n\n\n\n\n\n\n\nName\nDescription\n\n\n\n\nremove_none_values\nRemove null from a dictionary-like obj or list.\n\n\n\n\n\nutils.dict.remove_none_values(obj)\nRemove null from a dictionary-like obj or list.\nThese can appear due to Dataset loading causing schema merge.\nSee https://github.com/axolotl-ai-cloud/axolotl/pull/2909"
},
{
"objectID": "docs/api/utils.dict.html#classes",
@@ -3548,6 +3548,13 @@
"section": "",
"text": "Name\nDescription\n\n\n\n\nDictDefault\nA Dict that returns None instead of returning empty Dict for missing keys.\n\n\n\n\n\nutils.dict.DictDefault()\nA Dict that returns None instead of returning empty Dict for missing keys."
},
+ {
+ "objectID": "docs/api/utils.dict.html#functions",
+ "href": "docs/api/utils.dict.html#functions",
+ "title": "utils.dict",
+ "section": "",
+ "text": "Name\nDescription\n\n\n\n\nremove_none_values\nRemove null from a dictionary-like obj or list.\n\n\n\n\n\nutils.dict.remove_none_values(obj)\nRemove null from a dictionary-like obj or list.\nThese can appear due to Dataset loading causing schema merge.\nSee https://github.com/axolotl-ai-cloud/axolotl/pull/2909"
+ },
{
"objectID": "docs/api/core.builders.rl.html",
"href": "docs/api/core.builders.rl.html",
diff --git a/sitemap.xml b/sitemap.xml
index 2ac2fd115..1d55d6b64 100644
--- a/sitemap.xml
+++ b/sitemap.xml
@@ -2,766 +2,766 @@
https://docs.axolotl.ai/TODO.html
- 2025-07-27T21:05:22.821Z
+ 2025-07-30T08:57:15.648Zhttps://docs.axolotl.ai/index.html
- 2025-07-27T21:05:22.842Z
+ 2025-07-30T08:57:15.669Zhttps://docs.axolotl.ai/docs/debugging.html
- 2025-07-27T21:05:22.823Z
+ 2025-07-30T08:57:15.650Zhttps://docs.axolotl.ai/docs/amd_hpc.html
- 2025-07-27T21:05:22.822Z
+ 2025-07-30T08:57:15.649Zhttps://docs.axolotl.ai/docs/api/utils.callbacks.mlflow_.html
- 2025-07-27T21:08:44.217Z
+ 2025-07-30T09:00:36.541Zhttps://docs.axolotl.ai/docs/api/monkeypatch.llama_expand_mask.html
- 2025-07-27T21:08:43.649Z
+ 2025-07-30T09:00:35.952Zhttps://docs.axolotl.ai/docs/api/loaders.patch_manager.html
- 2025-07-27T21:08:43.258Z
+ 2025-07-30T09:00:35.546Zhttps://docs.axolotl.ai/docs/api/core.chat.format.llama3x.html
- 2025-07-27T21:08:42.950Z
+ 2025-07-30T09:00:35.225Zhttps://docs.axolotl.ai/docs/api/cli.train.html
- 2025-07-27T21:08:43.005Z
+ 2025-07-30T09:00:35.282Zhttps://docs.axolotl.ai/docs/api/utils.callbacks.perplexity.html
- 2025-07-27T21:08:44.208Z
+ 2025-07-30T09:00:36.531Zhttps://docs.axolotl.ai/docs/api/core.chat.messages.html
- 2025-07-27T21:08:42.947Z
+ 2025-07-30T09:00:35.222Zhttps://docs.axolotl.ai/docs/api/utils.callbacks.lisa.html
- 2025-07-27T21:08:44.213Z
+ 2025-07-30T09:00:36.537Zhttps://docs.axolotl.ai/docs/api/cli.merge_sharded_fsdp_weights.html
- 2025-07-27T21:08:43.091Z
+ 2025-07-30T09:00:35.372Zhttps://docs.axolotl.ai/docs/api/cli.sweeps.html
- 2025-07-27T21:08:43.105Z
+ 2025-07-30T09:00:35.386Zhttps://docs.axolotl.ai/docs/api/utils.chat_templates.html
- 2025-07-27T21:08:43.746Z
+ 2025-07-30T09:00:36.052Zhttps://docs.axolotl.ai/docs/api/core.chat.format.shared.html
- 2025-07-27T21:08:42.951Z
+ 2025-07-30T09:00:35.227Zhttps://docs.axolotl.ai/docs/api/core.trainers.mixins.optimizer.html
- 2025-07-27T21:08:43.265Z
+ 2025-07-30T09:00:35.554Zhttps://docs.axolotl.ai/docs/api/utils.collators.mamba.html
- 2025-07-27T21:08:44.157Z
+ 2025-07-30T09:00:36.479Zhttps://docs.axolotl.ai/docs/api/logging_config.html
- 2025-07-27T21:08:42.896Z
+ 2025-07-30T09:00:35.169Zhttps://docs.axolotl.ai/docs/api/utils.collators.mm_chat.html
- 2025-07-27T21:08:44.161Z
+ 2025-07-30T09:00:36.483Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.completion.html
- 2025-07-27T21:08:43.385Z
+ 2025-07-30T09:00:35.677Zhttps://docs.axolotl.ai/docs/api/kernels.utils.html
- 2025-07-27T21:08:43.598Z
+ 2025-07-30T09:00:35.898Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.dpo.chat_template.html
- 2025-07-27T21:08:43.418Z
+ 2025-07-30T09:00:35.712Zhttps://docs.axolotl.ai/docs/api/kernels.swiglu.html
- 2025-07-27T21:08:43.589Z
+ 2025-07-30T09:00:35.889Zhttps://docs.axolotl.ai/docs/api/common.const.html
- 2025-07-27T21:08:44.117Z
+ 2025-07-30T09:00:36.438Zhttps://docs.axolotl.ai/docs/api/cli.cloud.base.html
- 2025-07-27T21:08:43.147Z
+ 2025-07-30T09:00:35.430Zhttps://docs.axolotl.ai/docs/api/utils.callbacks.comet_.html
- 2025-07-27T21:08:44.220Z
+ 2025-07-30T09:00:36.545Zhttps://docs.axolotl.ai/docs/api/monkeypatch.gradient_checkpointing.offload_cpu.html
- 2025-07-27T21:08:43.712Z
+ 2025-07-30T09:00:36.017Zhttps://docs.axolotl.ai/docs/api/common.architectures.html
- 2025-07-27T21:08:44.116Z
+ 2025-07-30T09:00:36.436Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.pygmalion.html
- 2025-07-27T21:08:43.413Z
+ 2025-07-30T09:00:35.706Zhttps://docs.axolotl.ai/docs/api/utils.schemas.peft.html
- 2025-07-27T21:08:43.921Z
+ 2025-07-30T09:00:36.236Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.dpo.user_defined.html
- 2025-07-27T21:08:43.442Z
+ 2025-07-30T09:00:35.736Zhttps://docs.axolotl.ai/docs/api/utils.schemas.datasets.html
- 2025-07-27T21:08:43.912Z
+ 2025-07-30T09:00:36.228Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.alpaca_w_system.html
- 2025-07-27T21:08:43.359Z
+ 2025-07-30T09:00:35.650Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.base.html
- 2025-07-27T21:08:43.300Z
+ 2025-07-30T09:00:35.590Zhttps://docs.axolotl.ai/docs/api/utils.lora.html
- 2025-07-27T21:08:43.751Z
+ 2025-07-30T09:00:36.057Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.input_output.html
- 2025-07-27T21:08:43.391Z
+ 2025-07-30T09:00:35.683Zhttps://docs.axolotl.ai/docs/api/utils.schemas.trl.html
- 2025-07-27T21:08:43.924Z
+ 2025-07-30T09:00:36.240Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.dpo.zephyr.html
- 2025-07-27T21:08:43.440Z
+ 2025-07-30T09:00:35.734Zhttps://docs.axolotl.ai/docs/api/integrations.kd.trainer.html
- 2025-07-27T21:08:44.104Z
+ 2025-07-30T09:00:36.425Zhttps://docs.axolotl.ai/docs/api/monkeypatch.gradient_checkpointing.offload_disk.html
- 2025-07-27T21:08:43.738Z
+ 2025-07-30T09:00:36.043Zhttps://docs.axolotl.ai/docs/api/utils.optimizers.adopt.html
- 2025-07-27T21:08:43.840Z
+ 2025-07-30T09:00:36.152Zhttps://docs.axolotl.ai/docs/api/monkeypatch.data.batch_dataset_fetcher.html
- 2025-07-27T21:08:43.707Z
+ 2025-07-30T09:00:36.012Zhttps://docs.axolotl.ai/docs/api/cli.cloud.modal_.html
- 2025-07-27T21:08:43.153Z
+ 2025-07-30T09:00:35.436Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.alpaca_chat.html
- 2025-07-27T21:08:43.346Z
+ 2025-07-30T09:00:35.637Zhttps://docs.axolotl.ai/docs/api/utils.freeze.html
- 2025-07-27T21:08:43.768Z
+ 2025-07-30T09:00:36.074Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.bradley_terry.llama3.html
- 2025-07-27T21:08:43.485Z
+ 2025-07-30T09:00:35.780Zhttps://docs.axolotl.ai/docs/api/integrations.base.html
- 2025-07-27T21:08:44.092Z
+ 2025-07-30T09:00:36.412Zhttps://docs.axolotl.ai/docs/api/monkeypatch.unsloth_.html
- 2025-07-27T21:08:43.706Z
+ 2025-07-30T09:00:36.010Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.kto.chatml.html
- 2025-07-27T21:08:43.460Z
+ 2025-07-30T09:00:35.754Zhttps://docs.axolotl.ai/docs/api/cli.main.html
- 2025-07-27T21:08:42.996Z
+ 2025-07-30T09:00:35.273Zhttps://docs.axolotl.ai/docs/api/common.datasets.html
- 2025-07-27T21:08:44.132Z
+ 2025-07-30T09:00:36.453Zhttps://docs.axolotl.ai/docs/api/train.html
- 2025-07-27T21:08:42.811Z
+ 2025-07-30T09:00:35.081Zhttps://docs.axolotl.ai/docs/api/core.trainers.base.html
- 2025-07-27T21:08:43.168Z
+ 2025-07-30T09:00:35.452Zhttps://docs.axolotl.ai/docs/api/core.trainers.mixins.scheduler.html
- 2025-07-27T21:08:43.275Z
+ 2025-07-30T09:00:35.564Zhttps://docs.axolotl.ai/docs/api/utils.ctx_managers.sequence_parallel.html
- 2025-07-27T21:08:43.298Z
+ 2025-07-30T09:00:35.588Zhttps://docs.axolotl.ai/docs/api/utils.schemas.config.html
- 2025-07-27T21:08:43.883Z
+ 2025-07-30T09:00:36.197Zhttps://docs.axolotl.ai/docs/api/loaders.tokenizer.html
- 2025-07-27T21:08:43.241Z
+ 2025-07-30T09:00:35.529Zhttps://docs.axolotl.ai/docs/api/integrations.liger.args.html
- 2025-07-27T21:08:44.108Z
+ 2025-07-30T09:00:36.428Zhttps://docs.axolotl.ai/docs/api/cli.config.html
- 2025-07-27T21:08:43.057Z
+ 2025-07-30T09:00:35.336Zhttps://docs.axolotl.ai/docs/api/loaders.processor.html
- 2025-07-27T21:08:43.242Z
+ 2025-07-30T09:00:35.530Zhttps://docs.axolotl.ai/docs/api/monkeypatch.utils.html
- 2025-07-27T21:08:43.686Z
+ 2025-07-30T09:00:35.990Zhttps://docs.axolotl.ai/docs/api/integrations.cut_cross_entropy.args.html
- 2025-07-27T21:08:44.096Z
+ 2025-07-30T09:00:36.416Zhttps://docs.axolotl.ai/docs/api/core.trainers.dpo.trainer.html
- 2025-07-27T21:08:43.199Z
+ 2025-07-30T09:00:35.485Zhttps://docs.axolotl.ai/docs/api/loaders.adapter.html
- 2025-07-27T21:08:43.248Z
+ 2025-07-30T09:00:35.536Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.dpo.llama3.html
- 2025-07-27T21:08:43.429Z
+ 2025-07-30T09:00:35.722Zhttps://docs.axolotl.ai/docs/api/core.datasets.transforms.chat_builder.html
- 2025-07-27T21:08:42.964Z
+ 2025-07-30T09:00:35.240Zhttps://docs.axolotl.ai/docs/api/monkeypatch.transformers_fa_utils.html
- 2025-07-27T21:08:43.704Z
+ 2025-07-30T09:00:36.009Zhttps://docs.axolotl.ai/docs/api/datasets.html
- 2025-07-27T21:08:42.832Z
+ 2025-07-30T09:00:35.103Zhttps://docs.axolotl.ai/docs/api/monkeypatch.btlm_attn_hijack_flash.html
- 2025-07-27T21:08:43.687Z
+ 2025-07-30T09:00:35.991Zhttps://docs.axolotl.ai/docs/api/cli.inference.html
- 2025-07-27T21:08:43.071Z
+ 2025-07-30T09:00:35.351Zhttps://docs.axolotl.ai/docs/api/cli.utils.html
- 2025-07-27T21:08:43.137Z
+ 2025-07-30T09:00:35.419Zhttps://docs.axolotl.ai/docs/api/cli.preprocess.html
- 2025-07-27T21:08:43.099Z
+ 2025-07-30T09:00:35.380Zhttps://docs.axolotl.ai/docs/api/loaders.model.html
- 2025-07-27T21:08:43.233Z
+ 2025-07-30T09:00:35.520Zhttps://docs.axolotl.ai/docs/api/core.trainers.grpo.sampler.html
- 2025-07-27T21:08:43.222Z
+ 2025-07-30T09:00:35.508Zhttps://docs.axolotl.ai/docs/api/core.trainers.grpo.trainer.html
- 2025-07-27T21:08:43.210Z
+ 2025-07-30T09:00:35.495Zhttps://docs.axolotl.ai/docs/batch_vs_grad.html
- 2025-07-27T21:05:22.822Z
+ 2025-07-30T08:57:15.649Zhttps://docs.axolotl.ai/docs/custom_integrations.html
- 2025-07-27T21:05:22.822Z
+ 2025-07-30T08:57:15.649Zhttps://docs.axolotl.ai/docs/quantize.html
- 2025-07-27T21:05:22.826Z
+ 2025-07-30T08:57:15.653Zhttps://docs.axolotl.ai/docs/installation.html
- 2025-07-27T21:05:22.826Z
+ 2025-07-30T08:57:15.653Zhttps://docs.axolotl.ai/docs/docker.html
- 2025-07-27T21:05:22.823Z
+ 2025-07-30T08:57:15.650Zhttps://docs.axolotl.ai/docs/reward_modelling.html
- 2025-07-27T21:05:22.826Z
+ 2025-07-30T08:57:15.653Zhttps://docs.axolotl.ai/docs/dataset-formats/inst_tune.html
- 2025-07-27T21:05:22.823Z
+ 2025-07-30T08:57:15.650Zhttps://docs.axolotl.ai/docs/dataset-formats/conversation.html
- 2025-07-27T21:05:22.822Z
+ 2025-07-30T08:57:15.649Zhttps://docs.axolotl.ai/docs/dataset-formats/tokenized.html
- 2025-07-27T21:05:22.823Z
+ 2025-07-30T08:57:15.650Zhttps://docs.axolotl.ai/docs/rlhf.html
- 2025-07-27T21:05:22.826Z
+ 2025-07-30T08:57:15.653Zhttps://docs.axolotl.ai/docs/sequence_parallelism.html
- 2025-07-27T21:05:22.827Z
+ 2025-07-30T08:57:15.654Zhttps://docs.axolotl.ai/docs/inference.html
- 2025-07-27T21:05:22.826Z
+ 2025-07-30T08:57:15.653Zhttps://docs.axolotl.ai/docs/fsdp_qlora.html
- 2025-07-27T21:05:22.823Z
+ 2025-07-30T08:57:15.650Zhttps://docs.axolotl.ai/docs/multi-node.html
- 2025-07-27T21:05:22.826Z
+ 2025-07-30T08:57:15.653Zhttps://docs.axolotl.ai/docs/lora_optims.html
- 2025-07-27T21:05:22.826Z
+ 2025-07-30T08:57:15.653Zhttps://docs.axolotl.ai/docs/getting-started.html
- 2025-07-27T21:05:22.823Z
+ 2025-07-30T08:57:15.650Zhttps://docs.axolotl.ai/docs/dataset_loading.html
- 2025-07-27T21:05:22.823Z
+ 2025-07-30T08:57:15.650Zhttps://docs.axolotl.ai/docs/lr_groups.html
- 2025-07-27T21:05:22.826Z
+ 2025-07-30T08:57:15.653Zhttps://docs.axolotl.ai/docs/mac.html
- 2025-07-27T21:05:22.826Z
+ 2025-07-30T08:57:15.653Zhttps://docs.axolotl.ai/src/axolotl/integrations/cut_cross_entropy/ACKNOWLEDGEMENTS.html
- 2025-07-27T21:05:22.846Z
+ 2025-07-30T08:57:15.673Zhttps://docs.axolotl.ai/src/axolotl/integrations/LICENSE.html
- 2025-07-27T21:05:22.846Z
+ 2025-07-30T08:57:15.673Zhttps://docs.axolotl.ai/docs/input_output.html
- 2025-07-27T21:05:22.826Z
+ 2025-07-30T08:57:15.653Zhttps://docs.axolotl.ai/docs/gradient_checkpointing.html
- 2025-07-27T21:05:22.823Z
+ 2025-07-30T08:57:15.650Zhttps://docs.axolotl.ai/docs/qat.html
- 2025-07-27T21:05:22.826Z
+ 2025-07-30T08:57:15.653Zhttps://docs.axolotl.ai/docs/faq.html
- 2025-07-27T21:05:22.823Z
+ 2025-07-30T08:57:15.650Zhttps://docs.axolotl.ai/docs/dataset_preprocessing.html
- 2025-07-27T21:05:22.823Z
+ 2025-07-30T08:57:15.650Zhttps://docs.axolotl.ai/docs/nccl.html
- 2025-07-27T21:05:22.826Z
+ 2025-07-30T08:57:15.653Zhttps://docs.axolotl.ai/docs/cli.html
- 2025-07-27T21:05:22.822Z
+ 2025-07-30T08:57:15.649Zhttps://docs.axolotl.ai/docs/torchao.html
- 2025-07-27T21:05:22.827Z
+ 2025-07-30T08:57:15.654Zhttps://docs.axolotl.ai/docs/multi-gpu.html
- 2025-07-27T21:05:22.826Z
+ 2025-07-30T08:57:15.653Zhttps://docs.axolotl.ai/docs/dataset-formats/pretraining.html
- 2025-07-27T21:05:22.823Z
+ 2025-07-30T08:57:15.650Zhttps://docs.axolotl.ai/docs/dataset-formats/index.html
- 2025-07-27T21:05:22.823Z
+ 2025-07-30T08:57:15.650Zhttps://docs.axolotl.ai/docs/dataset-formats/template_free.html
- 2025-07-27T21:05:22.823Z
+ 2025-07-30T08:57:15.650Zhttps://docs.axolotl.ai/docs/dataset-formats/stepwise_supervised.html
- 2025-07-27T21:05:22.823Z
+ 2025-07-30T08:57:15.650Zhttps://docs.axolotl.ai/docs/ray-integration.html
- 2025-07-27T21:05:22.826Z
+ 2025-07-30T08:57:15.653Zhttps://docs.axolotl.ai/docs/unsloth.html
- 2025-07-27T21:05:22.827Z
+ 2025-07-30T08:57:15.654Zhttps://docs.axolotl.ai/docs/mixed_precision.html
- 2025-07-27T21:05:22.826Z
+ 2025-07-30T08:57:15.653Zhttps://docs.axolotl.ai/docs/multimodal.html
- 2025-07-27T21:05:22.826Z
+ 2025-07-30T08:57:15.653Zhttps://docs.axolotl.ai/docs/config-reference.html
- 2025-07-27T21:08:56.925Z
+ 2025-07-30T09:00:50.227Zhttps://docs.axolotl.ai/docs/api/prompt_tokenizers.html
- 2025-07-27T21:08:42.886Z
+ 2025-07-30T09:00:35.160Zhttps://docs.axolotl.ai/docs/api/utils.schedulers.html
- 2025-07-27T21:08:43.809Z
+ 2025-07-30T09:00:36.118Zhttps://docs.axolotl.ai/docs/api/utils.samplers.multipack.html
- 2025-07-27T21:08:44.202Z
+ 2025-07-30T09:00:36.525Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.orcamini.html
- 2025-07-27T21:08:43.406Z
+ 2025-07-30T09:00:35.699Zhttps://docs.axolotl.ai/docs/api/core.trainers.trl.html
- 2025-07-27T21:08:43.183Z
+ 2025-07-30T09:00:35.467Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.metharme.html
- 2025-07-27T21:08:43.402Z
+ 2025-07-30T09:00:35.695Zhttps://docs.axolotl.ai/docs/api/utils.callbacks.profiler.html
- 2025-07-27T21:08:44.212Z
+ 2025-07-30T09:00:36.535Zhttps://docs.axolotl.ai/docs/api/utils.schemas.enums.html
- 2025-07-27T21:08:43.952Z
+ 2025-07-30T09:00:36.268Zhttps://docs.axolotl.ai/docs/api/core.trainers.mamba.html
- 2025-07-27T21:08:43.188Z
+ 2025-07-30T09:00:35.473Zhttps://docs.axolotl.ai/docs/api/monkeypatch.llama_attn_hijack_flash.html
- 2025-07-27T21:08:43.624Z
+ 2025-07-30T09:00:35.925Zhttps://docs.axolotl.ai/docs/api/monkeypatch.relora.html
- 2025-07-27T21:08:43.648Z
+ 2025-07-30T09:00:35.950Zhttps://docs.axolotl.ai/docs/api/monkeypatch.stablelm_attn_hijack_flash.html
- 2025-07-27T21:08:43.694Z
+ 2025-07-30T09:00:35.999Zhttps://docs.axolotl.ai/docs/api/loaders.constants.html
- 2025-07-27T21:08:43.259Z
+ 2025-07-30T09:00:35.548Zhttps://docs.axolotl.ai/docs/api/utils.callbacks.qat.html
- 2025-07-27T21:08:44.227Z
+ 2025-07-30T09:00:36.552Zhttps://docs.axolotl.ai/docs/api/utils.schemas.model.html
- 2025-07-27T21:08:43.889Z
+ 2025-07-30T09:00:36.204Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.stepwise_supervised.html
- 2025-07-27T21:08:43.396Z
+ 2025-07-30T09:00:35.688Zhttps://docs.axolotl.ai/docs/api/integrations.grokfast.optimizer.html
- 2025-07-27T21:08:44.097Z
+ 2025-07-30T09:00:36.417Zhttps://docs.axolotl.ai/docs/api/convert.html
- 2025-07-27T21:08:42.845Z
+ 2025-07-30T09:00:35.116Zhttps://docs.axolotl.ai/docs/api/kernels.quantize.html
- 2025-07-27T21:08:43.596Z
+ 2025-07-30T09:00:35.897Zhttps://docs.axolotl.ai/docs/api/core.training_args.html
- 2025-07-27T21:08:42.924Z
+ 2025-07-30T09:00:35.199Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.chat_template.html
- 2025-07-27T21:08:43.332Z
+ 2025-07-30T09:00:35.623Zhttps://docs.axolotl.ai/docs/api/index.html
- 2025-07-27T21:08:42.749Z
+ 2025-07-30T09:00:35.018Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.llama2_chat.html
- 2025-07-27T21:08:43.379Z
+ 2025-07-30T09:00:35.671Zhttps://docs.axolotl.ai/docs/api/utils.trainer.html
- 2025-07-27T21:08:43.784Z
+ 2025-07-30T09:00:36.092Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.messages.chat.html
- 2025-07-27T21:08:43.417Z
+ 2025-07-30T09:00:35.710Zhttps://docs.axolotl.ai/docs/api/monkeypatch.lora_kernels.html
- 2025-07-27T21:08:43.678Z
+ 2025-07-30T09:00:35.982Zhttps://docs.axolotl.ai/docs/api/kernels.lora.html
- 2025-07-27T21:08:43.568Z
+ 2025-07-30T09:00:35.866Zhttps://docs.axolotl.ai/docs/api/cli.vllm_serve.html
- 2025-07-27T21:08:43.144Z
+ 2025-07-30T09:00:35.427Zhttps://docs.axolotl.ai/docs/api/utils.schemas.multimodal.html
- 2025-07-27T21:08:43.930Z
+ 2025-07-30T09:00:36.245Zhttps://docs.axolotl.ai/docs/api/utils.schemas.utils.html
- 2025-07-27T21:08:43.958Z
+ 2025-07-30T09:00:36.274Zhttps://docs.axolotl.ai/docs/api/monkeypatch.llama_attn_hijack_xformers.html
- 2025-07-27T21:08:43.625Z
+ 2025-07-30T09:00:35.926Zhttps://docs.axolotl.ai/docs/api/integrations.lm_eval.args.html
- 2025-07-27T21:08:44.111Z
+ 2025-07-30T09:00:36.431Zhttps://docs.axolotl.ai/docs/api/monkeypatch.mistral_attn_hijack_flash.html
- 2025-07-27T21:08:43.640Z
+ 2025-07-30T09:00:35.942Zhttps://docs.axolotl.ai/docs/api/utils.collators.core.html
- 2025-07-27T21:08:44.135Z
+ 2025-07-30T09:00:36.456Zhttps://docs.axolotl.ai/docs/api/core.chat.format.chatml.html
- 2025-07-27T21:08:42.948Z
+ 2025-07-30T09:00:35.224Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.dpo.passthrough.html
- 2025-07-27T21:08:43.443Z
+ 2025-07-30T09:00:35.737Zhttps://docs.axolotl.ai/docs/api/core.datasets.chat.html
- 2025-07-27T21:08:42.956Z
+ 2025-07-30T09:00:35.232Zhttps://docs.axolotl.ai/docs/api/utils.bench.html
- 2025-07-27T21:08:43.760Z
+ 2025-07-30T09:00:36.066Zhttps://docs.axolotl.ai/docs/api/utils.schemas.training.html
- 2025-07-27T21:08:43.895Z
+ 2025-07-30T09:00:36.210Zhttps://docs.axolotl.ai/docs/api/utils.collators.batching.html
- 2025-07-27T21:08:44.153Z
+ 2025-07-30T09:00:36.475Zhttps://docs.axolotl.ai/docs/api/monkeypatch.llama_patch_multipack.html
- 2025-07-27T21:08:43.689Z
+ 2025-07-30T09:00:35.993Zhttps://docs.axolotl.ai/docs/api/monkeypatch.multipack.html
- 2025-07-27T21:08:43.641Z
+ 2025-07-30T09:00:35.944Zhttps://docs.axolotl.ai/docs/api/core.builders.causal.html
- 2025-07-27T21:08:42.907Z
+ 2025-07-30T09:00:35.181Zhttps://docs.axolotl.ai/docs/api/cli.evaluate.html
- 2025-07-27T21:08:43.013Z
+ 2025-07-30T09:00:35.290Zhttps://docs.axolotl.ai/docs/api/monkeypatch.trainer_fsdp_optim.html
- 2025-07-27T21:08:43.698Z
+ 2025-07-30T09:00:36.002Zhttps://docs.axolotl.ai/docs/api/core.trainers.utils.html
- 2025-07-27T21:08:43.223Z
+ 2025-07-30T09:00:35.509Zhttps://docs.axolotl.ai/docs/api/utils.schemas.integrations.html
- 2025-07-27T21:08:43.942Z
+ 2025-07-30T09:00:36.258Zhttps://docs.axolotl.ai/docs/api/utils.dict.html
- 2025-07-27T21:08:43.832Z
+ 2025-07-30T09:00:36.144Zhttps://docs.axolotl.ai/docs/api/core.builders.rl.html
- 2025-07-27T21:08:42.911Z
+ 2025-07-30T09:00:35.186Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.orpo.chat_template.html
- 2025-07-27T21:08:43.481Z
+ 2025-07-30T09:00:35.777Zhttps://docs.axolotl.ai/docs/api/core.trainers.relora.html
- 2025-07-27T21:08:43.193Z
+ 2025-07-30T09:00:35.478Zhttps://docs.axolotl.ai/docs/api/integrations.spectrum.args.html
- 2025-07-27T21:08:44.114Z
+ 2025-07-30T09:00:36.435Zhttps://docs.axolotl.ai/docs/api/cli.quantize.html
- 2025-07-27T21:08:43.158Z
+ 2025-07-30T09:00:35.442Zhttps://docs.axolotl.ai/docs/api/cli.checks.html
- 2025-07-27T21:08:43.039Z
+ 2025-07-30T09:00:35.317Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.kto.llama3.html
- 2025-07-27T21:08:43.451Z
+ 2025-07-30T09:00:35.745Zhttps://docs.axolotl.ai/docs/api/utils.model_shard_quant.html
- 2025-07-27T21:08:43.756Z
+ 2025-07-30T09:00:36.062Zhttps://docs.axolotl.ai/docs/api/utils.quantization.html
- 2025-07-27T21:08:43.869Z
+ 2025-07-30T09:00:36.183Zhttps://docs.axolotl.ai/docs/api/core.trainers.mixins.rng_state_loader.html
- 2025-07-27T21:08:43.268Z
+ 2025-07-30T09:00:35.557Zhttps://docs.axolotl.ai/docs/api/kernels.geglu.html
- 2025-07-27T21:08:43.579Z
+ 2025-07-30T09:00:35.877Zhttps://docs.axolotl.ai/docs/api/utils.data.pretraining.html
- 2025-07-27T21:08:43.841Z
+ 2025-07-30T09:00:36.154Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.kto.user_defined.html
- 2025-07-27T21:08:43.461Z
+ 2025-07-30T09:00:35.755Zhttps://docs.axolotl.ai/docs/api/core.builders.base.html
- 2025-07-27T21:08:42.902Z
+ 2025-07-30T09:00:35.176Zhttps://docs.axolotl.ai/docs/api/cli.merge_lora.html
- 2025-07-27T21:08:43.079Z
+ 2025-07-30T09:00:35.359Zhttps://docs.axolotl.ai/docs/api/monkeypatch.mixtral.html
- 2025-07-27T21:08:43.709Z
+ 2025-07-30T09:00:36.013Zhttps://docs.axolotl.ai/docs/api/utils.data.sft.html
- 2025-07-27T21:08:43.849Z
+ 2025-07-30T09:00:36.161Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.user_defined.html
- 2025-07-27T21:08:43.367Z
+ 2025-07-30T09:00:35.659Zhttps://docs.axolotl.ai/docs/api/utils.tokenization.html
- 2025-07-27T21:08:43.744Z
+ 2025-07-30T09:00:36.050Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.dpo.chatml.html
- 2025-07-27T21:08:43.439Z
+ 2025-07-30T09:00:35.732Zhttps://docs.axolotl.ai/docs/api/models.mamba.modeling_mamba.html
- 2025-07-27T21:08:44.133Z
+ 2025-07-30T09:00:36.454Zhttps://docs.axolotl.ai/docs/api/cli.args.html
- 2025-07-27T21:08:43.032Z
+ 2025-07-30T09:00:35.310Zhttps://docs.axolotl.ai/docs/api/evaluate.html
- 2025-07-27T21:08:42.821Z
+ 2025-07-30T09:00:35.092Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.alpaca_instruct.html
- 2025-07-27T21:08:43.347Z
+ 2025-07-30T09:00:35.638Zhttps://docs.axolotl.ai/docs/api/utils.distributed.html
- 2025-07-27T21:08:43.829Z
+ 2025-07-30T09:00:36.139Zhttps://docs.axolotl.ai/docs/multipack.html
- 2025-07-27T21:05:22.826Z
+ 2025-07-30T08:57:15.653Zhttps://docs.axolotl.ai/examples/colab-notebooks/colab-axolotl-example.html
- 2025-07-27T21:05:22.831Z
+ 2025-07-30T08:57:15.657Zhttps://docs.axolotl.ai/FAQS.html
- 2025-07-27T21:05:22.821Z
+ 2025-07-30T08:57:15.648Z