diff --git a/.nojekyll b/.nojekyll index fbaa6d79e..df3be972b 100644 --- a/.nojekyll +++ b/.nojekyll @@ -1 +1 @@ -0ec66adc \ No newline at end of file +4cff29b7 \ No newline at end of file diff --git a/docs/api/utils.dict.html b/docs/api/utils.dict.html index d71c1f889..5514987af 100644 --- a/docs/api/utils.dict.html +++ b/docs/api/utils.dict.html @@ -488,6 +488,10 @@ gtag('config', 'G-9KYCVJBNMQ', { 'anonymize_ip': true}); +
  • Functions +
  • @@ -522,6 +526,30 @@ gtag('config', 'G-9KYCVJBNMQ', { 'anonymize_ip': true});

    DictDefault

    utils.dict.DictDefault()

    A Dict that returns None instead of returning empty Dict for missing keys.

    + + +
    +

    Functions

    + + + + + + + + + + + + + +
    NameDescription
    remove_none_valuesRemove null from a dictionary-like obj or list.
    +
    +

    remove_none_values

    +
    utils.dict.remove_none_values(obj)
    +

    Remove null from a dictionary-like obj or list. +These can appear due to Dataset loading causing schema merge. +See https://github.com/axolotl-ai-cloud/axolotl/pull/2909

    diff --git a/docs/custom_integrations.html b/docs/custom_integrations.html index a6194dac2..2c480d38e 100644 --- a/docs/custom_integrations.html +++ b/docs/custom_integrations.html @@ -574,7 +574,7 @@ gtag('config', 'G-9KYCVJBNMQ', { 'anonymize_ip': true}); -
    pip3 uninstall -y cut-cross-entropy && pip3 install "cut-cross-entropy[transformers] @ git+https://github.com/axolotl-ai-cloud/ml-cross-entropy.git@631d646"
    +
    pip3 uninstall -y cut-cross-entropy && pip3 install "cut-cross-entropy[transformers] @ git+https://github.com/axolotl-ai-cloud/ml-cross-entropy.git@010c3ac3f1e725098961832830303eeb4142dd88"

    Usage

    diff --git a/examples/colab-notebooks/colab-axolotl-example.html b/examples/colab-notebooks/colab-axolotl-example.html index 6e94b0ed5..5f4329b66 100644 --- a/examples/colab-notebooks/colab-axolotl-example.html +++ b/examples/colab-notebooks/colab-axolotl-example.html @@ -538,7 +538,7 @@ gtag('config', 'G-9KYCVJBNMQ', { 'anonymize_ip': true});
    %%capture
     # This step can take ~5-10 minutes to install dependencies
     !pip install --no-build-isolation axolotl[flash-attn]>=0.9.1
    -!pip install "cut-cross-entropy[transformers] @ git+https://github.com/axolotl-ai-cloud/ml-cross-entropy.git@631d646"
    +!pip install "cut-cross-entropy[transformers] @ git+https://github.com/axolotl-ai-cloud/ml-cross-entropy.git@010c3ac3f1e725098961832830303eeb4142dd88"

    Demo: Talk Like a Pirate

    diff --git a/index.html b/index.html index 4ed7603b4..424de4cca 100644 --- a/index.html +++ b/index.html @@ -528,6 +528,7 @@ gtag('config', 'G-9KYCVJBNMQ', { 'anonymize_ip': true});

    🎉 Latest Updates

      +
    • 2025/07: Voxtral with mistral-common tokenizer support has been integrated in Axolotl. Read the docs!
    • 2025/07: TiledMLP support for single-GPU to multi-GPU training with DDP, DeepSpeed and FSDP support has been added to support Arctic Long Sequence Training. (ALST). See examples for using ALST with Axolotl!
    • 2025/06: Magistral with mistral-common tokenizer support has been added to Axolotl. See examples to start training your own Magistral models with Axolotl!
    • 2025/05: Quantization Aware Training (QAT) support has been added to Axolotl. Explore the docs to learn more!
    • diff --git a/search.json b/search.json index d3f20c606..212ce1243 100644 --- a/search.json +++ b/search.json @@ -18,7 +18,7 @@ "href": "index.html#latest-updates", "title": "Axolotl", "section": "🎉 Latest Updates", - "text": "🎉 Latest Updates\n\n2025/07: TiledMLP support for single-GPU to multi-GPU training with DDP, DeepSpeed and FSDP support has been added to support Arctic Long Sequence Training. (ALST). See examples for using ALST with Axolotl!\n2025/06: Magistral with mistral-common tokenizer support has been added to Axolotl. See examples to start training your own Magistral models with Axolotl!\n2025/05: Quantization Aware Training (QAT) support has been added to Axolotl. Explore the docs to learn more!\n2025/04: Llama 4 support has been added in Axolotl. See examples to start training your own Llama 4 models with Axolotl’s linearized version!\n2025/03: Axolotl has implemented Sequence Parallelism (SP) support. Read the blog and docs to learn how to scale your context length when fine-tuning.\n2025/03: (Beta) Fine-tuning Multimodal models is now supported in Axolotl. Check out the docs to fine-tune your own!\n2025/02: Axolotl has added LoRA optimizations to reduce memory usage and improve training speed for LoRA and QLoRA in single GPU and multi-GPU training (DDP and DeepSpeed). Jump into the docs to give it a try.\n2025/02: Axolotl has added GRPO support. Dive into our blog and GRPO example and have some fun!\n2025/01: Axolotl has added Reward Modelling / Process Reward Modelling fine-tuning support. See docs.", + "text": "🎉 Latest Updates\n\n2025/07: Voxtral with mistral-common tokenizer support has been integrated in Axolotl. Read the docs!\n2025/07: TiledMLP support for single-GPU to multi-GPU training with DDP, DeepSpeed and FSDP support has been added to support Arctic Long Sequence Training. (ALST). See examples for using ALST with Axolotl!\n2025/06: Magistral with mistral-common tokenizer support has been added to Axolotl. See examples to start training your own Magistral models with Axolotl!\n2025/05: Quantization Aware Training (QAT) support has been added to Axolotl. Explore the docs to learn more!\n2025/04: Llama 4 support has been added in Axolotl. See examples to start training your own Llama 4 models with Axolotl’s linearized version!\n2025/03: Axolotl has implemented Sequence Parallelism (SP) support. Read the blog and docs to learn how to scale your context length when fine-tuning.\n2025/03: (Beta) Fine-tuning Multimodal models is now supported in Axolotl. Check out the docs to fine-tune your own!\n2025/02: Axolotl has added LoRA optimizations to reduce memory usage and improve training speed for LoRA and QLoRA in single GPU and multi-GPU training (DDP and DeepSpeed). Jump into the docs to give it a try.\n2025/02: Axolotl has added GRPO support. Dive into our blog and GRPO example and have some fun!\n2025/01: Axolotl has added Reward Modelling / Process Reward Modelling fine-tuning support. See docs.", "crumbs": [ "Home" ] @@ -1153,7 +1153,7 @@ "href": "docs/custom_integrations.html#cut-cross-entropy", "title": "Custom Integrations", "section": "Cut Cross Entropy", - "text": "Cut Cross Entropy\nCut Cross Entropy (CCE) reduces VRAM usage through optimization on the cross-entropy operation during loss calculation.\nSee https://github.com/apple/ml-cross-entropy\n\nRequirements\n\nPyTorch 2.4.0 or higher\n\n\n\nInstallation\nRun the following command to install cut_cross_entropy[transformers] if you don’t have it already.\n\nIf you are in dev environment\n\npython scripts/cutcrossentropy_install.py | sh\n\nIf you are installing from pip\n\npip3 uninstall -y cut-cross-entropy && pip3 install \"cut-cross-entropy[transformers] @ git+https://github.com/axolotl-ai-cloud/ml-cross-entropy.git@631d646\"\n\n\nUsage\nplugins:\n - axolotl.integrations.cut_cross_entropy.CutCrossEntropyPlugin\n\n\nSupported Models\n\ncohere\ncohere2\ngemma\ngemma2\ngemma3\ngemma3_text\ngemma3n\ngemma3n_text\nglm\nglm4\nllama\nllama4\nllama4_text\nmistral\nmistral3\nmllama\nphi\nphi3\nphi4_multimodal\nqwen2\nqwen2_vl\nqwen2_moe\nqwen2_5_vl\nqwen3\nqwen3_moe\n\n\n\nCitation\n@article{wijmans2024cut,\n author = {Erik Wijmans and\n Brody Huval and\n Alexander Hertzberg and\n Vladlen Koltun and\n Philipp Kr\\\"ahenb\\\"uhl},\n title = {Cut Your Losses in Large-Vocabulary Language Models},\n journal = {arXiv},\n year = {2024},\n url = {https://arxiv.org/abs/2411.09009},\n}\nPlease see reference here", + "text": "Cut Cross Entropy\nCut Cross Entropy (CCE) reduces VRAM usage through optimization on the cross-entropy operation during loss calculation.\nSee https://github.com/apple/ml-cross-entropy\n\nRequirements\n\nPyTorch 2.4.0 or higher\n\n\n\nInstallation\nRun the following command to install cut_cross_entropy[transformers] if you don’t have it already.\n\nIf you are in dev environment\n\npython scripts/cutcrossentropy_install.py | sh\n\nIf you are installing from pip\n\npip3 uninstall -y cut-cross-entropy && pip3 install \"cut-cross-entropy[transformers] @ git+https://github.com/axolotl-ai-cloud/ml-cross-entropy.git@010c3ac3f1e725098961832830303eeb4142dd88\"\n\n\nUsage\nplugins:\n - axolotl.integrations.cut_cross_entropy.CutCrossEntropyPlugin\n\n\nSupported Models\n\ncohere\ncohere2\ngemma\ngemma2\ngemma3\ngemma3_text\ngemma3n\ngemma3n_text\nglm\nglm4\nllama\nllama4\nllama4_text\nmistral\nmistral3\nmllama\nphi\nphi3\nphi4_multimodal\nqwen2\nqwen2_vl\nqwen2_moe\nqwen2_5_vl\nqwen3\nqwen3_moe\n\n\n\nCitation\n@article{wijmans2024cut,\n author = {Erik Wijmans and\n Brody Huval and\n Alexander Hertzberg and\n Vladlen Koltun and\n Philipp Kr\\\"ahenb\\\"uhl},\n title = {Cut Your Losses in Large-Vocabulary Language Models},\n journal = {arXiv},\n year = {2024},\n url = {https://arxiv.org/abs/2411.09009},\n}\nPlease see reference here", "crumbs": [ "Advanced Features", "Custom Integrations" @@ -3539,7 +3539,7 @@ "href": "docs/api/utils.dict.html", "title": "utils.dict", "section": "", - "text": "utils.dict\nModule containing the DictDefault class\n\n\n\n\n\nName\nDescription\n\n\n\n\nDictDefault\nA Dict that returns None instead of returning empty Dict for missing keys.\n\n\n\n\n\nutils.dict.DictDefault()\nA Dict that returns None instead of returning empty Dict for missing keys." + "text": "utils.dict\nModule containing the DictDefault class\n\n\n\n\n\nName\nDescription\n\n\n\n\nDictDefault\nA Dict that returns None instead of returning empty Dict for missing keys.\n\n\n\n\n\nutils.dict.DictDefault()\nA Dict that returns None instead of returning empty Dict for missing keys.\n\n\n\n\n\n\n\nName\nDescription\n\n\n\n\nremove_none_values\nRemove null from a dictionary-like obj or list.\n\n\n\n\n\nutils.dict.remove_none_values(obj)\nRemove null from a dictionary-like obj or list.\nThese can appear due to Dataset loading causing schema merge.\nSee https://github.com/axolotl-ai-cloud/axolotl/pull/2909" }, { "objectID": "docs/api/utils.dict.html#classes", @@ -3548,6 +3548,13 @@ "section": "", "text": "Name\nDescription\n\n\n\n\nDictDefault\nA Dict that returns None instead of returning empty Dict for missing keys.\n\n\n\n\n\nutils.dict.DictDefault()\nA Dict that returns None instead of returning empty Dict for missing keys." }, + { + "objectID": "docs/api/utils.dict.html#functions", + "href": "docs/api/utils.dict.html#functions", + "title": "utils.dict", + "section": "", + "text": "Name\nDescription\n\n\n\n\nremove_none_values\nRemove null from a dictionary-like obj or list.\n\n\n\n\n\nutils.dict.remove_none_values(obj)\nRemove null from a dictionary-like obj or list.\nThese can appear due to Dataset loading causing schema merge.\nSee https://github.com/axolotl-ai-cloud/axolotl/pull/2909" + }, { "objectID": "docs/api/core.builders.rl.html", "href": "docs/api/core.builders.rl.html", diff --git a/sitemap.xml b/sitemap.xml index 06669a5a6..9129d7daa 100644 --- a/sitemap.xml +++ b/sitemap.xml @@ -2,766 +2,766 @@ https://docs.axolotl.ai/TODO.html - 2025-07-30T10:29:12.470Z + 2025-07-30T10:49:57.096Z https://docs.axolotl.ai/index.html - 2025-07-30T10:29:12.491Z + 2025-07-30T10:49:57.116Z https://docs.axolotl.ai/docs/debugging.html - 2025-07-30T10:29:12.472Z + 2025-07-30T10:49:57.098Z https://docs.axolotl.ai/docs/amd_hpc.html - 2025-07-30T10:29:12.471Z + 2025-07-30T10:49:57.097Z https://docs.axolotl.ai/docs/api/utils.callbacks.mlflow_.html - 2025-07-30T10:32:25.500Z + 2025-07-30T10:53:08.360Z https://docs.axolotl.ai/docs/api/monkeypatch.llama_expand_mask.html - 2025-07-30T10:32:24.931Z + 2025-07-30T10:53:07.792Z https://docs.axolotl.ai/docs/api/loaders.patch_manager.html - 2025-07-30T10:32:24.540Z + 2025-07-30T10:53:07.402Z https://docs.axolotl.ai/docs/api/core.chat.format.llama3x.html - 2025-07-30T10:32:24.231Z + 2025-07-30T10:53:07.095Z https://docs.axolotl.ai/docs/api/cli.train.html - 2025-07-30T10:32:24.285Z + 2025-07-30T10:53:07.148Z https://docs.axolotl.ai/docs/api/utils.callbacks.perplexity.html - 2025-07-30T10:32:25.491Z + 2025-07-30T10:53:08.351Z https://docs.axolotl.ai/docs/api/core.chat.messages.html - 2025-07-30T10:32:24.228Z + 2025-07-30T10:53:07.092Z https://docs.axolotl.ai/docs/api/utils.callbacks.lisa.html - 2025-07-30T10:32:25.496Z + 2025-07-30T10:53:08.356Z https://docs.axolotl.ai/docs/api/cli.merge_sharded_fsdp_weights.html - 2025-07-30T10:32:24.371Z + 2025-07-30T10:53:07.234Z https://docs.axolotl.ai/docs/api/cli.sweeps.html - 2025-07-30T10:32:24.385Z + 2025-07-30T10:53:07.248Z https://docs.axolotl.ai/docs/api/utils.chat_templates.html - 2025-07-30T10:32:25.028Z + 2025-07-30T10:53:07.888Z https://docs.axolotl.ai/docs/api/core.chat.format.shared.html - 2025-07-30T10:32:24.233Z + 2025-07-30T10:53:07.096Z https://docs.axolotl.ai/docs/api/core.trainers.mixins.optimizer.html - 2025-07-30T10:32:24.547Z + 2025-07-30T10:53:07.409Z https://docs.axolotl.ai/docs/api/utils.collators.mamba.html - 2025-07-30T10:32:25.437Z + 2025-07-30T10:53:08.300Z https://docs.axolotl.ai/docs/api/logging_config.html - 2025-07-30T10:32:24.177Z + 2025-07-30T10:53:07.041Z https://docs.axolotl.ai/docs/api/utils.collators.mm_chat.html - 2025-07-30T10:32:25.441Z + 2025-07-30T10:53:08.304Z https://docs.axolotl.ai/docs/api/prompt_strategies.completion.html - 2025-07-30T10:32:24.667Z + 2025-07-30T10:53:07.529Z https://docs.axolotl.ai/docs/api/kernels.utils.html - 2025-07-30T10:32:24.881Z + 2025-07-30T10:53:07.741Z https://docs.axolotl.ai/docs/api/prompt_strategies.dpo.chat_template.html - 2025-07-30T10:32:24.700Z + 2025-07-30T10:53:07.562Z https://docs.axolotl.ai/docs/api/kernels.swiglu.html - 2025-07-30T10:32:24.872Z + 2025-07-30T10:53:07.732Z https://docs.axolotl.ai/docs/api/common.const.html - 2025-07-30T10:32:25.397Z + 2025-07-30T10:53:08.260Z https://docs.axolotl.ai/docs/api/cli.cloud.base.html - 2025-07-30T10:32:24.427Z + 2025-07-30T10:53:07.290Z https://docs.axolotl.ai/docs/api/utils.callbacks.comet_.html - 2025-07-30T10:32:25.503Z + 2025-07-30T10:53:08.363Z https://docs.axolotl.ai/docs/api/monkeypatch.gradient_checkpointing.offload_cpu.html - 2025-07-30T10:32:24.994Z + 2025-07-30T10:53:07.854Z https://docs.axolotl.ai/docs/api/common.architectures.html - 2025-07-30T10:32:25.396Z + 2025-07-30T10:53:08.259Z https://docs.axolotl.ai/docs/api/prompt_strategies.pygmalion.html - 2025-07-30T10:32:24.695Z + 2025-07-30T10:53:07.557Z https://docs.axolotl.ai/docs/api/utils.schemas.peft.html - 2025-07-30T10:32:25.202Z + 2025-07-30T10:53:08.065Z https://docs.axolotl.ai/docs/api/prompt_strategies.dpo.user_defined.html - 2025-07-30T10:32:24.724Z + 2025-07-30T10:53:07.586Z https://docs.axolotl.ai/docs/api/utils.schemas.datasets.html - 2025-07-30T10:32:25.194Z + 2025-07-30T10:53:08.057Z https://docs.axolotl.ai/docs/api/prompt_strategies.alpaca_w_system.html - 2025-07-30T10:32:24.640Z + 2025-07-30T10:53:07.503Z https://docs.axolotl.ai/docs/api/prompt_strategies.base.html - 2025-07-30T10:32:24.581Z + 2025-07-30T10:53:07.444Z https://docs.axolotl.ai/docs/api/utils.lora.html - 2025-07-30T10:32:25.033Z + 2025-07-30T10:53:07.893Z https://docs.axolotl.ai/docs/api/prompt_strategies.input_output.html - 2025-07-30T10:32:24.673Z + 2025-07-30T10:53:07.535Z https://docs.axolotl.ai/docs/api/utils.schemas.trl.html - 2025-07-30T10:32:25.205Z + 2025-07-30T10:53:08.069Z https://docs.axolotl.ai/docs/api/prompt_strategies.dpo.zephyr.html - 2025-07-30T10:32:24.722Z + 2025-07-30T10:53:07.584Z https://docs.axolotl.ai/docs/api/integrations.kd.trainer.html - 2025-07-30T10:32:25.384Z + 2025-07-30T10:53:08.248Z https://docs.axolotl.ai/docs/api/monkeypatch.gradient_checkpointing.offload_disk.html - 2025-07-30T10:32:25.019Z + 2025-07-30T10:53:07.880Z https://docs.axolotl.ai/docs/api/utils.optimizers.adopt.html - 2025-07-30T10:32:25.122Z + 2025-07-30T10:53:07.984Z https://docs.axolotl.ai/docs/api/monkeypatch.data.batch_dataset_fetcher.html - 2025-07-30T10:32:24.989Z + 2025-07-30T10:53:07.850Z https://docs.axolotl.ai/docs/api/cli.cloud.modal_.html - 2025-07-30T10:32:24.433Z + 2025-07-30T10:53:07.296Z https://docs.axolotl.ai/docs/api/prompt_strategies.alpaca_chat.html - 2025-07-30T10:32:24.627Z + 2025-07-30T10:53:07.490Z https://docs.axolotl.ai/docs/api/utils.freeze.html - 2025-07-30T10:32:25.049Z + 2025-07-30T10:53:07.910Z https://docs.axolotl.ai/docs/api/prompt_strategies.bradley_terry.llama3.html - 2025-07-30T10:32:24.767Z + 2025-07-30T10:53:07.629Z https://docs.axolotl.ai/docs/api/integrations.base.html - 2025-07-30T10:32:25.372Z + 2025-07-30T10:53:08.236Z https://docs.axolotl.ai/docs/api/monkeypatch.unsloth_.html - 2025-07-30T10:32:24.987Z + 2025-07-30T10:53:07.848Z https://docs.axolotl.ai/docs/api/prompt_strategies.kto.chatml.html - 2025-07-30T10:32:24.741Z + 2025-07-30T10:53:07.604Z https://docs.axolotl.ai/docs/api/cli.main.html - 2025-07-30T10:32:24.277Z + 2025-07-30T10:53:07.140Z https://docs.axolotl.ai/docs/api/common.datasets.html - 2025-07-30T10:32:25.412Z + 2025-07-30T10:53:08.275Z https://docs.axolotl.ai/docs/api/train.html - 2025-07-30T10:32:24.092Z + 2025-07-30T10:53:06.956Z https://docs.axolotl.ai/docs/api/core.trainers.base.html - 2025-07-30T10:32:24.449Z + 2025-07-30T10:53:07.311Z https://docs.axolotl.ai/docs/api/core.trainers.mixins.scheduler.html - 2025-07-30T10:32:24.556Z + 2025-07-30T10:53:07.419Z https://docs.axolotl.ai/docs/api/utils.ctx_managers.sequence_parallel.html - 2025-07-30T10:32:24.580Z + 2025-07-30T10:53:07.442Z https://docs.axolotl.ai/docs/api/utils.schemas.config.html - 2025-07-30T10:32:25.164Z + 2025-07-30T10:53:08.027Z https://docs.axolotl.ai/docs/api/loaders.tokenizer.html - 2025-07-30T10:32:24.522Z + 2025-07-30T10:53:07.385Z https://docs.axolotl.ai/docs/api/integrations.liger.args.html - 2025-07-30T10:32:25.388Z + 2025-07-30T10:53:08.251Z https://docs.axolotl.ai/docs/api/cli.config.html - 2025-07-30T10:32:24.337Z + 2025-07-30T10:53:07.200Z https://docs.axolotl.ai/docs/api/loaders.processor.html - 2025-07-30T10:32:24.524Z + 2025-07-30T10:53:07.387Z https://docs.axolotl.ai/docs/api/monkeypatch.utils.html - 2025-07-30T10:32:24.967Z + 2025-07-30T10:53:07.828Z https://docs.axolotl.ai/docs/api/integrations.cut_cross_entropy.args.html - 2025-07-30T10:32:25.376Z + 2025-07-30T10:53:08.239Z https://docs.axolotl.ai/docs/api/core.trainers.dpo.trainer.html - 2025-07-30T10:32:24.480Z + 2025-07-30T10:53:07.343Z https://docs.axolotl.ai/docs/api/loaders.adapter.html - 2025-07-30T10:32:24.529Z + 2025-07-30T10:53:07.392Z https://docs.axolotl.ai/docs/api/prompt_strategies.dpo.llama3.html - 2025-07-30T10:32:24.711Z + 2025-07-30T10:53:07.573Z https://docs.axolotl.ai/docs/api/core.datasets.transforms.chat_builder.html - 2025-07-30T10:32:24.245Z + 2025-07-30T10:53:07.109Z https://docs.axolotl.ai/docs/api/monkeypatch.transformers_fa_utils.html - 2025-07-30T10:32:24.986Z + 2025-07-30T10:53:07.847Z https://docs.axolotl.ai/docs/api/datasets.html - 2025-07-30T10:32:24.113Z + 2025-07-30T10:53:06.977Z https://docs.axolotl.ai/docs/api/monkeypatch.btlm_attn_hijack_flash.html - 2025-07-30T10:32:24.969Z + 2025-07-30T10:53:07.830Z https://docs.axolotl.ai/docs/api/cli.inference.html - 2025-07-30T10:32:24.351Z + 2025-07-30T10:53:07.214Z https://docs.axolotl.ai/docs/api/cli.utils.html - 2025-07-30T10:32:24.417Z + 2025-07-30T10:53:07.280Z https://docs.axolotl.ai/docs/api/cli.preprocess.html - 2025-07-30T10:32:24.379Z + 2025-07-30T10:53:07.242Z https://docs.axolotl.ai/docs/api/loaders.model.html - 2025-07-30T10:32:24.514Z + 2025-07-30T10:53:07.377Z https://docs.axolotl.ai/docs/api/core.trainers.grpo.sampler.html - 2025-07-30T10:32:24.503Z + 2025-07-30T10:53:07.366Z https://docs.axolotl.ai/docs/api/core.trainers.grpo.trainer.html - 2025-07-30T10:32:24.491Z + 2025-07-30T10:53:07.354Z https://docs.axolotl.ai/docs/batch_vs_grad.html - 2025-07-30T10:29:12.471Z + 2025-07-30T10:49:57.097Z https://docs.axolotl.ai/docs/custom_integrations.html - 2025-07-30T10:29:12.471Z + 2025-07-30T10:49:57.097Z https://docs.axolotl.ai/docs/quantize.html - 2025-07-30T10:29:12.475Z + 2025-07-30T10:49:57.101Z https://docs.axolotl.ai/docs/installation.html - 2025-07-30T10:29:12.475Z + 2025-07-30T10:49:57.100Z https://docs.axolotl.ai/docs/docker.html - 2025-07-30T10:29:12.472Z + 2025-07-30T10:49:57.098Z https://docs.axolotl.ai/docs/reward_modelling.html - 2025-07-30T10:29:12.476Z + 2025-07-30T10:49:57.101Z https://docs.axolotl.ai/docs/dataset-formats/inst_tune.html - 2025-07-30T10:29:12.472Z + 2025-07-30T10:49:57.097Z https://docs.axolotl.ai/docs/dataset-formats/conversation.html - 2025-07-30T10:29:12.471Z + 2025-07-30T10:49:57.097Z https://docs.axolotl.ai/docs/dataset-formats/tokenized.html - 2025-07-30T10:29:12.472Z + 2025-07-30T10:49:57.097Z https://docs.axolotl.ai/docs/rlhf.html - 2025-07-30T10:29:12.476Z + 2025-07-30T10:49:57.101Z https://docs.axolotl.ai/docs/sequence_parallelism.html - 2025-07-30T10:29:12.476Z + 2025-07-30T10:49:57.101Z https://docs.axolotl.ai/docs/inference.html - 2025-07-30T10:29:12.475Z + 2025-07-30T10:49:57.100Z https://docs.axolotl.ai/docs/fsdp_qlora.html - 2025-07-30T10:29:12.472Z + 2025-07-30T10:49:57.098Z https://docs.axolotl.ai/docs/multi-node.html - 2025-07-30T10:29:12.475Z + 2025-07-30T10:49:57.101Z https://docs.axolotl.ai/docs/lora_optims.html - 2025-07-30T10:29:12.475Z + 2025-07-30T10:49:57.101Z https://docs.axolotl.ai/docs/getting-started.html - 2025-07-30T10:29:12.472Z + 2025-07-30T10:49:57.098Z https://docs.axolotl.ai/docs/dataset_loading.html - 2025-07-30T10:29:12.472Z + 2025-07-30T10:49:57.097Z https://docs.axolotl.ai/docs/lr_groups.html - 2025-07-30T10:29:12.475Z + 2025-07-30T10:49:57.101Z https://docs.axolotl.ai/docs/mac.html - 2025-07-30T10:29:12.475Z + 2025-07-30T10:49:57.101Z https://docs.axolotl.ai/src/axolotl/integrations/cut_cross_entropy/ACKNOWLEDGEMENTS.html - 2025-07-30T10:29:12.495Z + 2025-07-30T10:49:57.120Z https://docs.axolotl.ai/src/axolotl/integrations/LICENSE.html - 2025-07-30T10:29:12.495Z + 2025-07-30T10:49:57.120Z https://docs.axolotl.ai/docs/input_output.html - 2025-07-30T10:29:12.475Z + 2025-07-30T10:49:57.100Z https://docs.axolotl.ai/docs/gradient_checkpointing.html - 2025-07-30T10:29:12.472Z + 2025-07-30T10:49:57.098Z https://docs.axolotl.ai/docs/qat.html - 2025-07-30T10:29:12.475Z + 2025-07-30T10:49:57.101Z https://docs.axolotl.ai/docs/faq.html - 2025-07-30T10:29:12.472Z + 2025-07-30T10:49:57.098Z https://docs.axolotl.ai/docs/dataset_preprocessing.html - 2025-07-30T10:29:12.472Z + 2025-07-30T10:49:57.097Z https://docs.axolotl.ai/docs/nccl.html - 2025-07-30T10:29:12.475Z + 2025-07-30T10:49:57.101Z https://docs.axolotl.ai/docs/cli.html - 2025-07-30T10:29:12.471Z + 2025-07-30T10:49:57.097Z https://docs.axolotl.ai/docs/torchao.html - 2025-07-30T10:29:12.476Z + 2025-07-30T10:49:57.101Z https://docs.axolotl.ai/docs/multi-gpu.html - 2025-07-30T10:29:12.475Z + 2025-07-30T10:49:57.101Z https://docs.axolotl.ai/docs/dataset-formats/pretraining.html - 2025-07-30T10:29:12.472Z + 2025-07-30T10:49:57.097Z https://docs.axolotl.ai/docs/dataset-formats/index.html - 2025-07-30T10:29:12.472Z + 2025-07-30T10:49:57.097Z https://docs.axolotl.ai/docs/dataset-formats/template_free.html - 2025-07-30T10:29:12.472Z + 2025-07-30T10:49:57.097Z https://docs.axolotl.ai/docs/dataset-formats/stepwise_supervised.html - 2025-07-30T10:29:12.472Z + 2025-07-30T10:49:57.097Z https://docs.axolotl.ai/docs/ray-integration.html - 2025-07-30T10:29:12.476Z + 2025-07-30T10:49:57.101Z https://docs.axolotl.ai/docs/unsloth.html - 2025-07-30T10:29:12.476Z + 2025-07-30T10:49:57.101Z https://docs.axolotl.ai/docs/mixed_precision.html - 2025-07-30T10:29:12.475Z + 2025-07-30T10:49:57.101Z https://docs.axolotl.ai/docs/multimodal.html - 2025-07-30T10:29:12.475Z + 2025-07-30T10:49:57.101Z https://docs.axolotl.ai/docs/config-reference.html - 2025-07-30T10:32:38.090Z + 2025-07-30T10:53:21.071Z https://docs.axolotl.ai/docs/api/prompt_tokenizers.html - 2025-07-30T10:32:24.168Z + 2025-07-30T10:53:07.032Z https://docs.axolotl.ai/docs/api/utils.schedulers.html - 2025-07-30T10:32:25.091Z + 2025-07-30T10:53:07.952Z https://docs.axolotl.ai/docs/api/utils.samplers.multipack.html - 2025-07-30T10:32:25.484Z + 2025-07-30T10:53:08.345Z https://docs.axolotl.ai/docs/api/prompt_strategies.orcamini.html - 2025-07-30T10:32:24.688Z + 2025-07-30T10:53:07.550Z https://docs.axolotl.ai/docs/api/core.trainers.trl.html - 2025-07-30T10:32:24.463Z + 2025-07-30T10:53:07.326Z https://docs.axolotl.ai/docs/api/prompt_strategies.metharme.html - 2025-07-30T10:32:24.684Z + 2025-07-30T10:53:07.547Z https://docs.axolotl.ai/docs/api/utils.callbacks.profiler.html - 2025-07-30T10:32:25.495Z + 2025-07-30T10:53:08.355Z https://docs.axolotl.ai/docs/api/utils.schemas.enums.html - 2025-07-30T10:32:25.233Z + 2025-07-30T10:53:08.096Z https://docs.axolotl.ai/docs/api/core.trainers.mamba.html - 2025-07-30T10:32:24.469Z + 2025-07-30T10:53:07.332Z https://docs.axolotl.ai/docs/api/monkeypatch.llama_attn_hijack_flash.html - 2025-07-30T10:32:24.906Z + 2025-07-30T10:53:07.766Z https://docs.axolotl.ai/docs/api/monkeypatch.relora.html - 2025-07-30T10:32:24.930Z + 2025-07-30T10:53:07.790Z https://docs.axolotl.ai/docs/api/monkeypatch.stablelm_attn_hijack_flash.html - 2025-07-30T10:32:24.976Z + 2025-07-30T10:53:07.837Z https://docs.axolotl.ai/docs/api/loaders.constants.html - 2025-07-30T10:32:24.541Z + 2025-07-30T10:53:07.404Z https://docs.axolotl.ai/docs/api/utils.callbacks.qat.html - 2025-07-30T10:32:25.510Z + 2025-07-30T10:53:08.370Z https://docs.axolotl.ai/docs/api/utils.schemas.model.html - 2025-07-30T10:32:25.171Z + 2025-07-30T10:53:08.034Z https://docs.axolotl.ai/docs/api/prompt_strategies.stepwise_supervised.html - 2025-07-30T10:32:24.677Z + 2025-07-30T10:53:07.540Z https://docs.axolotl.ai/docs/api/integrations.grokfast.optimizer.html - 2025-07-30T10:32:25.377Z + 2025-07-30T10:53:08.240Z https://docs.axolotl.ai/docs/api/convert.html - 2025-07-30T10:32:24.126Z + 2025-07-30T10:53:06.990Z https://docs.axolotl.ai/docs/api/kernels.quantize.html - 2025-07-30T10:32:24.879Z + 2025-07-30T10:53:07.740Z https://docs.axolotl.ai/docs/api/core.training_args.html - 2025-07-30T10:32:24.205Z + 2025-07-30T10:53:07.069Z https://docs.axolotl.ai/docs/api/prompt_strategies.chat_template.html - 2025-07-30T10:32:24.614Z + 2025-07-30T10:53:07.476Z https://docs.axolotl.ai/docs/api/index.html - 2025-07-30T10:32:24.030Z + 2025-07-30T10:53:06.895Z https://docs.axolotl.ai/docs/api/prompt_strategies.llama2_chat.html - 2025-07-30T10:32:24.661Z + 2025-07-30T10:53:07.523Z https://docs.axolotl.ai/docs/api/utils.trainer.html - 2025-07-30T10:32:25.067Z + 2025-07-30T10:53:07.927Z https://docs.axolotl.ai/docs/api/prompt_strategies.messages.chat.html - 2025-07-30T10:32:24.699Z + 2025-07-30T10:53:07.561Z https://docs.axolotl.ai/docs/api/monkeypatch.lora_kernels.html - 2025-07-30T10:32:24.960Z + 2025-07-30T10:53:07.820Z https://docs.axolotl.ai/docs/api/kernels.lora.html - 2025-07-30T10:32:24.851Z + 2025-07-30T10:53:07.712Z https://docs.axolotl.ai/docs/api/cli.vllm_serve.html - 2025-07-30T10:32:24.424Z + 2025-07-30T10:53:07.287Z https://docs.axolotl.ai/docs/api/utils.schemas.multimodal.html - 2025-07-30T10:32:25.211Z + 2025-07-30T10:53:08.074Z https://docs.axolotl.ai/docs/api/utils.schemas.utils.html - 2025-07-30T10:32:25.239Z + 2025-07-30T10:53:08.102Z https://docs.axolotl.ai/docs/api/monkeypatch.llama_attn_hijack_xformers.html - 2025-07-30T10:32:24.907Z + 2025-07-30T10:53:07.768Z https://docs.axolotl.ai/docs/api/integrations.lm_eval.args.html - 2025-07-30T10:32:25.391Z + 2025-07-30T10:53:08.254Z https://docs.axolotl.ai/docs/api/monkeypatch.mistral_attn_hijack_flash.html - 2025-07-30T10:32:24.922Z + 2025-07-30T10:53:07.782Z https://docs.axolotl.ai/docs/api/utils.collators.core.html - 2025-07-30T10:32:25.414Z + 2025-07-30T10:53:08.278Z https://docs.axolotl.ai/docs/api/core.chat.format.chatml.html - 2025-07-30T10:32:24.230Z + 2025-07-30T10:53:07.093Z https://docs.axolotl.ai/docs/api/prompt_strategies.dpo.passthrough.html - 2025-07-30T10:32:24.725Z + 2025-07-30T10:53:07.587Z https://docs.axolotl.ai/docs/api/core.datasets.chat.html - 2025-07-30T10:32:24.238Z + 2025-07-30T10:53:07.101Z https://docs.axolotl.ai/docs/api/utils.bench.html - 2025-07-30T10:32:25.042Z + 2025-07-30T10:53:07.903Z https://docs.axolotl.ai/docs/api/utils.schemas.training.html - 2025-07-30T10:32:25.176Z + 2025-07-30T10:53:08.039Z https://docs.axolotl.ai/docs/api/utils.collators.batching.html - 2025-07-30T10:32:25.433Z + 2025-07-30T10:53:08.296Z https://docs.axolotl.ai/docs/api/monkeypatch.llama_patch_multipack.html - 2025-07-30T10:32:24.970Z + 2025-07-30T10:53:07.831Z https://docs.axolotl.ai/docs/api/monkeypatch.multipack.html - 2025-07-30T10:32:24.923Z + 2025-07-30T10:53:07.784Z https://docs.axolotl.ai/docs/api/core.builders.causal.html - 2025-07-30T10:32:24.188Z + 2025-07-30T10:53:07.052Z https://docs.axolotl.ai/docs/api/cli.evaluate.html - 2025-07-30T10:32:24.293Z + 2025-07-30T10:53:07.156Z https://docs.axolotl.ai/docs/api/monkeypatch.trainer_fsdp_optim.html - 2025-07-30T10:32:24.979Z + 2025-07-30T10:53:07.840Z https://docs.axolotl.ai/docs/api/core.trainers.utils.html - 2025-07-30T10:32:24.505Z + 2025-07-30T10:53:07.367Z https://docs.axolotl.ai/docs/api/utils.schemas.integrations.html - 2025-07-30T10:32:25.223Z + 2025-07-30T10:53:08.086Z https://docs.axolotl.ai/docs/api/utils.dict.html - 2025-07-30T10:32:25.114Z + 2025-07-30T10:53:07.977Z https://docs.axolotl.ai/docs/api/core.builders.rl.html - 2025-07-30T10:32:24.193Z + 2025-07-30T10:53:07.056Z https://docs.axolotl.ai/docs/api/prompt_strategies.orpo.chat_template.html - 2025-07-30T10:32:24.763Z + 2025-07-30T10:53:07.625Z https://docs.axolotl.ai/docs/api/core.trainers.relora.html - 2025-07-30T10:32:24.474Z + 2025-07-30T10:53:07.336Z https://docs.axolotl.ai/docs/api/integrations.spectrum.args.html - 2025-07-30T10:32:25.394Z + 2025-07-30T10:53:08.257Z https://docs.axolotl.ai/docs/api/cli.quantize.html - 2025-07-30T10:32:24.438Z + 2025-07-30T10:53:07.301Z https://docs.axolotl.ai/docs/api/cli.checks.html - 2025-07-30T10:32:24.319Z + 2025-07-30T10:53:07.182Z https://docs.axolotl.ai/docs/api/prompt_strategies.kto.llama3.html - 2025-07-30T10:32:24.733Z + 2025-07-30T10:53:07.595Z https://docs.axolotl.ai/docs/api/utils.model_shard_quant.html - 2025-07-30T10:32:25.038Z + 2025-07-30T10:53:07.899Z https://docs.axolotl.ai/docs/api/utils.quantization.html - 2025-07-30T10:32:25.150Z + 2025-07-30T10:53:08.013Z https://docs.axolotl.ai/docs/api/core.trainers.mixins.rng_state_loader.html - 2025-07-30T10:32:24.550Z + 2025-07-30T10:53:07.412Z https://docs.axolotl.ai/docs/api/kernels.geglu.html - 2025-07-30T10:32:24.862Z + 2025-07-30T10:53:07.722Z https://docs.axolotl.ai/docs/api/utils.data.pretraining.html - 2025-07-30T10:32:25.123Z + 2025-07-30T10:53:07.986Z https://docs.axolotl.ai/docs/api/prompt_strategies.kto.user_defined.html - 2025-07-30T10:32:24.743Z + 2025-07-30T10:53:07.605Z https://docs.axolotl.ai/docs/api/core.builders.base.html - 2025-07-30T10:32:24.183Z + 2025-07-30T10:53:07.047Z https://docs.axolotl.ai/docs/api/cli.merge_lora.html - 2025-07-30T10:32:24.359Z + 2025-07-30T10:53:07.222Z https://docs.axolotl.ai/docs/api/monkeypatch.mixtral.html - 2025-07-30T10:32:24.990Z + 2025-07-30T10:53:07.851Z https://docs.axolotl.ai/docs/api/utils.data.sft.html - 2025-07-30T10:32:25.130Z + 2025-07-30T10:53:07.993Z https://docs.axolotl.ai/docs/api/prompt_strategies.user_defined.html - 2025-07-30T10:32:24.648Z + 2025-07-30T10:53:07.511Z https://docs.axolotl.ai/docs/api/utils.tokenization.html - 2025-07-30T10:32:25.026Z + 2025-07-30T10:53:07.887Z https://docs.axolotl.ai/docs/api/prompt_strategies.dpo.chatml.html - 2025-07-30T10:32:24.721Z + 2025-07-30T10:53:07.583Z https://docs.axolotl.ai/docs/api/models.mamba.modeling_mamba.html - 2025-07-30T10:32:25.413Z + 2025-07-30T10:53:08.276Z https://docs.axolotl.ai/docs/api/cli.args.html - 2025-07-30T10:32:24.313Z + 2025-07-30T10:53:07.176Z https://docs.axolotl.ai/docs/api/evaluate.html - 2025-07-30T10:32:24.102Z + 2025-07-30T10:53:06.966Z https://docs.axolotl.ai/docs/api/prompt_strategies.alpaca_instruct.html - 2025-07-30T10:32:24.629Z + 2025-07-30T10:53:07.491Z https://docs.axolotl.ai/docs/api/utils.distributed.html - 2025-07-30T10:32:25.111Z + 2025-07-30T10:53:07.971Z https://docs.axolotl.ai/docs/multipack.html - 2025-07-30T10:29:12.475Z + 2025-07-30T10:49:57.101Z https://docs.axolotl.ai/examples/colab-notebooks/colab-axolotl-example.html - 2025-07-30T10:29:12.480Z + 2025-07-30T10:49:57.105Z https://docs.axolotl.ai/FAQS.html - 2025-07-30T10:29:12.470Z + 2025-07-30T10:49:57.095Z