diff --git a/.nojekyll b/.nojekyll index 14c687bce..fbaa6d79e 100644 --- a/.nojekyll +++ b/.nojekyll @@ -1 +1 @@ -146a5d7b \ No newline at end of file +0ec66adc \ No newline at end of file diff --git a/docs/api/utils.dict.html b/docs/api/utils.dict.html index 5514987af..d71c1f889 100644 --- a/docs/api/utils.dict.html +++ b/docs/api/utils.dict.html @@ -488,10 +488,6 @@ gtag('config', 'G-9KYCVJBNMQ', { 'anonymize_ip': true}); -
  • Functions -
  • @@ -526,30 +522,6 @@ gtag('config', 'G-9KYCVJBNMQ', { 'anonymize_ip': true});

    DictDefault

    utils.dict.DictDefault()

    A Dict that returns None instead of returning empty Dict for missing keys.

    - - -
    -

    Functions

    - - - - - - - - - - - - - -
    NameDescription
    remove_none_valuesRemove null from a dictionary-like obj or list.
    -
    -

    remove_none_values

    -
    utils.dict.remove_none_values(obj)
    -

    Remove null from a dictionary-like obj or list. -These can appear due to Dataset loading causing schema merge. -See https://github.com/axolotl-ai-cloud/axolotl/pull/2909

    diff --git a/docs/custom_integrations.html b/docs/custom_integrations.html index 2c480d38e..a6194dac2 100644 --- a/docs/custom_integrations.html +++ b/docs/custom_integrations.html @@ -574,7 +574,7 @@ gtag('config', 'G-9KYCVJBNMQ', { 'anonymize_ip': true}); -
    pip3 uninstall -y cut-cross-entropy && pip3 install "cut-cross-entropy[transformers] @ git+https://github.com/axolotl-ai-cloud/ml-cross-entropy.git@010c3ac3f1e725098961832830303eeb4142dd88"
    +
    pip3 uninstall -y cut-cross-entropy && pip3 install "cut-cross-entropy[transformers] @ git+https://github.com/axolotl-ai-cloud/ml-cross-entropy.git@631d646"

    Usage

    diff --git a/examples/colab-notebooks/colab-axolotl-example.html b/examples/colab-notebooks/colab-axolotl-example.html index 5f4329b66..6e94b0ed5 100644 --- a/examples/colab-notebooks/colab-axolotl-example.html +++ b/examples/colab-notebooks/colab-axolotl-example.html @@ -538,7 +538,7 @@ gtag('config', 'G-9KYCVJBNMQ', { 'anonymize_ip': true});
    %%capture
     # This step can take ~5-10 minutes to install dependencies
     !pip install --no-build-isolation axolotl[flash-attn]>=0.9.1
    -!pip install "cut-cross-entropy[transformers] @ git+https://github.com/axolotl-ai-cloud/ml-cross-entropy.git@010c3ac3f1e725098961832830303eeb4142dd88"
    +!pip install "cut-cross-entropy[transformers] @ git+https://github.com/axolotl-ai-cloud/ml-cross-entropy.git@631d646"

    Demo: Talk Like a Pirate

    diff --git a/index.html b/index.html index 424de4cca..4ed7603b4 100644 --- a/index.html +++ b/index.html @@ -528,7 +528,6 @@ gtag('config', 'G-9KYCVJBNMQ', { 'anonymize_ip': true});

    🎉 Latest Updates

      -
    • 2025/07: Voxtral with mistral-common tokenizer support has been integrated in Axolotl. Read the docs!
    • 2025/07: TiledMLP support for single-GPU to multi-GPU training with DDP, DeepSpeed and FSDP support has been added to support Arctic Long Sequence Training. (ALST). See examples for using ALST with Axolotl!
    • 2025/06: Magistral with mistral-common tokenizer support has been added to Axolotl. See examples to start training your own Magistral models with Axolotl!
    • 2025/05: Quantization Aware Training (QAT) support has been added to Axolotl. Explore the docs to learn more!
    • diff --git a/search.json b/search.json index 212ce1243..d3f20c606 100644 --- a/search.json +++ b/search.json @@ -18,7 +18,7 @@ "href": "index.html#latest-updates", "title": "Axolotl", "section": "🎉 Latest Updates", - "text": "🎉 Latest Updates\n\n2025/07: Voxtral with mistral-common tokenizer support has been integrated in Axolotl. Read the docs!\n2025/07: TiledMLP support for single-GPU to multi-GPU training with DDP, DeepSpeed and FSDP support has been added to support Arctic Long Sequence Training. (ALST). See examples for using ALST with Axolotl!\n2025/06: Magistral with mistral-common tokenizer support has been added to Axolotl. See examples to start training your own Magistral models with Axolotl!\n2025/05: Quantization Aware Training (QAT) support has been added to Axolotl. Explore the docs to learn more!\n2025/04: Llama 4 support has been added in Axolotl. See examples to start training your own Llama 4 models with Axolotl’s linearized version!\n2025/03: Axolotl has implemented Sequence Parallelism (SP) support. Read the blog and docs to learn how to scale your context length when fine-tuning.\n2025/03: (Beta) Fine-tuning Multimodal models is now supported in Axolotl. Check out the docs to fine-tune your own!\n2025/02: Axolotl has added LoRA optimizations to reduce memory usage and improve training speed for LoRA and QLoRA in single GPU and multi-GPU training (DDP and DeepSpeed). Jump into the docs to give it a try.\n2025/02: Axolotl has added GRPO support. Dive into our blog and GRPO example and have some fun!\n2025/01: Axolotl has added Reward Modelling / Process Reward Modelling fine-tuning support. See docs.", + "text": "🎉 Latest Updates\n\n2025/07: TiledMLP support for single-GPU to multi-GPU training with DDP, DeepSpeed and FSDP support has been added to support Arctic Long Sequence Training. (ALST). See examples for using ALST with Axolotl!\n2025/06: Magistral with mistral-common tokenizer support has been added to Axolotl. See examples to start training your own Magistral models with Axolotl!\n2025/05: Quantization Aware Training (QAT) support has been added to Axolotl. Explore the docs to learn more!\n2025/04: Llama 4 support has been added in Axolotl. See examples to start training your own Llama 4 models with Axolotl’s linearized version!\n2025/03: Axolotl has implemented Sequence Parallelism (SP) support. Read the blog and docs to learn how to scale your context length when fine-tuning.\n2025/03: (Beta) Fine-tuning Multimodal models is now supported in Axolotl. Check out the docs to fine-tune your own!\n2025/02: Axolotl has added LoRA optimizations to reduce memory usage and improve training speed for LoRA and QLoRA in single GPU and multi-GPU training (DDP and DeepSpeed). Jump into the docs to give it a try.\n2025/02: Axolotl has added GRPO support. Dive into our blog and GRPO example and have some fun!\n2025/01: Axolotl has added Reward Modelling / Process Reward Modelling fine-tuning support. See docs.", "crumbs": [ "Home" ] @@ -1153,7 +1153,7 @@ "href": "docs/custom_integrations.html#cut-cross-entropy", "title": "Custom Integrations", "section": "Cut Cross Entropy", - "text": "Cut Cross Entropy\nCut Cross Entropy (CCE) reduces VRAM usage through optimization on the cross-entropy operation during loss calculation.\nSee https://github.com/apple/ml-cross-entropy\n\nRequirements\n\nPyTorch 2.4.0 or higher\n\n\n\nInstallation\nRun the following command to install cut_cross_entropy[transformers] if you don’t have it already.\n\nIf you are in dev environment\n\npython scripts/cutcrossentropy_install.py | sh\n\nIf you are installing from pip\n\npip3 uninstall -y cut-cross-entropy && pip3 install \"cut-cross-entropy[transformers] @ git+https://github.com/axolotl-ai-cloud/ml-cross-entropy.git@010c3ac3f1e725098961832830303eeb4142dd88\"\n\n\nUsage\nplugins:\n - axolotl.integrations.cut_cross_entropy.CutCrossEntropyPlugin\n\n\nSupported Models\n\ncohere\ncohere2\ngemma\ngemma2\ngemma3\ngemma3_text\ngemma3n\ngemma3n_text\nglm\nglm4\nllama\nllama4\nllama4_text\nmistral\nmistral3\nmllama\nphi\nphi3\nphi4_multimodal\nqwen2\nqwen2_vl\nqwen2_moe\nqwen2_5_vl\nqwen3\nqwen3_moe\n\n\n\nCitation\n@article{wijmans2024cut,\n author = {Erik Wijmans and\n Brody Huval and\n Alexander Hertzberg and\n Vladlen Koltun and\n Philipp Kr\\\"ahenb\\\"uhl},\n title = {Cut Your Losses in Large-Vocabulary Language Models},\n journal = {arXiv},\n year = {2024},\n url = {https://arxiv.org/abs/2411.09009},\n}\nPlease see reference here", + "text": "Cut Cross Entropy\nCut Cross Entropy (CCE) reduces VRAM usage through optimization on the cross-entropy operation during loss calculation.\nSee https://github.com/apple/ml-cross-entropy\n\nRequirements\n\nPyTorch 2.4.0 or higher\n\n\n\nInstallation\nRun the following command to install cut_cross_entropy[transformers] if you don’t have it already.\n\nIf you are in dev environment\n\npython scripts/cutcrossentropy_install.py | sh\n\nIf you are installing from pip\n\npip3 uninstall -y cut-cross-entropy && pip3 install \"cut-cross-entropy[transformers] @ git+https://github.com/axolotl-ai-cloud/ml-cross-entropy.git@631d646\"\n\n\nUsage\nplugins:\n - axolotl.integrations.cut_cross_entropy.CutCrossEntropyPlugin\n\n\nSupported Models\n\ncohere\ncohere2\ngemma\ngemma2\ngemma3\ngemma3_text\ngemma3n\ngemma3n_text\nglm\nglm4\nllama\nllama4\nllama4_text\nmistral\nmistral3\nmllama\nphi\nphi3\nphi4_multimodal\nqwen2\nqwen2_vl\nqwen2_moe\nqwen2_5_vl\nqwen3\nqwen3_moe\n\n\n\nCitation\n@article{wijmans2024cut,\n author = {Erik Wijmans and\n Brody Huval and\n Alexander Hertzberg and\n Vladlen Koltun and\n Philipp Kr\\\"ahenb\\\"uhl},\n title = {Cut Your Losses in Large-Vocabulary Language Models},\n journal = {arXiv},\n year = {2024},\n url = {https://arxiv.org/abs/2411.09009},\n}\nPlease see reference here", "crumbs": [ "Advanced Features", "Custom Integrations" @@ -3539,7 +3539,7 @@ "href": "docs/api/utils.dict.html", "title": "utils.dict", "section": "", - "text": "utils.dict\nModule containing the DictDefault class\n\n\n\n\n\nName\nDescription\n\n\n\n\nDictDefault\nA Dict that returns None instead of returning empty Dict for missing keys.\n\n\n\n\n\nutils.dict.DictDefault()\nA Dict that returns None instead of returning empty Dict for missing keys.\n\n\n\n\n\n\n\nName\nDescription\n\n\n\n\nremove_none_values\nRemove null from a dictionary-like obj or list.\n\n\n\n\n\nutils.dict.remove_none_values(obj)\nRemove null from a dictionary-like obj or list.\nThese can appear due to Dataset loading causing schema merge.\nSee https://github.com/axolotl-ai-cloud/axolotl/pull/2909" + "text": "utils.dict\nModule containing the DictDefault class\n\n\n\n\n\nName\nDescription\n\n\n\n\nDictDefault\nA Dict that returns None instead of returning empty Dict for missing keys.\n\n\n\n\n\nutils.dict.DictDefault()\nA Dict that returns None instead of returning empty Dict for missing keys." }, { "objectID": "docs/api/utils.dict.html#classes", @@ -3548,13 +3548,6 @@ "section": "", "text": "Name\nDescription\n\n\n\n\nDictDefault\nA Dict that returns None instead of returning empty Dict for missing keys.\n\n\n\n\n\nutils.dict.DictDefault()\nA Dict that returns None instead of returning empty Dict for missing keys." }, - { - "objectID": "docs/api/utils.dict.html#functions", - "href": "docs/api/utils.dict.html#functions", - "title": "utils.dict", - "section": "", - "text": "Name\nDescription\n\n\n\n\nremove_none_values\nRemove null from a dictionary-like obj or list.\n\n\n\n\n\nutils.dict.remove_none_values(obj)\nRemove null from a dictionary-like obj or list.\nThese can appear due to Dataset loading causing schema merge.\nSee https://github.com/axolotl-ai-cloud/axolotl/pull/2909" - }, { "objectID": "docs/api/core.builders.rl.html", "href": "docs/api/core.builders.rl.html", diff --git a/sitemap.xml b/sitemap.xml index 08d5a7736..06669a5a6 100644 --- a/sitemap.xml +++ b/sitemap.xml @@ -2,766 +2,766 @@ https://docs.axolotl.ai/TODO.html - 2025-07-30T10:21:25.037Z + 2025-07-30T10:29:12.470Z https://docs.axolotl.ai/index.html - 2025-07-30T10:21:25.058Z + 2025-07-30T10:29:12.491Z https://docs.axolotl.ai/docs/debugging.html - 2025-07-30T10:21:25.039Z + 2025-07-30T10:29:12.472Z https://docs.axolotl.ai/docs/amd_hpc.html - 2025-07-30T10:21:25.038Z + 2025-07-30T10:29:12.471Z https://docs.axolotl.ai/docs/api/utils.callbacks.mlflow_.html - 2025-07-30T10:24:39.969Z + 2025-07-30T10:32:25.500Z https://docs.axolotl.ai/docs/api/monkeypatch.llama_expand_mask.html - 2025-07-30T10:24:39.393Z + 2025-07-30T10:32:24.931Z https://docs.axolotl.ai/docs/api/loaders.patch_manager.html - 2025-07-30T10:24:38.994Z + 2025-07-30T10:32:24.540Z https://docs.axolotl.ai/docs/api/core.chat.format.llama3x.html - 2025-07-30T10:24:38.682Z + 2025-07-30T10:32:24.231Z https://docs.axolotl.ai/docs/api/cli.train.html - 2025-07-30T10:24:38.736Z + 2025-07-30T10:32:24.285Z https://docs.axolotl.ai/docs/api/utils.callbacks.perplexity.html - 2025-07-30T10:24:39.960Z + 2025-07-30T10:32:25.491Z https://docs.axolotl.ai/docs/api/core.chat.messages.html - 2025-07-30T10:24:38.679Z + 2025-07-30T10:32:24.228Z https://docs.axolotl.ai/docs/api/utils.callbacks.lisa.html - 2025-07-30T10:24:39.966Z + 2025-07-30T10:32:25.496Z https://docs.axolotl.ai/docs/api/cli.merge_sharded_fsdp_weights.html - 2025-07-30T10:24:38.823Z + 2025-07-30T10:32:24.371Z https://docs.axolotl.ai/docs/api/cli.sweeps.html - 2025-07-30T10:24:38.838Z + 2025-07-30T10:32:24.385Z https://docs.axolotl.ai/docs/api/utils.chat_templates.html - 2025-07-30T10:24:39.491Z + 2025-07-30T10:32:25.028Z https://docs.axolotl.ai/docs/api/core.chat.format.shared.html - 2025-07-30T10:24:38.684Z + 2025-07-30T10:32:24.233Z https://docs.axolotl.ai/docs/api/core.trainers.mixins.optimizer.html - 2025-07-30T10:24:39.001Z + 2025-07-30T10:32:24.547Z https://docs.axolotl.ai/docs/api/utils.collators.mamba.html - 2025-07-30T10:24:39.908Z + 2025-07-30T10:32:25.437Z https://docs.axolotl.ai/docs/api/logging_config.html - 2025-07-30T10:24:38.628Z + 2025-07-30T10:32:24.177Z https://docs.axolotl.ai/docs/api/utils.collators.mm_chat.html - 2025-07-30T10:24:39.913Z + 2025-07-30T10:32:25.441Z https://docs.axolotl.ai/docs/api/prompt_strategies.completion.html - 2025-07-30T10:24:39.124Z + 2025-07-30T10:32:24.667Z https://docs.axolotl.ai/docs/api/kernels.utils.html - 2025-07-30T10:24:39.341Z + 2025-07-30T10:32:24.881Z https://docs.axolotl.ai/docs/api/prompt_strategies.dpo.chat_template.html - 2025-07-30T10:24:39.159Z + 2025-07-30T10:32:24.700Z https://docs.axolotl.ai/docs/api/kernels.swiglu.html - 2025-07-30T10:24:39.332Z + 2025-07-30T10:32:24.872Z https://docs.axolotl.ai/docs/api/common.const.html - 2025-07-30T10:24:39.868Z + 2025-07-30T10:32:25.397Z https://docs.axolotl.ai/docs/api/cli.cloud.base.html - 2025-07-30T10:24:38.880Z + 2025-07-30T10:32:24.427Z https://docs.axolotl.ai/docs/api/utils.callbacks.comet_.html - 2025-07-30T10:24:39.973Z + 2025-07-30T10:32:25.503Z https://docs.axolotl.ai/docs/api/monkeypatch.gradient_checkpointing.offload_cpu.html - 2025-07-30T10:24:39.457Z + 2025-07-30T10:32:24.994Z https://docs.axolotl.ai/docs/api/common.architectures.html - 2025-07-30T10:24:39.867Z + 2025-07-30T10:32:25.396Z https://docs.axolotl.ai/docs/api/prompt_strategies.pygmalion.html - 2025-07-30T10:24:39.153Z + 2025-07-30T10:32:24.695Z https://docs.axolotl.ai/docs/api/utils.schemas.peft.html - 2025-07-30T10:24:39.670Z + 2025-07-30T10:32:25.202Z https://docs.axolotl.ai/docs/api/prompt_strategies.dpo.user_defined.html - 2025-07-30T10:24:39.182Z + 2025-07-30T10:32:24.724Z https://docs.axolotl.ai/docs/api/utils.schemas.datasets.html - 2025-07-30T10:24:39.662Z + 2025-07-30T10:32:25.194Z https://docs.axolotl.ai/docs/api/prompt_strategies.alpaca_w_system.html - 2025-07-30T10:24:39.098Z + 2025-07-30T10:32:24.640Z https://docs.axolotl.ai/docs/api/prompt_strategies.base.html - 2025-07-30T10:24:39.037Z + 2025-07-30T10:32:24.581Z https://docs.axolotl.ai/docs/api/utils.lora.html - 2025-07-30T10:24:39.496Z + 2025-07-30T10:32:25.033Z https://docs.axolotl.ai/docs/api/prompt_strategies.input_output.html - 2025-07-30T10:24:39.130Z + 2025-07-30T10:32:24.673Z https://docs.axolotl.ai/docs/api/utils.schemas.trl.html - 2025-07-30T10:24:39.674Z + 2025-07-30T10:32:25.205Z https://docs.axolotl.ai/docs/api/prompt_strategies.dpo.zephyr.html - 2025-07-30T10:24:39.181Z + 2025-07-30T10:32:24.722Z https://docs.axolotl.ai/docs/api/integrations.kd.trainer.html - 2025-07-30T10:24:39.855Z + 2025-07-30T10:32:25.384Z https://docs.axolotl.ai/docs/api/monkeypatch.gradient_checkpointing.offload_disk.html - 2025-07-30T10:24:39.482Z + 2025-07-30T10:32:25.019Z https://docs.axolotl.ai/docs/api/utils.optimizers.adopt.html - 2025-07-30T10:24:39.587Z + 2025-07-30T10:32:25.122Z https://docs.axolotl.ai/docs/api/monkeypatch.data.batch_dataset_fetcher.html - 2025-07-30T10:24:39.452Z + 2025-07-30T10:32:24.989Z https://docs.axolotl.ai/docs/api/cli.cloud.modal_.html - 2025-07-30T10:24:38.886Z + 2025-07-30T10:32:24.433Z https://docs.axolotl.ai/docs/api/prompt_strategies.alpaca_chat.html - 2025-07-30T10:24:39.084Z + 2025-07-30T10:32:24.627Z https://docs.axolotl.ai/docs/api/utils.freeze.html - 2025-07-30T10:24:39.513Z + 2025-07-30T10:32:25.049Z https://docs.axolotl.ai/docs/api/prompt_strategies.bradley_terry.llama3.html - 2025-07-30T10:24:39.226Z + 2025-07-30T10:32:24.767Z https://docs.axolotl.ai/docs/api/integrations.base.html - 2025-07-30T10:24:39.843Z + 2025-07-30T10:32:25.372Z https://docs.axolotl.ai/docs/api/monkeypatch.unsloth_.html - 2025-07-30T10:24:39.450Z + 2025-07-30T10:32:24.987Z https://docs.axolotl.ai/docs/api/prompt_strategies.kto.chatml.html - 2025-07-30T10:24:39.200Z + 2025-07-30T10:32:24.741Z https://docs.axolotl.ai/docs/api/cli.main.html - 2025-07-30T10:24:38.728Z + 2025-07-30T10:32:24.277Z https://docs.axolotl.ai/docs/api/common.datasets.html - 2025-07-30T10:24:39.883Z + 2025-07-30T10:32:25.412Z https://docs.axolotl.ai/docs/api/train.html - 2025-07-30T10:24:38.541Z + 2025-07-30T10:32:24.092Z https://docs.axolotl.ai/docs/api/core.trainers.base.html - 2025-07-30T10:24:38.902Z + 2025-07-30T10:32:24.449Z https://docs.axolotl.ai/docs/api/core.trainers.mixins.scheduler.html - 2025-07-30T10:24:39.011Z + 2025-07-30T10:32:24.556Z https://docs.axolotl.ai/docs/api/utils.ctx_managers.sequence_parallel.html - 2025-07-30T10:24:39.035Z + 2025-07-30T10:32:24.580Z https://docs.axolotl.ai/docs/api/utils.schemas.config.html - 2025-07-30T10:24:39.631Z + 2025-07-30T10:32:25.164Z https://docs.axolotl.ai/docs/api/loaders.tokenizer.html - 2025-07-30T10:24:38.977Z + 2025-07-30T10:32:24.522Z https://docs.axolotl.ai/docs/api/integrations.liger.args.html - 2025-07-30T10:24:39.859Z + 2025-07-30T10:32:25.388Z https://docs.axolotl.ai/docs/api/cli.config.html - 2025-07-30T10:24:38.788Z + 2025-07-30T10:32:24.337Z https://docs.axolotl.ai/docs/api/loaders.processor.html - 2025-07-30T10:24:38.978Z + 2025-07-30T10:32:24.524Z https://docs.axolotl.ai/docs/api/monkeypatch.utils.html - 2025-07-30T10:24:39.430Z + 2025-07-30T10:32:24.967Z https://docs.axolotl.ai/docs/api/integrations.cut_cross_entropy.args.html - 2025-07-30T10:24:39.847Z + 2025-07-30T10:32:25.376Z https://docs.axolotl.ai/docs/api/core.trainers.dpo.trainer.html - 2025-07-30T10:24:38.933Z + 2025-07-30T10:32:24.480Z https://docs.axolotl.ai/docs/api/loaders.adapter.html - 2025-07-30T10:24:38.984Z + 2025-07-30T10:32:24.529Z https://docs.axolotl.ai/docs/api/prompt_strategies.dpo.llama3.html - 2025-07-30T10:24:39.169Z + 2025-07-30T10:32:24.711Z https://docs.axolotl.ai/docs/api/core.datasets.transforms.chat_builder.html - 2025-07-30T10:24:38.697Z + 2025-07-30T10:32:24.245Z https://docs.axolotl.ai/docs/api/monkeypatch.transformers_fa_utils.html - 2025-07-30T10:24:39.449Z + 2025-07-30T10:32:24.986Z https://docs.axolotl.ai/docs/api/datasets.html - 2025-07-30T10:24:38.562Z + 2025-07-30T10:32:24.113Z https://docs.axolotl.ai/docs/api/monkeypatch.btlm_attn_hijack_flash.html - 2025-07-30T10:24:39.432Z + 2025-07-30T10:32:24.969Z https://docs.axolotl.ai/docs/api/cli.inference.html - 2025-07-30T10:24:38.802Z + 2025-07-30T10:32:24.351Z https://docs.axolotl.ai/docs/api/cli.utils.html - 2025-07-30T10:24:38.870Z + 2025-07-30T10:32:24.417Z https://docs.axolotl.ai/docs/api/cli.preprocess.html - 2025-07-30T10:24:38.831Z + 2025-07-30T10:32:24.379Z https://docs.axolotl.ai/docs/api/loaders.model.html - 2025-07-30T10:24:38.968Z + 2025-07-30T10:32:24.514Z https://docs.axolotl.ai/docs/api/core.trainers.grpo.sampler.html - 2025-07-30T10:24:38.957Z + 2025-07-30T10:32:24.503Z https://docs.axolotl.ai/docs/api/core.trainers.grpo.trainer.html - 2025-07-30T10:24:38.944Z + 2025-07-30T10:32:24.491Z https://docs.axolotl.ai/docs/batch_vs_grad.html - 2025-07-30T10:21:25.038Z + 2025-07-30T10:29:12.471Z https://docs.axolotl.ai/docs/custom_integrations.html - 2025-07-30T10:21:25.038Z + 2025-07-30T10:29:12.471Z https://docs.axolotl.ai/docs/quantize.html - 2025-07-30T10:21:25.042Z + 2025-07-30T10:29:12.475Z https://docs.axolotl.ai/docs/installation.html - 2025-07-30T10:21:25.042Z + 2025-07-30T10:29:12.475Z https://docs.axolotl.ai/docs/docker.html - 2025-07-30T10:21:25.039Z + 2025-07-30T10:29:12.472Z https://docs.axolotl.ai/docs/reward_modelling.html - 2025-07-30T10:21:25.042Z + 2025-07-30T10:29:12.476Z https://docs.axolotl.ai/docs/dataset-formats/inst_tune.html - 2025-07-30T10:21:25.038Z + 2025-07-30T10:29:12.472Z https://docs.axolotl.ai/docs/dataset-formats/conversation.html - 2025-07-30T10:21:25.038Z + 2025-07-30T10:29:12.471Z https://docs.axolotl.ai/docs/dataset-formats/tokenized.html - 2025-07-30T10:21:25.039Z + 2025-07-30T10:29:12.472Z https://docs.axolotl.ai/docs/rlhf.html - 2025-07-30T10:21:25.042Z + 2025-07-30T10:29:12.476Z https://docs.axolotl.ai/docs/sequence_parallelism.html - 2025-07-30T10:21:25.043Z + 2025-07-30T10:29:12.476Z https://docs.axolotl.ai/docs/inference.html - 2025-07-30T10:21:25.042Z + 2025-07-30T10:29:12.475Z https://docs.axolotl.ai/docs/fsdp_qlora.html - 2025-07-30T10:21:25.039Z + 2025-07-30T10:29:12.472Z https://docs.axolotl.ai/docs/multi-node.html - 2025-07-30T10:21:25.042Z + 2025-07-30T10:29:12.475Z https://docs.axolotl.ai/docs/lora_optims.html - 2025-07-30T10:21:25.042Z + 2025-07-30T10:29:12.475Z https://docs.axolotl.ai/docs/getting-started.html - 2025-07-30T10:21:25.039Z + 2025-07-30T10:29:12.472Z https://docs.axolotl.ai/docs/dataset_loading.html - 2025-07-30T10:21:25.039Z + 2025-07-30T10:29:12.472Z https://docs.axolotl.ai/docs/lr_groups.html - 2025-07-30T10:21:25.042Z + 2025-07-30T10:29:12.475Z https://docs.axolotl.ai/docs/mac.html - 2025-07-30T10:21:25.042Z + 2025-07-30T10:29:12.475Z https://docs.axolotl.ai/src/axolotl/integrations/cut_cross_entropy/ACKNOWLEDGEMENTS.html - 2025-07-30T10:21:25.062Z + 2025-07-30T10:29:12.495Z https://docs.axolotl.ai/src/axolotl/integrations/LICENSE.html - 2025-07-30T10:21:25.062Z + 2025-07-30T10:29:12.495Z https://docs.axolotl.ai/docs/input_output.html - 2025-07-30T10:21:25.042Z + 2025-07-30T10:29:12.475Z https://docs.axolotl.ai/docs/gradient_checkpointing.html - 2025-07-30T10:21:25.039Z + 2025-07-30T10:29:12.472Z https://docs.axolotl.ai/docs/qat.html - 2025-07-30T10:21:25.042Z + 2025-07-30T10:29:12.475Z https://docs.axolotl.ai/docs/faq.html - 2025-07-30T10:21:25.039Z + 2025-07-30T10:29:12.472Z https://docs.axolotl.ai/docs/dataset_preprocessing.html - 2025-07-30T10:21:25.039Z + 2025-07-30T10:29:12.472Z https://docs.axolotl.ai/docs/nccl.html - 2025-07-30T10:21:25.042Z + 2025-07-30T10:29:12.475Z https://docs.axolotl.ai/docs/cli.html - 2025-07-30T10:21:25.038Z + 2025-07-30T10:29:12.471Z https://docs.axolotl.ai/docs/torchao.html - 2025-07-30T10:21:25.043Z + 2025-07-30T10:29:12.476Z https://docs.axolotl.ai/docs/multi-gpu.html - 2025-07-30T10:21:25.042Z + 2025-07-30T10:29:12.475Z https://docs.axolotl.ai/docs/dataset-formats/pretraining.html - 2025-07-30T10:21:25.038Z + 2025-07-30T10:29:12.472Z https://docs.axolotl.ai/docs/dataset-formats/index.html - 2025-07-30T10:21:25.038Z + 2025-07-30T10:29:12.472Z https://docs.axolotl.ai/docs/dataset-formats/template_free.html - 2025-07-30T10:21:25.039Z + 2025-07-30T10:29:12.472Z https://docs.axolotl.ai/docs/dataset-formats/stepwise_supervised.html - 2025-07-30T10:21:25.038Z + 2025-07-30T10:29:12.472Z https://docs.axolotl.ai/docs/ray-integration.html - 2025-07-30T10:21:25.042Z + 2025-07-30T10:29:12.476Z https://docs.axolotl.ai/docs/unsloth.html - 2025-07-30T10:21:25.043Z + 2025-07-30T10:29:12.476Z https://docs.axolotl.ai/docs/mixed_precision.html - 2025-07-30T10:21:25.042Z + 2025-07-30T10:29:12.475Z https://docs.axolotl.ai/docs/multimodal.html - 2025-07-30T10:21:25.042Z + 2025-07-30T10:29:12.475Z https://docs.axolotl.ai/docs/config-reference.html - 2025-07-30T10:24:53.939Z + 2025-07-30T10:32:38.090Z https://docs.axolotl.ai/docs/api/prompt_tokenizers.html - 2025-07-30T10:24:38.618Z + 2025-07-30T10:32:24.168Z https://docs.axolotl.ai/docs/api/utils.schedulers.html - 2025-07-30T10:24:39.554Z + 2025-07-30T10:32:25.091Z https://docs.axolotl.ai/docs/api/utils.samplers.multipack.html - 2025-07-30T10:24:39.954Z + 2025-07-30T10:32:25.484Z https://docs.axolotl.ai/docs/api/prompt_strategies.orcamini.html - 2025-07-30T10:24:39.146Z + 2025-07-30T10:32:24.688Z https://docs.axolotl.ai/docs/api/core.trainers.trl.html - 2025-07-30T10:24:38.917Z + 2025-07-30T10:32:24.463Z https://docs.axolotl.ai/docs/api/prompt_strategies.metharme.html - 2025-07-30T10:24:39.142Z + 2025-07-30T10:32:24.684Z https://docs.axolotl.ai/docs/api/utils.callbacks.profiler.html - 2025-07-30T10:24:39.964Z + 2025-07-30T10:32:25.495Z https://docs.axolotl.ai/docs/api/utils.schemas.enums.html - 2025-07-30T10:24:39.702Z + 2025-07-30T10:32:25.233Z https://docs.axolotl.ai/docs/api/core.trainers.mamba.html - 2025-07-30T10:24:38.922Z + 2025-07-30T10:32:24.469Z https://docs.axolotl.ai/docs/api/monkeypatch.llama_attn_hijack_flash.html - 2025-07-30T10:24:39.367Z + 2025-07-30T10:32:24.906Z https://docs.axolotl.ai/docs/api/monkeypatch.relora.html - 2025-07-30T10:24:39.392Z + 2025-07-30T10:32:24.930Z https://docs.axolotl.ai/docs/api/monkeypatch.stablelm_attn_hijack_flash.html - 2025-07-30T10:24:39.439Z + 2025-07-30T10:32:24.976Z https://docs.axolotl.ai/docs/api/loaders.constants.html - 2025-07-30T10:24:38.995Z + 2025-07-30T10:32:24.541Z https://docs.axolotl.ai/docs/api/utils.callbacks.qat.html - 2025-07-30T10:24:39.980Z + 2025-07-30T10:32:25.510Z https://docs.axolotl.ai/docs/api/utils.schemas.model.html - 2025-07-30T10:24:39.638Z + 2025-07-30T10:32:25.171Z https://docs.axolotl.ai/docs/api/prompt_strategies.stepwise_supervised.html - 2025-07-30T10:24:39.135Z + 2025-07-30T10:32:24.677Z https://docs.axolotl.ai/docs/api/integrations.grokfast.optimizer.html - 2025-07-30T10:24:39.848Z + 2025-07-30T10:32:25.377Z https://docs.axolotl.ai/docs/api/convert.html - 2025-07-30T10:24:38.576Z + 2025-07-30T10:32:24.126Z https://docs.axolotl.ai/docs/api/kernels.quantize.html - 2025-07-30T10:24:39.340Z + 2025-07-30T10:32:24.879Z https://docs.axolotl.ai/docs/api/core.training_args.html - 2025-07-30T10:24:38.656Z + 2025-07-30T10:32:24.205Z https://docs.axolotl.ai/docs/api/prompt_strategies.chat_template.html - 2025-07-30T10:24:39.070Z + 2025-07-30T10:32:24.614Z https://docs.axolotl.ai/docs/api/index.html - 2025-07-30T10:24:38.479Z + 2025-07-30T10:32:24.030Z https://docs.axolotl.ai/docs/api/prompt_strategies.llama2_chat.html - 2025-07-30T10:24:39.118Z + 2025-07-30T10:32:24.661Z https://docs.axolotl.ai/docs/api/utils.trainer.html - 2025-07-30T10:24:39.530Z + 2025-07-30T10:32:25.067Z https://docs.axolotl.ai/docs/api/prompt_strategies.messages.chat.html - 2025-07-30T10:24:39.157Z + 2025-07-30T10:32:24.699Z https://docs.axolotl.ai/docs/api/monkeypatch.lora_kernels.html - 2025-07-30T10:24:39.422Z + 2025-07-30T10:32:24.960Z https://docs.axolotl.ai/docs/api/kernels.lora.html - 2025-07-30T10:24:39.311Z + 2025-07-30T10:32:24.851Z https://docs.axolotl.ai/docs/api/cli.vllm_serve.html - 2025-07-30T10:24:38.877Z + 2025-07-30T10:32:24.424Z https://docs.axolotl.ai/docs/api/utils.schemas.multimodal.html - 2025-07-30T10:24:39.679Z + 2025-07-30T10:32:25.211Z https://docs.axolotl.ai/docs/api/utils.schemas.utils.html - 2025-07-30T10:24:39.707Z + 2025-07-30T10:32:25.239Z https://docs.axolotl.ai/docs/api/monkeypatch.llama_attn_hijack_xformers.html - 2025-07-30T10:24:39.369Z + 2025-07-30T10:32:24.907Z https://docs.axolotl.ai/docs/api/integrations.lm_eval.args.html - 2025-07-30T10:24:39.862Z + 2025-07-30T10:32:25.391Z https://docs.axolotl.ai/docs/api/monkeypatch.mistral_attn_hijack_flash.html - 2025-07-30T10:24:39.383Z + 2025-07-30T10:32:24.922Z https://docs.axolotl.ai/docs/api/utils.collators.core.html - 2025-07-30T10:24:39.886Z + 2025-07-30T10:32:25.414Z https://docs.axolotl.ai/docs/api/core.chat.format.chatml.html - 2025-07-30T10:24:38.681Z + 2025-07-30T10:32:24.230Z https://docs.axolotl.ai/docs/api/prompt_strategies.dpo.passthrough.html - 2025-07-30T10:24:39.184Z + 2025-07-30T10:32:24.725Z https://docs.axolotl.ai/docs/api/core.datasets.chat.html - 2025-07-30T10:24:38.689Z + 2025-07-30T10:32:24.238Z https://docs.axolotl.ai/docs/api/utils.bench.html - 2025-07-30T10:24:39.505Z + 2025-07-30T10:32:25.042Z https://docs.axolotl.ai/docs/api/utils.schemas.training.html - 2025-07-30T10:24:39.643Z + 2025-07-30T10:32:25.176Z https://docs.axolotl.ai/docs/api/utils.collators.batching.html - 2025-07-30T10:24:39.905Z + 2025-07-30T10:32:25.433Z https://docs.axolotl.ai/docs/api/monkeypatch.llama_patch_multipack.html - 2025-07-30T10:24:39.433Z + 2025-07-30T10:32:24.970Z https://docs.axolotl.ai/docs/api/monkeypatch.multipack.html - 2025-07-30T10:24:39.385Z + 2025-07-30T10:32:24.923Z https://docs.axolotl.ai/docs/api/core.builders.causal.html - 2025-07-30T10:24:38.639Z + 2025-07-30T10:32:24.188Z https://docs.axolotl.ai/docs/api/cli.evaluate.html - 2025-07-30T10:24:38.744Z + 2025-07-30T10:32:24.293Z https://docs.axolotl.ai/docs/api/monkeypatch.trainer_fsdp_optim.html - 2025-07-30T10:24:39.442Z + 2025-07-30T10:32:24.979Z https://docs.axolotl.ai/docs/api/core.trainers.utils.html - 2025-07-30T10:24:38.958Z + 2025-07-30T10:32:24.505Z https://docs.axolotl.ai/docs/api/utils.schemas.integrations.html - 2025-07-30T10:24:39.691Z + 2025-07-30T10:32:25.223Z https://docs.axolotl.ai/docs/api/utils.dict.html - 2025-07-30T10:24:39.580Z + 2025-07-30T10:32:25.114Z https://docs.axolotl.ai/docs/api/core.builders.rl.html - 2025-07-30T10:24:38.643Z + 2025-07-30T10:32:24.193Z https://docs.axolotl.ai/docs/api/prompt_strategies.orpo.chat_template.html - 2025-07-30T10:24:39.222Z + 2025-07-30T10:32:24.763Z https://docs.axolotl.ai/docs/api/core.trainers.relora.html - 2025-07-30T10:24:38.927Z + 2025-07-30T10:32:24.474Z https://docs.axolotl.ai/docs/api/integrations.spectrum.args.html - 2025-07-30T10:24:39.865Z + 2025-07-30T10:32:25.394Z https://docs.axolotl.ai/docs/api/cli.quantize.html - 2025-07-30T10:24:38.891Z + 2025-07-30T10:32:24.438Z https://docs.axolotl.ai/docs/api/cli.checks.html - 2025-07-30T10:24:38.770Z + 2025-07-30T10:32:24.319Z https://docs.axolotl.ai/docs/api/prompt_strategies.kto.llama3.html - 2025-07-30T10:24:39.192Z + 2025-07-30T10:32:24.733Z https://docs.axolotl.ai/docs/api/utils.model_shard_quant.html - 2025-07-30T10:24:39.501Z + 2025-07-30T10:32:25.038Z https://docs.axolotl.ai/docs/api/utils.quantization.html - 2025-07-30T10:24:39.617Z + 2025-07-30T10:32:25.150Z https://docs.axolotl.ai/docs/api/core.trainers.mixins.rng_state_loader.html - 2025-07-30T10:24:39.004Z + 2025-07-30T10:32:24.550Z https://docs.axolotl.ai/docs/api/kernels.geglu.html - 2025-07-30T10:24:39.322Z + 2025-07-30T10:32:24.862Z https://docs.axolotl.ai/docs/api/utils.data.pretraining.html - 2025-07-30T10:24:39.589Z + 2025-07-30T10:32:25.123Z https://docs.axolotl.ai/docs/api/prompt_strategies.kto.user_defined.html - 2025-07-30T10:24:39.202Z + 2025-07-30T10:32:24.743Z https://docs.axolotl.ai/docs/api/core.builders.base.html - 2025-07-30T10:24:38.634Z + 2025-07-30T10:32:24.183Z https://docs.axolotl.ai/docs/api/cli.merge_lora.html - 2025-07-30T10:24:38.811Z + 2025-07-30T10:32:24.359Z https://docs.axolotl.ai/docs/api/monkeypatch.mixtral.html - 2025-07-30T10:24:39.453Z + 2025-07-30T10:32:24.990Z https://docs.axolotl.ai/docs/api/utils.data.sft.html - 2025-07-30T10:24:39.596Z + 2025-07-30T10:32:25.130Z https://docs.axolotl.ai/docs/api/prompt_strategies.user_defined.html - 2025-07-30T10:24:39.106Z + 2025-07-30T10:32:24.648Z https://docs.axolotl.ai/docs/api/utils.tokenization.html - 2025-07-30T10:24:39.489Z + 2025-07-30T10:32:25.026Z https://docs.axolotl.ai/docs/api/prompt_strategies.dpo.chatml.html - 2025-07-30T10:24:39.179Z + 2025-07-30T10:32:24.721Z https://docs.axolotl.ai/docs/api/models.mamba.modeling_mamba.html - 2025-07-30T10:24:39.884Z + 2025-07-30T10:32:25.413Z https://docs.axolotl.ai/docs/api/cli.args.html - 2025-07-30T10:24:38.764Z + 2025-07-30T10:32:24.313Z https://docs.axolotl.ai/docs/api/evaluate.html - 2025-07-30T10:24:38.551Z + 2025-07-30T10:32:24.102Z https://docs.axolotl.ai/docs/api/prompt_strategies.alpaca_instruct.html - 2025-07-30T10:24:39.086Z + 2025-07-30T10:32:24.629Z https://docs.axolotl.ai/docs/api/utils.distributed.html - 2025-07-30T10:24:39.574Z + 2025-07-30T10:32:25.111Z https://docs.axolotl.ai/docs/multipack.html - 2025-07-30T10:21:25.042Z + 2025-07-30T10:29:12.475Z https://docs.axolotl.ai/examples/colab-notebooks/colab-axolotl-example.html - 2025-07-30T10:21:25.046Z + 2025-07-30T10:29:12.480Z https://docs.axolotl.ai/FAQS.html - 2025-07-30T10:21:25.037Z + 2025-07-30T10:29:12.470Z