ZeRO Stage 3 can be used for training on a single GPU by manually setting the environment variables:
-WORLD_SIZE=1 LOCAL_RANK=0 MASTER_ADDR=0.0.0.0 MASTER_PORT=29500
-
-
@@ -1292,132 +1277,123 @@ single sequence causes OOM errors during model training.
:::
-::: {.callout-tip}
+## Fully Sharded Data Parallel (FSDP) {#sec-fsdp}
-Using ZeRO Stage 3 with Single-GPU training
+::: {.callout-note}
-ZeRO Stage 3 can be used for training on a single GPU by manually setting the environment variables:
-`WORLD_SIZE=1 LOCAL_RANK=0 MASTER_ADDR=0.0.0.0 MASTER_PORT=29500`
-
-:::
-
-## Fully Sharded Data Parallel (FSDP) {#sec-fsdp}
-
-::: {.callout-note}
+FSDP2 is recommended for new users. FSDP1 is deprecated and will be removed in an upcoming release of Axolotl.
+
+:::
+
+### Migrating from FSDP1 to FSDP2 {#sec-migrate-fsdp1-fsdp2}
+
+To migrate your config from FSDP1 to FSDP2, set the top-level `fsdp_version` field to `2` and
+rename the fields under `fsdp_config` according to the mapping below.
-FSDP2 is recommended for new users. FSDP1 is deprecated and will be removed in an upcoming release of Axolotl.
+#### Config mapping
-:::
-
-### Migrating from FSDP1 to FSDP2 {#sec-migrate-fsdp1-fsdp2}
-
-To migrate your config from FSDP1 to FSDP2, you must use the `fsdp_version` top-level config field to specify the FSDP version, and
-also follow the config field mapping below to update field names.
-
-#### Config mapping
-
-FSDP1 | FSDP2
--------- | --------
-fsdp_sharding_strategy | reshard_after_forward
-fsdp_backward_prefetch_policy | **REMOVED**
-fsdp_backward_prefetch | **REMOVED**
-fsdp_forward_prefetch | **REMOVED**
-fsdp_sync_module_states | **REMOVED**
-fsdp_cpu_ram_efficient_loading | cpu_ram_efficient_loading
-fsdp_state_dict_type | state_dict_type
-fsdp_use_orig_params | **REMOVED**
-
-For more details, please see the migration guide in the [torchtitan repo](https://github.com/pytorch/torchtitan/blob/main/docs/fsdp.md). In Axolotl,
-if you were using the following FSDP1 config:
-
-```{.yaml}
-fsdp_version: 1
-fsdp_config:
-  fsdp_offload_params: false
-  fsdp_cpu_ram_efficient_loading: true
-  fsdp_auto_wrap_policy: TRANSFORMER_BASED_WRAP
-  fsdp_transformer_layer_cls_to_wrap: Qwen3DecoderLayer
-  fsdp_state_dict_type: FULL_STATE_DICT
-  fsdp_sharding_strategy: FULL_SHARD
-```
-
-You can migrate to the following FSDP2 config:
-
-```{.yaml}
-fsdp_version: 2
-fsdp_config:
-  offload_params: false
-  cpu_ram_efficient_loading: true
-  auto_wrap_policy: TRANSFORMER_BASED_WRAP
-  transformer_layer_cls_to_wrap: Qwen3DecoderLayer
-  state_dict_type: FULL_STATE_DICT
-  reshard_after_forward: true
-```
-
-### FSDP1 (deprecated) {#sec-fsdp-config}
-
-::: {.callout-note}
-
-Using `fsdp` to configure FSDP is deprecated and will be removed in an upcoming release of Axolotl. Please use `fsdp_config` as above instead.
-
-:::
-
-```{.yaml}
-fsdp:
-  - full_shard
-  - auto_wrap
-fsdp_config:
-  fsdp_offload_params: true
-  fsdp_state_dict_type: FULL_STATE_DICT
-  fsdp_transformer_layer_cls_to_wrap: LlamaDecoderLayer
-```
-
+FSDP1 | FSDP2
+-------- | --------
+fsdp_sharding_strategy | reshard_after_forward
+fsdp_backward_prefetch_policy | **REMOVED**
+fsdp_backward_prefetch | **REMOVED**
+fsdp_forward_prefetch | **REMOVED**
+fsdp_sync_module_states | **REMOVED**
+fsdp_cpu_ram_efficient_loading | cpu_ram_efficient_loading
+fsdp_state_dict_type | state_dict_type
+fsdp_use_orig_params | **REMOVED**
+
+For more details, please see the migration guide in the [torchtitan repo](https://github.com/pytorch/torchtitan/blob/main/docs/fsdp.md). In Axolotl,
+if you were using the following FSDP1 config:
+
+```{.yaml}
+fsdp_version: 1
+fsdp_config:
+  fsdp_offload_params: false
+  fsdp_cpu_ram_efficient_loading: true
+  fsdp_auto_wrap_policy: TRANSFORMER_BASED_WRAP
+  fsdp_transformer_layer_cls_to_wrap: Qwen3DecoderLayer
+  fsdp_state_dict_type: FULL_STATE_DICT
+  fsdp_sharding_strategy: FULL_SHARD
+```
+
+You can migrate to the following FSDP2 config:
+
+```{.yaml}
+fsdp_version: 2
+fsdp_config:
+  offload_params: false
+  cpu_ram_efficient_loading: true
+  auto_wrap_policy: TRANSFORMER_BASED_WRAP
+  transformer_layer_cls_to_wrap: Qwen3DecoderLayer
+  state_dict_type: FULL_STATE_DICT
+  reshard_after_forward: true
+```
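
The field mapping and the before/after configs above can be sketched as a small conversion function. This is a hypothetical helper for illustration, not part of Axolotl: the name `migrate_fsdp1_to_fsdp2` is invented here, and the rule "strip the `fsdp_` prefix from any field not listed in the table" is inferred from the example configs rather than stated by the docs. The table only says that `fsdp_sharding_strategy` becomes `reshard_after_forward`, and the example only confirms `FULL_SHARD` becoming `true`; mapping `SHARD_GRAD_OP` to `false` is an assumption, and any other strategy is rejected rather than guessed.

```python
# Hypothetical helper (not part of Axolotl): converts an FSDP1-style config
# dict into the FSDP2 layout described by the mapping table.

# Fields marked **REMOVED** in the mapping table.
REMOVED_KEYS = {
    "fsdp_backward_prefetch_policy",
    "fsdp_backward_prefetch",
    "fsdp_forward_prefetch",
    "fsdp_sync_module_states",
    "fsdp_use_orig_params",
}

# FULL_SHARD -> True is taken from the example configs.
# SHARD_GRAD_OP -> False is an assumption, not confirmed by the source.
SHARDING_TO_RESHARD = {"FULL_SHARD": True, "SHARD_GRAD_OP": False}


def migrate_fsdp1_to_fsdp2(config: dict) -> dict:
    """Return a new config dict with FSDP2 field names; the input is not modified."""
    # Copy every top-level key except the two that are rewritten below.
    migrated = {
        key: value
        for key, value in config.items()
        if key not in ("fsdp_version", "fsdp_config")
    }
    migrated["fsdp_version"] = 2

    new_fsdp_config = {}
    for key, value in (config.get("fsdp_config") or {}).items():
        if key in REMOVED_KEYS:
            continue  # no FSDP2 equivalent, so the field is dropped
        if key == "fsdp_sharding_strategy":
            if value not in SHARDING_TO_RESHARD:
                raise ValueError(f"no known FSDP2 equivalent for {value!r}")
            new_fsdp_config["reshard_after_forward"] = SHARDING_TO_RESHARD[value]
            continue
        # Inferred rule: the remaining fields keep their name minus the prefix.
        new_key = key[len("fsdp_"):] if key.startswith("fsdp_") else key
        new_fsdp_config[new_key] = value

    migrated["fsdp_config"] = new_fsdp_config
    return migrated


old_config = {
    "fsdp_version": 1,
    "fsdp_config": {
        "fsdp_offload_params": False,
        "fsdp_cpu_ram_efficient_loading": True,
        "fsdp_auto_wrap_policy": "TRANSFORMER_BASED_WRAP",
        "fsdp_transformer_layer_cls_to_wrap": "Qwen3DecoderLayer",
        "fsdp_state_dict_type": "FULL_STATE_DICT",
        "fsdp_sharding_strategy": "FULL_SHARD",
    },
}
new_config = migrate_fsdp1_to_fsdp2(old_config)
```

Raising on an unrecognized sharding strategy is a deliberate choice in this sketch: a silent default could change memory behavior without any visible error.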
+
+### FSDP1 (deprecated) {#sec-fsdp-config}
+
+::: {.callout-note}
+
+Using `fsdp` to configure FSDP is deprecated and will be removed in an upcoming release of Axolotl. Please use `fsdp_config` as above instead.
+
+:::
+
+```{.yaml}
+fsdp:
+  - full_shard
+  - auto_wrap
+fsdp_config:
+  fsdp_offload_params: true
+  fsdp_state_dict_type: FULL_STATE_DICT
+  fsdp_transformer_layer_cls_to_wrap: LlamaDecoderLayer
+```
+
+
+## Sequence parallelism {#sec-sequence-parallelism}
+
+We support sequence parallelism (SP) via the
+[ring-flash-attention](https://github.com/zhuzilin/ring-flash-attention) project. This
+splits sequences across GPUs, which is useful when a
+single sequence causes out-of-memory (OOM) errors during training.
+
+See our [dedicated guide](sequence_parallelism.qmd) for more information.
-## Sequence parallelism {#sec-sequence-parallelism}
+### FSDP + QLoRA {#sec-fsdp-qlora}
-We support sequence parallelism (SP) via the
-[ring-flash-attention](https://github.com/zhuzilin/ring-flash-attention) project. This
-allows one to split up sequences across GPUs, which is useful in the event that a
-single sequence causes OOM errors during model training.
-
-See our [dedicated guide](sequence_parallelism.qmd) for more information.
-
-### FSDP + QLoRA {#sec-fsdp-qlora}
-
-For combining FSDP with QLoRA, see our [dedicated guide](fsdp_qlora.qmd).
-
-## Performance Optimization {#sec-performance}
-
-### Liger Kernel Integration {#sec-liger}
-
-Please see [docs](custom_integrations.qmd#liger) for more info.
-
-## Troubleshooting {#sec-troubleshooting}
-
-### NCCL Issues {#sec-nccl}
-
-For NCCL-related problems, see our [NCCL troubleshooting guide](nccl.qmd).
-
-### Common Problems {#sec-common-problems}
+For combining FSDP with QLoRA, see our [dedicated guide](fsdp_qlora.qmd).
+
+## Performance Optimization {#sec-performance}
+
+### Liger Kernel Integration {#sec-liger}
+
+Please see [docs](custom_integrations.qmd#liger) for more info.
+
+## Troubleshooting {#sec-troubleshooting}
+
+### NCCL Issues {#sec-nccl}
+
+For NCCL-related problems, see our [NCCL troubleshooting guide](nccl.qmd).
+
+### Common Problems {#sec-common-problems}
+
+::: {.panel-tabset}
+
+## Memory Issues
+
+- Reduce `micro_batch_size`
+- Reduce `eval_batch_size`
+- Adjust `gradient_accumulation_steps`
+- Consider using a higher ZeRO stage
-::: {.panel-tabset}
+## Training Instability
-## Memory Issues
-
-- Reduce `micro_batch_size`
-- Reduce `eval_batch_size`
-- Adjust `gradient_accumulation_steps`
-- Consider using a higher ZeRO stage
-
-## Training Instability
-
-- Start with DeepSpeed ZeRO-2
-- Monitor loss values
-- Check learning rates
-
-:::
-
-For more detailed troubleshooting, see our [debugging guide](debugging.qmd).
+- Start with DeepSpeed ZeRO-2
+- Monitor loss values
+- Check learning rates
+
+:::
+
+For more detailed troubleshooting, see our [debugging guide](debugging.qmd).
diff --git a/docs/multi-node.html b/docs/multi-node.html
index fab620293..3f9964f1f 100644
--- a/docs/multi-node.html
+++ b/docs/multi-node.html
@@ -2,7 +2,7 @@
-
+
@@ -72,7 +72,7 @@ pre > code.sourceCode > span > a:first-child::before { text-decoration: underlin
-
+
diff --git a/docs/multimodal.html b/docs/multimodal.html
index 80f130ddf..18cb894a7 100644
--- a/docs/multimodal.html
+++ b/docs/multimodal.html
@@ -2,7 +2,7 @@
-
+
@@ -71,7 +71,7 @@ pre > code.sourceCode > span > a:first-child::before { text-decoration: underlin
-
+
diff --git a/docs/multipack.html b/docs/multipack.html
index bbaa315a6..dbfd78c58 100644
--- a/docs/multipack.html
+++ b/docs/multipack.html
@@ -2,7 +2,7 @@
-
+
@@ -37,7 +37,7 @@ ul.task-list li input[type="checkbox"] {
-
+
diff --git a/docs/nccl.html b/docs/nccl.html
index b5264f298..93c681564 100644
--- a/docs/nccl.html
+++ b/docs/nccl.html
@@ -2,7 +2,7 @@
-
+
@@ -72,7 +72,7 @@ pre > code.sourceCode > span > a:first-child::before { text-decoration: underlin
-
+
diff --git a/docs/nd_parallelism.html b/docs/nd_parallelism.html
index c9367943f..522e07cc9 100644
--- a/docs/nd_parallelism.html
+++ b/docs/nd_parallelism.html
@@ -2,7 +2,7 @@
-
+
@@ -71,7 +71,7 @@ pre > code.sourceCode > span > a:first-child::before { text-decoration: underlin
-
+
diff --git a/docs/optimizers.html b/docs/optimizers.html
index bad461a1a..126c3ddae 100644
--- a/docs/optimizers.html
+++ b/docs/optimizers.html
@@ -2,7 +2,7 @@
-
+
@@ -72,7 +72,7 @@ pre > code.sourceCode > span > a:first-child::before { text-decoration: underlin
-
+
diff --git a/docs/qat.html b/docs/qat.html
index b22aa1694..94169c27d 100644
--- a/docs/qat.html
+++ b/docs/qat.html
@@ -2,7 +2,7 @@
-
+
@@ -71,7 +71,7 @@ pre > code.sourceCode > span > a:first-child::before { text-decoration: underlin
-
+
diff --git a/docs/quantize.html b/docs/quantize.html
index 0e37f68ea..3a0edee1f 100644
--- a/docs/quantize.html
+++ b/docs/quantize.html
@@ -2,7 +2,7 @@
-
+
@@ -71,7 +71,7 @@ pre > code.sourceCode > span > a:first-child::before { text-decoration: underlin
-
+
diff --git a/docs/ray-integration.html b/docs/ray-integration.html
index ed12427ec..4cdc83a2e 100644
--- a/docs/ray-integration.html
+++ b/docs/ray-integration.html
@@ -2,7 +2,7 @@
-
+
@@ -72,7 +72,7 @@ pre > code.sourceCode > span > a:first-child::before { text-decoration: underlin
-
+
diff --git a/docs/reward_modelling.html b/docs/reward_modelling.html
index 355cfefd0..c86c86c70 100644
--- a/docs/reward_modelling.html
+++ b/docs/reward_modelling.html
@@ -2,7 +2,7 @@
-
+
@@ -72,7 +72,7 @@ pre > code.sourceCode > span > a:first-child::before { text-decoration: underlin
-
+
diff --git a/docs/rlhf.html b/docs/rlhf.html
index a05bc3a36..e50d1a594 100644
--- a/docs/rlhf.html
+++ b/docs/rlhf.html
@@ -2,7 +2,7 @@
-
+
@@ -72,7 +72,7 @@ pre > code.sourceCode > span > a:first-child::before { text-decoration: underlin
-
+
diff --git a/docs/sequence_parallelism.html b/docs/sequence_parallelism.html
index 8a7306dee..e51a4d09a 100644
--- a/docs/sequence_parallelism.html
+++ b/docs/sequence_parallelism.html
@@ -2,7 +2,7 @@
-
+
@@ -72,7 +72,7 @@ pre > code.sourceCode > span > a:first-child::before { text-decoration: underlin
-
+
diff --git a/docs/torchao.html b/docs/torchao.html
index 3a19483c5..673e65d98 100644
--- a/docs/torchao.html
+++ b/docs/torchao.html
@@ -2,7 +2,7 @@
-
+
@@ -72,7 +72,7 @@ pre > code.sourceCode > span > a:first-child::before { text-decoration: underlin
-
+
diff --git a/docs/unsloth.html b/docs/unsloth.html
index ec34eb6a1..db3f55009 100644
--- a/docs/unsloth.html
+++ b/docs/unsloth.html
@@ -2,7 +2,7 @@
-
+
@@ -72,7 +72,7 @@ pre > code.sourceCode > span > a:first-child::before { text-decoration: underlin
-
+
diff --git a/examples/colab-notebooks/colab-axolotl-example.html b/examples/colab-notebooks/colab-axolotl-example.html
index 6ea070c7c..c808c98d9 100644
--- a/examples/colab-notebooks/colab-axolotl-example.html
+++ b/examples/colab-notebooks/colab-axolotl-example.html
@@ -2,7 +2,7 @@
-
+
@@ -71,7 +71,7 @@ pre > code.sourceCode > span > a:first-child::before { text-decoration: underlin
-
+
diff --git a/index.html b/index.html
index 0b4cfb8dd..9ecfe4c91 100644
--- a/index.html
+++ b/index.html
@@ -2,7 +2,7 @@
-
+
@@ -71,7 +71,7 @@ pre > code.sourceCode > span > a:first-child::before { text-decoration: underlin
-
+
diff --git a/search.json b/search.json
index 45d395e97..343a56fc8 100644
--- a/search.json
+++ b/search.json
@@ -175,7 +175,7 @@
"href": "docs/multi-gpu.html#sec-deepspeed",
"title": "Multi-GPU",
"section": "2 DeepSpeed",
- "text": "2 DeepSpeed\n\n2.1 Configuration\nAdd to your YAML config:\ndeepspeed: deepspeed_configs/zero1.json\n\n\n2.2 Usage\n# Fetch deepspeed configs (if not already present)\naxolotl fetch deepspeed_configs\n\n# Passing arg via config\naxolotl train config.yml\n\n# Passing arg via cli\naxolotl train config.yml --deepspeed deepspeed_configs/zero1.json\n\n\n2.3 ZeRO Stages\nWe provide default configurations for:\n\nZeRO Stage 1 (zero1.json)\nZeRO Stage 1 with torch compile (zero1_torch_compile.json)\nZeRO Stage 2 (zero2.json)\nZeRO Stage 3 (zero3.json)\nZeRO Stage 3 with bf16 (zero3_bf16.json)\nZeRO Stage 3 with bf16 and CPU offload params(zero3_bf16_cpuoffload_params.json)\nZeRO Stage 3 with bf16 and CPU offload params and optimizer (zero3_bf16_cpuoffload_all.json)\n\n\n\n\n\n\n\nTip\n\n\n\nChoose the configuration that offloads the least amount to memory while still being able to fit on VRAM for best performance.\nStart from Stage 1 -> Stage 2 -> Stage 3.\n\n\n\n\n\n\n\n\nTip\n\n\n\nUsing ZeRO Stage 3 with Single-GPU training\nZeRO Stage 3 can be used for training on a single GPU by manually setting the environment variables:\nWORLD_SIZE=1 LOCAL_RANK=0 MASTER_ADDR=0.0.0.0 MASTER_PORT=29500",
+ "text": "2 DeepSpeed\n\n2.1 Configuration\nAdd to your YAML config:\ndeepspeed: deepspeed_configs/zero1.json\n\n\n2.2 Usage\n# Fetch deepspeed configs (if not already present)\naxolotl fetch deepspeed_configs\n\n# Passing arg via config\naxolotl train config.yml\n\n# Passing arg via cli\naxolotl train config.yml --deepspeed deepspeed_configs/zero1.json\n\n\n2.3 ZeRO Stages\nWe provide default configurations for:\n\nZeRO Stage 1 (zero1.json)\nZeRO Stage 1 with torch compile (zero1_torch_compile.json)\nZeRO Stage 2 (zero2.json)\nZeRO Stage 3 (zero3.json)\nZeRO Stage 3 with bf16 (zero3_bf16.json)\nZeRO Stage 3 with bf16 and CPU offload params(zero3_bf16_cpuoffload_params.json)\nZeRO Stage 3 with bf16 and CPU offload params and optimizer (zero3_bf16_cpuoffload_all.json)\n\n\n\n\n\n\n\nTip\n\n\n\nChoose the configuration that offloads the least amount to memory while still being able to fit on VRAM for best performance.\nStart from Stage 1 -> Stage 2 -> Stage 3.",
"crumbs": [
"Deployments",
"Multi-GPU"
diff --git a/site_libs/quarto-html/quarto-syntax-highlighting-dark-bc185b5c5bdbcb35c2eb49d8a876ef70.css b/site_libs/quarto-html/quarto-syntax-highlighting-dark-befe23ebd2f54d8af2c8a89d1a1611f1.css
similarity index 98%
rename from site_libs/quarto-html/quarto-syntax-highlighting-dark-bc185b5c5bdbcb35c2eb49d8a876ef70.css
rename to site_libs/quarto-html/quarto-syntax-highlighting-dark-befe23ebd2f54d8af2c8a89d1a1611f1.css
index 2a884823a..a6ae06a82 100644
--- a/site_libs/quarto-html/quarto-syntax-highlighting-dark-bc185b5c5bdbcb35c2eb49d8a876ef70.css
+++ b/site_libs/quarto-html/quarto-syntax-highlighting-dark-befe23ebd2f54d8af2c8a89d1a1611f1.css
@@ -216,4 +216,4 @@ code span.wa {
content: "";
}
-/*# sourceMappingURL=9510e8932c082882303fe901a8ba9f6d.css.map */
+/*# sourceMappingURL=237138eac573633834fecf06dae85922.css.map */
diff --git a/sitemap.xml b/sitemap.xml
index ee08c8079..ff59e7d03 100644
--- a/sitemap.xml
+++ b/sitemap.xml
@@ -2,790 +2,790 @@
https://docs.axolotl.ai/examples/colab-notebooks/colab-axolotl-example.html
- 2025-08-28T13:10:49.902Z
+ 2025-08-29T17:52:59.100Zhttps://docs.axolotl.ai/src/axolotl/integrations/LICENSE.html
- 2025-08-28T13:10:49.918Z
+ 2025-08-29T17:52:59.117Zhttps://docs.axolotl.ai/FAQS.html
- 2025-08-28T13:10:49.892Z
+ 2025-08-29T17:52:59.090Zhttps://docs.axolotl.ai/docs/unsloth.html
- 2025-08-28T13:10:49.898Z
+ 2025-08-29T17:52:59.096Zhttps://docs.axolotl.ai/docs/reward_modelling.html
- 2025-08-28T13:10:49.898Z
+ 2025-08-29T17:52:59.096Zhttps://docs.axolotl.ai/docs/docker.html
- 2025-08-28T13:10:49.894Z
+ 2025-08-29T17:52:59.092Zhttps://docs.axolotl.ai/docs/batch_vs_grad.html
- 2025-08-28T13:10:49.894Z
+ 2025-08-29T17:52:59.092Zhttps://docs.axolotl.ai/docs/torchao.html
- 2025-08-28T13:10:49.898Z
+ 2025-08-29T17:52:59.096Zhttps://docs.axolotl.ai/docs/dataset_preprocessing.html
- 2025-08-28T13:10:49.894Z
+ 2025-08-29T17:52:59.092Zhttps://docs.axolotl.ai/docs/multi-gpu.html
- 2025-08-28T13:10:49.897Z
+ 2025-08-29T17:52:59.095Zhttps://docs.axolotl.ai/docs/optimizers.html
- 2025-08-28T13:10:49.898Z
+ 2025-08-29T17:52:59.096Zhttps://docs.axolotl.ai/docs/getting-started.html
- 2025-08-28T13:10:49.894Z
+ 2025-08-29T17:52:59.092Zhttps://docs.axolotl.ai/docs/ray-integration.html
- 2025-08-28T13:10:49.898Z
+ 2025-08-29T17:52:59.096Zhttps://docs.axolotl.ai/docs/input_output.html
- 2025-08-28T13:10:49.897Z
+ 2025-08-29T17:52:59.095Zhttps://docs.axolotl.ai/docs/multi-node.html
- 2025-08-28T13:10:49.897Z
+ 2025-08-29T17:52:59.095Zhttps://docs.axolotl.ai/docs/api/train.html
- 2025-08-28T13:13:52.510Z
+ 2025-08-29T17:56:24.493Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.kto.user_defined.html
- 2025-08-28T13:13:53.188Z
+ 2025-08-29T17:56:25.159Zhttps://docs.axolotl.ai/docs/api/common.datasets.html
- 2025-08-28T13:13:53.860Z
+ 2025-08-29T17:56:25.814Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.stepwise_supervised.html
- 2025-08-28T13:13:53.121Z
+ 2025-08-29T17:56:25.093Zhttps://docs.axolotl.ai/docs/api/cli.utils.args.html
- 2025-08-28T13:13:52.851Z
+ 2025-08-29T17:56:24.828Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.bradley_terry.llama3.html
- 2025-08-28T13:13:53.212Z
+ 2025-08-29T17:56:25.183Zhttps://docs.axolotl.ai/docs/api/utils.schemas.peft.html
- 2025-08-28T13:13:53.642Z
+ 2025-08-29T17:56:25.601Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.messages.chat.html
- 2025-08-28T13:13:53.143Z
+ 2025-08-29T17:56:25.115Zhttps://docs.axolotl.ai/docs/api/integrations.lm_eval.args.html
- 2025-08-28T13:13:53.838Z
+ 2025-08-29T17:56:25.793Zhttps://docs.axolotl.ai/docs/api/cli.train.html
- 2025-08-28T13:13:52.711Z
+ 2025-08-29T17:56:24.690Zhttps://docs.axolotl.ai/docs/api/utils.trainer.html
- 2025-08-28T13:13:53.494Z
+ 2025-08-29T17:56:25.457Zhttps://docs.axolotl.ai/docs/api/core.datasets.transforms.chat_builder.html
- 2025-08-28T13:13:52.668Z
+ 2025-08-29T17:56:24.648Zhttps://docs.axolotl.ai/docs/api/utils.callbacks.comet_.html
- 2025-08-28T13:13:53.950Z
+ 2025-08-29T17:56:25.903Zhttps://docs.axolotl.ai/docs/api/prompt_tokenizers.html
- 2025-08-28T13:13:52.589Z
+ 2025-08-29T17:56:24.570Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.dpo.chat_template.html
- 2025-08-28T13:13:53.145Z
+ 2025-08-29T17:56:25.116Zhttps://docs.axolotl.ai/docs/api/core.trainers.utils.html
- 2025-08-28T13:13:52.944Z
+ 2025-08-29T17:56:24.919Zhttps://docs.axolotl.ai/docs/api/loaders.processor.html
- 2025-08-28T13:13:52.964Z
+ 2025-08-29T17:56:24.938Zhttps://docs.axolotl.ai/docs/api/core.trainers.mixins.optimizer.html
- 2025-08-28T13:13:52.987Z
+ 2025-08-29T17:56:24.961Zhttps://docs.axolotl.ai/docs/api/monkeypatch.data.batch_dataset_fetcher.html
- 2025-08-28T13:13:53.415Z
+ 2025-08-29T17:56:25.380Zhttps://docs.axolotl.ai/docs/api/utils.callbacks.lisa.html
- 2025-08-28T13:13:53.943Z
+ 2025-08-29T17:56:25.896Zhttps://docs.axolotl.ai/docs/api/kernels.utils.html
- 2025-08-28T13:13:53.333Z
+ 2025-08-29T17:56:25.299Zhttps://docs.axolotl.ai/docs/api/cli.preprocess.html
- 2025-08-28T13:13:52.816Z
+ 2025-08-29T17:56:24.793Zhttps://docs.axolotl.ai/docs/api/integrations.liger.args.html
- 2025-08-28T13:13:53.834Z
+ 2025-08-29T17:56:25.789Zhttps://docs.axolotl.ai/docs/api/utils.collators.core.html
- 2025-08-28T13:13:53.862Z
+ 2025-08-29T17:56:25.816Zhttps://docs.axolotl.ai/docs/api/cli.utils.load.html
- 2025-08-28T13:13:52.862Z
+ 2025-08-29T17:56:24.839Zhttps://docs.axolotl.ai/docs/api/cli.utils.train.html
- 2025-08-28T13:13:52.880Z
+ 2025-08-29T17:56:24.856Zhttps://docs.axolotl.ai/docs/api/core.trainers.grpo.sampler.html
- 2025-08-28T13:13:52.943Z
+ 2025-08-29T17:56:24.917Zhttps://docs.axolotl.ai/docs/api/utils.callbacks.mlflow_.html
- 2025-08-28T13:13:53.947Z
+ 2025-08-29T17:56:25.899Zhttps://docs.axolotl.ai/docs/api/utils.freeze.html
- 2025-08-28T13:13:53.477Z
+ 2025-08-29T17:56:25.440Zhttps://docs.axolotl.ai/docs/api/kernels.geglu.html
- 2025-08-28T13:13:53.313Z
+ 2025-08-29T17:56:25.280Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.dpo.zephyr.html
- 2025-08-28T13:13:53.167Z
+ 2025-08-29T17:56:25.138Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.llama2_chat.html
- 2025-08-28T13:13:53.105Z
+ 2025-08-29T17:56:25.077Zhttps://docs.axolotl.ai/docs/api/cli.config.html
- 2025-08-28T13:13:52.768Z
+ 2025-08-29T17:56:24.745Zhttps://docs.axolotl.ai/docs/api/kernels.quantize.html
- 2025-08-28T13:13:53.331Z
+ 2025-08-29T17:56:25.298Zhttps://docs.axolotl.ai/docs/api/monkeypatch.lora_kernels.html
- 2025-08-28T13:13:53.377Z
+ 2025-08-29T17:56:25.343Zhttps://docs.axolotl.ai/docs/api/common.const.html
- 2025-08-28T13:13:53.844Z
+ 2025-08-29T17:56:25.799Zhttps://docs.axolotl.ai/docs/api/integrations.cut_cross_entropy.args.html
- 2025-08-28T13:13:53.822Z
+ 2025-08-29T17:56:25.777Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.alpaca_w_system.html
- 2025-08-28T13:13:53.084Z
+ 2025-08-29T17:56:25.056Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.chat_template.html
- 2025-08-28T13:13:53.057Z
+ 2025-08-29T17:56:25.029Zhttps://docs.axolotl.ai/docs/api/utils.schemas.datasets.html
- 2025-08-28T13:13:53.633Z
+ 2025-08-29T17:56:25.592Zhttps://docs.axolotl.ai/docs/api/core.training_args.html
- 2025-08-28T13:13:52.627Z
+ 2025-08-29T17:56:24.608Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.orpo.chat_template.html
- 2025-08-28T13:13:53.209Z
+ 2025-08-29T17:56:25.179Zhttps://docs.axolotl.ai/docs/api/cli.utils.html
- 2025-08-28T13:13:52.839Z
+ 2025-08-29T17:56:24.816Zhttps://docs.axolotl.ai/docs/api/common.architectures.html
- 2025-08-28T13:13:53.843Z
+ 2025-08-29T17:56:25.797Zhttps://docs.axolotl.ai/docs/api/core.trainers.grpo.trainer.html
- 2025-08-28T13:13:52.930Z
+ 2025-08-29T17:56:24.905Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.completion.html
- 2025-08-28T13:13:53.111Z
+ 2025-08-29T17:56:25.083Zhttps://docs.axolotl.ai/docs/api/utils.collators.batching.html
- 2025-08-28T13:13:53.882Z
+ 2025-08-29T17:56:25.835Zhttps://docs.axolotl.ai/docs/api/kernels.lora.html
- 2025-08-28T13:13:53.303Z
+ 2025-08-29T17:56:25.270Zhttps://docs.axolotl.ai/docs/api/utils.schedulers.html
- 2025-08-28T13:13:53.523Z
+ 2025-08-29T17:56:25.484Zhttps://docs.axolotl.ai/docs/api/monkeypatch.relora.html
- 2025-08-28T13:13:53.347Z
+ 2025-08-29T17:56:25.313Zhttps://docs.axolotl.ai/docs/api/monkeypatch.transformers_fa_utils.html
- 2025-08-28T13:13:53.404Z
+ 2025-08-29T17:56:25.369Zhttps://docs.axolotl.ai/docs/api/loaders.constants.html
- 2025-08-28T13:13:52.981Z
+ 2025-08-29T17:56:24.955Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.alpaca_instruct.html
- 2025-08-28T13:13:53.072Z
+ 2025-08-29T17:56:25.044Zhttps://docs.axolotl.ai/docs/api/cli.merge_lora.html
- 2025-08-28T13:13:52.796Z
+ 2025-08-29T17:56:24.773Zhttps://docs.axolotl.ai/docs/api/core.chat.messages.html
- 2025-08-28T13:13:52.651Z
+ 2025-08-29T17:56:24.631Zhttps://docs.axolotl.ai/docs/api/monkeypatch.llama_patch_multipack.html
- 2025-08-28T13:13:53.388Z
+ 2025-08-29T17:56:25.354Zhttps://docs.axolotl.ai/docs/api/monkeypatch.mixtral.html
- 2025-08-28T13:13:53.416Z
+ 2025-08-29T17:56:25.381Zhttps://docs.axolotl.ai/docs/api/utils.schemas.trl.html
- 2025-08-28T13:13:53.645Z
+ 2025-08-29T17:56:25.604Zhttps://docs.axolotl.ai/docs/api/core.builders.causal.html
- 2025-08-28T13:13:52.609Z
+ 2025-08-29T17:56:24.591Zhttps://docs.axolotl.ai/docs/api/cli.utils.fetch.html
- 2025-08-28T13:13:52.857Z
+ 2025-08-29T17:56:24.833Zhttps://docs.axolotl.ai/docs/api/monkeypatch.utils.html
- 2025-08-28T13:13:53.385Z
+ 2025-08-29T17:56:25.351Zhttps://docs.axolotl.ai/docs/api/utils.chat_templates.html
- 2025-08-28T13:13:53.455Z
+ 2025-08-29T17:56:25.419Zhttps://docs.axolotl.ai/docs/api/utils.data.sft.html
- 2025-08-28T13:13:53.565Z
+ 2025-08-29T17:56:25.526Zhttps://docs.axolotl.ai/docs/api/monkeypatch.mistral_attn_hijack_flash.html
- 2025-08-28T13:13:53.342Z
+ 2025-08-29T17:56:25.308Zhttps://docs.axolotl.ai/docs/api/utils.callbacks.qat.html
- 2025-08-28T13:13:53.957Z
+ 2025-08-29T17:56:25.910Zhttps://docs.axolotl.ai/docs/api/cli.checks.html
- 2025-08-28T13:13:52.749Z
+ 2025-08-29T17:56:24.727Zhttps://docs.axolotl.ai/docs/api/utils.schemas.utils.html
- 2025-08-28T13:13:53.680Z
+ 2025-08-29T17:56:25.638Zhttps://docs.axolotl.ai/docs/api/utils.schemas.integrations.html
- 2025-08-28T13:13:53.663Z
+ 2025-08-29T17:56:25.622Zhttps://docs.axolotl.ai/docs/api/utils.schemas.training.html
- 2025-08-28T13:13:53.615Z
+ 2025-08-29T17:56:25.574Zhttps://docs.axolotl.ai/docs/api/integrations.grokfast.optimizer.html
- 2025-08-28T13:13:53.823Z
+ 2025-08-29T17:56:25.778Zhttps://docs.axolotl.ai/docs/api/monkeypatch.llama_attn_hijack_flash.html
- 2025-08-28T13:13:53.339Z
+ 2025-08-29T17:56:25.305Zhttps://docs.axolotl.ai/docs/api/utils.model_shard_quant.html
- 2025-08-28T13:13:53.465Z
+ 2025-08-29T17:56:25.429Zhttps://docs.axolotl.ai/docs/api/utils.collators.mm_chat.html
- 2025-08-28T13:13:53.890Z
+ 2025-08-29T17:56:25.843Zhttps://docs.axolotl.ai/docs/api/cli.inference.html
- 2025-08-28T13:13:52.787Z
+ 2025-08-29T17:56:24.764Zhttps://docs.axolotl.ai/docs/api/monkeypatch.trainer_fsdp_optim.html
- 2025-08-28T13:13:53.397Z
+ 2025-08-29T17:56:25.363Zhttps://docs.axolotl.ai/docs/amd_hpc.html
- 2025-08-28T13:10:49.893Z
+ 2025-08-29T17:52:59.091Zhttps://docs.axolotl.ai/docs/multipack.html
- 2025-08-28T13:10:49.897Z
+ 2025-08-29T17:52:59.095Zhttps://docs.axolotl.ai/docs/lora_optims.html
- 2025-08-28T13:10:49.897Z
+ 2025-08-29T17:52:59.095Zhttps://docs.axolotl.ai/docs/inference.html
- 2025-08-28T13:10:49.897Z
+ 2025-08-29T17:52:59.095Zhttps://docs.axolotl.ai/docs/lr_groups.html
- 2025-08-28T13:10:49.897Z
+ 2025-08-29T17:52:59.095Zhttps://docs.axolotl.ai/docs/dataset-formats/inst_tune.html
- 2025-08-28T13:10:49.894Z
+ 2025-08-29T17:52:59.092Zhttps://docs.axolotl.ai/docs/dataset-formats/conversation.html
- 2025-08-28T13:10:49.894Z
+ 2025-08-29T17:52:59.092Zhttps://docs.axolotl.ai/docs/dataset-formats/index.html
- 2025-08-28T13:10:49.894Z
+ 2025-08-29T17:52:59.092Zhttps://docs.axolotl.ai/docs/dataset-formats/template_free.html
- 2025-08-28T13:10:49.894Z
+ 2025-08-29T17:52:59.092Zhttps://docs.axolotl.ai/docs/dataset-formats/pretraining.html
- 2025-08-28T13:10:49.894Z
+ 2025-08-29T17:52:59.092Zhttps://docs.axolotl.ai/docs/dataset-formats/stepwise_supervised.html
- 2025-08-28T13:10:49.894Z
+ 2025-08-29T17:52:59.092Zhttps://docs.axolotl.ai/docs/dataset-formats/tokenized.html
- 2025-08-28T13:10:49.894Z
+ 2025-08-29T17:52:59.092Zhttps://docs.axolotl.ai/docs/faq.html
- 2025-08-28T13:10:49.894Z
+ 2025-08-29T17:52:59.092Zhttps://docs.axolotl.ai/docs/sequence_parallelism.html
- 2025-08-28T13:10:49.898Z
+ 2025-08-29T17:52:59.096Zhttps://docs.axolotl.ai/docs/mac.html
- 2025-08-28T13:10:49.897Z
+ 2025-08-29T17:52:59.095Zhttps://docs.axolotl.ai/docs/installation.html
- 2025-08-28T13:10:49.897Z
+ 2025-08-29T17:52:59.095Zhttps://docs.axolotl.ai/docs/mixed_precision.html
- 2025-08-28T13:10:49.897Z
+ 2025-08-29T17:52:59.095Zhttps://docs.axolotl.ai/docs/api/core.trainers.trl.html
- 2025-08-28T13:13:52.907Z
+ 2025-08-29T17:56:24.882Zhttps://docs.axolotl.ai/docs/api/cli.cloud.base.html
- 2025-08-28T13:13:52.832Z
+ 2025-08-29T17:56:24.808Zhttps://docs.axolotl.ai/docs/api/core.chat.format.chatml.html
- 2025-08-28T13:13:52.652Z
+ 2025-08-29T17:56:24.632Zhttps://docs.axolotl.ai/docs/api/monkeypatch.multipack.html
- 2025-08-28T13:13:53.343Z
+ 2025-08-29T17:56:25.309Zhttps://docs.axolotl.ai/docs/api/utils.ctx_managers.sequence_parallel.html
- 2025-08-28T13:13:53.021Z
+ 2025-08-29T17:56:24.995Zhttps://docs.axolotl.ai/docs/api/cli.main.html
- 2025-08-28T13:13:52.703Z
+ 2025-08-29T17:56:24.682Zhttps://docs.axolotl.ai/docs/api/utils.schemas.multimodal.html
- 2025-08-28T13:13:53.651Z
+ 2025-08-29T17:56:25.610Zhttps://docs.axolotl.ai/docs/api/kernels.swiglu.html
- 2025-08-28T13:13:53.324Z
+ 2025-08-29T17:56:25.290Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.dpo.user_defined.html
- 2025-08-28T13:13:53.168Z
+ 2025-08-29T17:56:25.139Zhttps://docs.axolotl.ai/docs/api/core.trainers.mixins.rng_state_loader.html
- 2025-08-28T13:13:52.990Z
+ 2025-08-29T17:56:24.964Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.user_defined.html
- 2025-08-28T13:13:53.092Z
+ 2025-08-29T17:56:25.064Zhttps://docs.axolotl.ai/docs/api/utils.lora.html
- 2025-08-28T13:13:53.460Z
+ 2025-08-29T17:56:25.424Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.pygmalion.html
- 2025-08-28T13:13:53.139Z
+ 2025-08-29T17:56:25.110Zhttps://docs.axolotl.ai/docs/api/loaders.tokenizer.html
- 2025-08-28T13:13:52.962Z
+ 2025-08-29T17:56:24.937Zhttps://docs.axolotl.ai/docs/api/utils.callbacks.perplexity.html
- 2025-08-28T13:13:53.938Z
+ 2025-08-29T17:56:25.891Zhttps://docs.axolotl.ai/docs/api/monkeypatch.llama_attn_hijack_xformers.html
- 2025-08-28T13:13:53.340Z
+ 2025-08-29T17:56:25.306Zhttps://docs.axolotl.ai/docs/api/utils.optimizers.adopt.html
- 2025-08-28T13:13:53.557Z
+ 2025-08-29T17:56:25.517Zhttps://docs.axolotl.ai/docs/api/loaders.patch_manager.html
- 2025-08-28T13:13:52.979Z
+ 2025-08-29T17:56:24.954Zhttps://docs.axolotl.ai/docs/api/core.trainers.mixins.scheduler.html
- 2025-08-28T13:13:52.997Z
+ 2025-08-29T17:56:24.971Zhttps://docs.axolotl.ai/docs/api/monkeypatch.unsloth_.html
- 2025-08-28T13:13:53.405Z
+ 2025-08-29T17:56:25.371Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.base.html
- 2025-08-28T13:13:53.022Z
+ 2025-08-29T17:56:24.996Zhttps://docs.axolotl.ai/docs/api/utils.data.pretraining.html
- 2025-08-28T13:13:53.558Z
+ 2025-08-29T17:56:25.519Zhttps://docs.axolotl.ai/docs/api/monkeypatch.btlm_attn_hijack_flash.html
- 2025-08-28T13:13:53.387Z
+ 2025-08-29T17:56:25.352Zhttps://docs.axolotl.ai/docs/api/prompt_strategies.alpaca_chat.html
- 2025-08-28T13:13:53.070Z
+ 2025-08-29T17:56:25.043Zhttps://docs.axolotl.ai/docs/api/core.trainers.mamba.html
- 2025-08-28T13:13:52.912Z
+ 2025-08-29T17:56:24.888Zhttps://docs.axolotl.ai/docs/api/utils.samplers.multipack.html
- 2025-08-28T13:13:53.931Z
+ 2025-08-29T17:56:25.884Zhttps://docs.axolotl.ai/docs/api/utils.bench.html
- 2025-08-28T13:13:53.469Z
+ 2025-08-29T17:56:25.433Zhttps://docs.axolotl.ai/docs/api/utils.callbacks.profiler.html
- 2025-08-28T13:13:53.942Z
+ 2025-08-29T17:56:25.894Zhttps://docs.axolotl.ai/docs/api/utils.schemas.enums.html
- 2025-08-28T13:13:53.674Z
+ 2025-08-29T17:56:25.632Z
  https://docs.axolotl.ai/docs/api/models.mamba.modeling_mamba.html
- 2025-08-28T13:13:53.861Z
+ 2025-08-29T17:56:25.815Z
  https://docs.axolotl.ai/docs/api/core.chat.format.shared.html
- 2025-08-28T13:13:52.655Z
+ 2025-08-29T17:56:24.635Z
  https://docs.axolotl.ai/docs/api/logging_config.html
- 2025-08-28T13:13:52.598Z
+ 2025-08-29T17:56:24.580Z
  https://docs.axolotl.ai/docs/api/loaders.model.html
- 2025-08-28T13:13:52.954Z
+ 2025-08-29T17:56:24.929Z
  https://docs.axolotl.ai/docs/api/monkeypatch.llama_expand_mask.html
- 2025-08-28T13:13:53.348Z
+ 2025-08-29T17:56:25.314Z
  https://docs.axolotl.ai/docs/api/core.trainers.base.html
- 2025-08-28T13:13:52.891Z
+ 2025-08-29T17:56:24.867Z
  https://docs.axolotl.ai/docs/api/utils.tokenization.html
- 2025-08-28T13:13:53.453Z
+ 2025-08-29T17:56:25.417Z
  https://docs.axolotl.ai/docs/api/prompt_strategies.kto.llama3.html
- 2025-08-28T13:13:53.178Z
+ 2025-08-29T17:56:25.149Z
  https://docs.axolotl.ai/docs/api/convert.html
- 2025-08-28T13:13:52.546Z
+ 2025-08-29T17:56:24.528Z
  https://docs.axolotl.ai/docs/api/datasets.html
- 2025-08-28T13:13:52.532Z
+ 2025-08-29T17:56:24.515Z
  https://docs.axolotl.ai/docs/api/utils.collators.mamba.html
- 2025-08-28T13:13:53.885Z
+ 2025-08-29T17:56:25.839Z
  https://docs.axolotl.ai/docs/api/prompt_strategies.orcamini.html
- 2025-08-28T13:13:53.132Z
+ 2025-08-29T17:56:25.104Z
  https://docs.axolotl.ai/docs/api/utils.schemas.config.html
- 2025-08-28T13:13:53.601Z
+ 2025-08-29T17:56:25.560Z
  https://docs.axolotl.ai/docs/api/core.trainers.dpo.trainer.html
- 2025-08-28T13:13:52.919Z
+ 2025-08-29T17:56:24.894Z
  https://docs.axolotl.ai/docs/api/utils.dict.html
- 2025-08-28T13:13:53.549Z
+ 2025-08-29T17:56:25.510Z
  https://docs.axolotl.ai/docs/api/utils.quantization.html
- 2025-08-28T13:13:53.586Z
+ 2025-08-29T17:56:25.546Z
  https://docs.axolotl.ai/docs/api/monkeypatch.gradient_checkpointing.offload_disk.html
- 2025-08-28T13:13:53.446Z
+ 2025-08-29T17:56:25.410Z
  https://docs.axolotl.ai/docs/api/index.html
- 2025-08-28T13:13:52.452Z
+ 2025-08-29T17:56:24.435Z
  https://docs.axolotl.ai/docs/api/integrations.base.html
- 2025-08-28T13:13:53.819Z
+ 2025-08-29T17:56:25.774Z
  https://docs.axolotl.ai/docs/api/utils.distributed.html
- 2025-08-28T13:13:53.543Z
+ 2025-08-29T17:56:25.504Z
  https://docs.axolotl.ai/docs/api/core.datasets.chat.html
- 2025-08-28T13:13:52.660Z
+ 2025-08-29T17:56:24.641Z
  https://docs.axolotl.ai/docs/api/prompt_strategies.dpo.passthrough.html
- 2025-08-28T13:13:53.170Z
+ 2025-08-29T17:56:25.141Z
  https://docs.axolotl.ai/docs/api/prompt_strategies.metharme.html
- 2025-08-28T13:13:53.128Z
+ 2025-08-29T17:56:25.100Z
  https://docs.axolotl.ai/docs/api/prompt_strategies.dpo.chatml.html
- 2025-08-28T13:13:53.165Z
+ 2025-08-29T17:56:25.136Z
  https://docs.axolotl.ai/docs/api/core.builders.rl.html
- 2025-08-28T13:13:52.614Z
+ 2025-08-29T17:56:24.595Z
  https://docs.axolotl.ai/docs/api/cli.utils.sweeps.html
- 2025-08-28T13:13:52.868Z
+ 2025-08-29T17:56:24.844Z
  https://docs.axolotl.ai/docs/api/cli.quantize.html
- 2025-08-28T13:13:52.821Z
+ 2025-08-29T17:56:24.798Z
  https://docs.axolotl.ai/docs/api/cli.vllm_serve.html
- 2025-08-28T13:13:52.828Z
+ 2025-08-29T17:56:24.805Z
  https://docs.axolotl.ai/docs/api/prompt_strategies.input_output.html
- 2025-08-28T13:13:53.117Z
+ 2025-08-29T17:56:25.089Z
  https://docs.axolotl.ai/docs/api/evaluate.html
- 2025-08-28T13:13:52.521Z
+ 2025-08-29T17:56:24.504Z
  https://docs.axolotl.ai/docs/api/monkeypatch.gradient_checkpointing.offload_cpu.html
- 2025-08-28T13:13:53.420Z
+ 2025-08-29T17:56:25.384Z
  https://docs.axolotl.ai/docs/api/cli.args.html
- 2025-08-28T13:13:52.740Z
+ 2025-08-29T17:56:24.718Z
  https://docs.axolotl.ai/docs/api/prompt_strategies.kto.chatml.html
- 2025-08-28T13:13:53.186Z
+ 2025-08-29T17:56:25.157Z
  https://docs.axolotl.ai/docs/api/core.chat.format.llama3x.html
- 2025-08-28T13:13:52.654Z
+ 2025-08-29T17:56:24.634Z
  https://docs.axolotl.ai/docs/api/prompt_strategies.dpo.llama3.html
- 2025-08-28T13:13:53.155Z
+ 2025-08-29T17:56:25.126Z
  https://docs.axolotl.ai/docs/api/cli.art.html
- 2025-08-28T13:13:52.743Z
+ 2025-08-29T17:56:24.721Z
  https://docs.axolotl.ai/docs/api/utils.schemas.model.html
- 2025-08-28T13:13:53.608Z
+ 2025-08-29T17:56:25.567Z
  https://docs.axolotl.ai/docs/api/cli.evaluate.html
- 2025-08-28T13:13:52.720Z
+ 2025-08-29T17:56:24.699Z
  https://docs.axolotl.ai/docs/api/cli.delinearize_llama4.html
- 2025-08-28T13:13:52.773Z
+ 2025-08-29T17:56:24.750Z
  https://docs.axolotl.ai/docs/api/integrations.kd.trainer.html
- 2025-08-28T13:13:53.831Z
+ 2025-08-29T17:56:25.786Z
  https://docs.axolotl.ai/docs/api/monkeypatch.stablelm_attn_hijack_flash.html
- 2025-08-28T13:13:53.394Z
+ 2025-08-29T17:56:25.359Z
  https://docs.axolotl.ai/docs/api/cli.cloud.modal_.html
- 2025-08-28T13:13:52.838Z
+ 2025-08-29T17:56:24.815Z
  https://docs.axolotl.ai/docs/api/cli.merge_sharded_fsdp_weights.html
- 2025-08-28T13:13:52.808Z
+ 2025-08-29T17:56:24.785Z
  https://docs.axolotl.ai/docs/api/core.builders.base.html
- 2025-08-28T13:13:52.605Z
+ 2025-08-29T17:56:24.586Z
  https://docs.axolotl.ai/docs/api/loaders.adapter.html
- 2025-08-28T13:13:52.970Z
+ 2025-08-29T17:56:24.944Z
  https://docs.axolotl.ai/docs/api/integrations.spectrum.args.html
- 2025-08-28T13:13:53.841Z
+ 2025-08-29T17:56:25.796Z
  https://docs.axolotl.ai/docs/dataset_loading.html
- 2025-08-28T13:10:49.894Z
+ 2025-08-29T17:52:59.092Z
  https://docs.axolotl.ai/docs/multimodal.html
- 2025-08-28T13:10:49.897Z
+ 2025-08-29T17:52:59.095Z
  https://docs.axolotl.ai/docs/gradient_checkpointing.html
- 2025-08-28T13:10:49.894Z
+ 2025-08-29T17:52:59.092Z
  https://docs.axolotl.ai/docs/debugging.html
- 2025-08-28T13:10:49.894Z
+ 2025-08-29T17:52:59.092Z
  https://docs.axolotl.ai/docs/custom_integrations.html
- 2025-08-28T13:10:49.894Z
+ 2025-08-29T17:52:59.092Z
  https://docs.axolotl.ai/docs/nd_parallelism.html
- 2025-08-28T13:10:49.898Z
+ 2025-08-29T17:52:59.096Z
  https://docs.axolotl.ai/docs/quantize.html
- 2025-08-28T13:10:49.898Z
+ 2025-08-29T17:52:59.096Z
  https://docs.axolotl.ai/docs/nccl.html
- 2025-08-28T13:10:49.898Z
+ 2025-08-29T17:52:59.096Z
  https://docs.axolotl.ai/docs/config-reference.html
- 2025-08-28T13:14:07.393Z
+ 2025-08-29T17:56:40.104Z
  https://docs.axolotl.ai/docs/rlhf.html
- 2025-08-28T13:10:49.898Z
+ 2025-08-29T17:52:59.096Z
  https://docs.axolotl.ai/docs/cli.html
- 2025-08-28T13:10:49.894Z
+ 2025-08-29T17:52:59.092Z
  https://docs.axolotl.ai/docs/fsdp_qlora.html
- 2025-08-28T13:10:49.894Z
+ 2025-08-29T17:52:59.092Z
  https://docs.axolotl.ai/docs/qat.html
- 2025-08-28T13:10:49.898Z
+ 2025-08-29T17:52:59.096Z
  https://docs.axolotl.ai/src/axolotl/integrations/cut_cross_entropy/ACKNOWLEDGEMENTS.html
- 2025-08-28T13:10:49.918Z
+ 2025-08-29T17:52:59.117Z
  https://docs.axolotl.ai/index.html
- 2025-08-28T13:10:49.913Z
+ 2025-08-29T17:52:59.112Z
diff --git a/src/axolotl/integrations/LICENSE.html b/src/axolotl/integrations/LICENSE.html
index 524b5d7d3..9ad6bf121 100644
--- a/src/axolotl/integrations/LICENSE.html
+++ b/src/axolotl/integrations/LICENSE.html
@@ -2,7 +2,7 @@
-
+
@@ -36,7 +36,7 @@ ul.task-list li input[type="checkbox"] {
-
+
diff --git a/src/axolotl/integrations/cut_cross_entropy/ACKNOWLEDGEMENTS.html b/src/axolotl/integrations/cut_cross_entropy/ACKNOWLEDGEMENTS.html
index b0834de4b..ca49ad9f9 100644
--- a/src/axolotl/integrations/cut_cross_entropy/ACKNOWLEDGEMENTS.html
+++ b/src/axolotl/integrations/cut_cross_entropy/ACKNOWLEDGEMENTS.html
@@ -2,7 +2,7 @@
-
+
@@ -36,7 +36,7 @@ ul.task-list li input[type="checkbox"] {
-
+