diff --git a/.nojekyll b/.nojekyll index fbc726c4b..f021f0ddd 100644 --- a/.nojekyll +++ b/.nojekyll @@ -1 +1 @@ -616b3c49 \ No newline at end of file +1f653d45 \ No newline at end of file diff --git a/docs/nd_parallelism.html b/docs/nd_parallelism.html index c65b4fd77..60e725add 100644 --- a/docs/nd_parallelism.html +++ b/docs/nd_parallelism.html @@ -600,6 +600,19 @@ gtag('config', 'G-9KYCVJBNMQ', { 'anonymize_ip': true});

Examples

+
+
+
+ +
+
+Tip +
+
+
+

See our example configs here.

+
+
  1. HSDP on 2 nodes with 4 GPUs each (8 GPUs total):
      diff --git a/search.json b/search.json index 064bb0fda..139d02d18 100644 --- a/search.json +++ b/search.json @@ -1632,7 +1632,7 @@ "href": "docs/nd_parallelism.html#examples", "title": "N-D Parallelism (Beta)", "section": "Examples", - "text": "Examples\n\nHSDP on 2 nodes with 4 GPUs each (8 GPUs total):\n\nYou want FSDP within each node and DDP across nodes.\nSet dp_shard_size: 4 and dp_replicate_size: 2.\n\nFSDP + TP on a single 8-GPU node:\n\nYou want to split the model across 4 GPUs using FSDP, and further split each layer across 2 GPUs with TP.\nSet dp_shard_size: 4 and tensor_parallel_size: 2.\n\nFSDP + CP on a single 8-GPU node for long context:\n\nYou want to shard the model across all 8 GPUs and also split the sequence length across all 8 GPUs.\nSet dp_shard_size: 8 and context_parallel_size: 8. Note: this means the data parallel group and context parallel group are the same. A more common setup might be to shard across a smaller group.", + "text": "Examples\n\n\n\n\n\n\nTip\n\n\n\nSee our example configs here.\n\n\n\nHSDP on 2 nodes with 4 GPUs each (8 GPUs total):\n\nYou want FSDP within each node and DDP across nodes.\nSet dp_shard_size: 4 and dp_replicate_size: 2.\n\nFSDP + TP on a single 8-GPU node:\n\nYou want to split the model across 4 GPUs using FSDP, and further split each layer across 2 GPUs with TP.\nSet dp_shard_size: 4 and tensor_parallel_size: 2.\n\nFSDP + CP on a single 8-GPU node for long context:\n\nYou want to shard the model across all 8 GPUs and also split the sequence length across all 8 GPUs.\nSet dp_shard_size: 8 and context_parallel_size: 8. Note: this means the data parallel group and context parallel group are the same. A more common setup might be to shard across a smaller group.", "crumbs": [ "Advanced Features", "N-D Parallelism (Beta)" diff --git a/sitemap.xml b/sitemap.xml index 4dd89bf88..558e1be36 100644 --- a/sitemap.xml +++ b/sitemap.xml @@ -2,794 +2,794 @@ https://docs.axolotl.ai/TODO.html - 2025-08-08T06:30:24.576Z + 2025-08-08T11:45:45.421Z https://docs.axolotl.ai/index.html - 2025-08-08T06:30:24.597Z + 2025-08-08T11:45:45.442Z https://docs.axolotl.ai/docs/debugging.html - 2025-08-08T06:30:24.578Z + 2025-08-08T11:45:45.423Z https://docs.axolotl.ai/docs/amd_hpc.html - 2025-08-08T06:30:24.577Z + 2025-08-08T11:45:45.422Z https://docs.axolotl.ai/docs/api/utils.callbacks.mlflow_.html - 2025-08-08T06:33:46.594Z + 2025-08-08T11:49:07.417Z https://docs.axolotl.ai/docs/api/monkeypatch.llama_expand_mask.html - 2025-08-08T06:33:46.011Z + 2025-08-08T11:49:06.843Z https://docs.axolotl.ai/docs/api/loaders.patch_manager.html - 2025-08-08T06:33:45.648Z + 2025-08-08T11:49:06.482Z https://docs.axolotl.ai/docs/api/core.chat.format.llama3x.html - 2025-08-08T06:33:45.329Z + 2025-08-08T11:49:06.168Z https://docs.axolotl.ai/docs/api/cli.train.html - 2025-08-08T06:33:45.385Z + 2025-08-08T11:49:06.223Z https://docs.axolotl.ai/docs/api/utils.callbacks.perplexity.html - 2025-08-08T06:33:46.585Z + 2025-08-08T11:49:07.408Z https://docs.axolotl.ai/docs/api/core.chat.messages.html - 2025-08-08T06:33:45.326Z + 2025-08-08T11:49:06.165Z https://docs.axolotl.ai/docs/api/utils.callbacks.lisa.html - 2025-08-08T06:33:46.590Z + 2025-08-08T11:49:07.413Z https://docs.axolotl.ai/docs/api/cli.merge_sharded_fsdp_weights.html - 2025-08-08T06:33:45.480Z + 2025-08-08T11:49:06.315Z https://docs.axolotl.ai/docs/api/monkeypatch.mixtral.html - 2025-08-08T06:33:46.071Z + 2025-08-08T11:49:06.902Z https://docs.axolotl.ai/docs/api/utils.chat_templates.html - 2025-08-08T06:33:46.109Z + 2025-08-08T11:49:06.939Z https://docs.axolotl.ai/docs/api/core.chat.format.shared.html - 2025-08-08T06:33:45.330Z + 2025-08-08T11:49:06.169Z https://docs.axolotl.ai/docs/api/core.trainers.mixins.optimizer.html - 2025-08-08T06:33:45.655Z + 2025-08-08T11:49:06.489Z https://docs.axolotl.ai/docs/api/utils.collators.mamba.html - 2025-08-08T06:33:46.533Z + 2025-08-08T11:49:07.357Z https://docs.axolotl.ai/docs/api/logging_config.html - 2025-08-08T06:33:45.274Z + 2025-08-08T11:49:06.114Z https://docs.axolotl.ai/docs/api/utils.collators.mm_chat.html - 2025-08-08T06:33:46.538Z + 2025-08-08T11:49:07.362Z https://docs.axolotl.ai/docs/api/prompt_strategies.completion.html - 2025-08-08T06:33:45.778Z + 2025-08-08T11:49:06.611Z https://docs.axolotl.ai/docs/api/kernels.utils.html - 2025-08-08T06:33:45.996Z + 2025-08-08T11:49:06.828Z https://docs.axolotl.ai/docs/api/prompt_strategies.dpo.chat_template.html - 2025-08-08T06:33:45.811Z + 2025-08-08T11:49:06.644Z https://docs.axolotl.ai/docs/api/kernels.swiglu.html - 2025-08-08T06:33:45.987Z + 2025-08-08T11:49:06.819Z https://docs.axolotl.ai/docs/api/common.const.html - 2025-08-08T06:33:46.493Z + 2025-08-08T11:49:07.318Z https://docs.axolotl.ai/docs/api/cli.cloud.base.html - 2025-08-08T06:33:45.504Z + 2025-08-08T11:49:06.338Z https://docs.axolotl.ai/docs/api/prompt_strategies.orpo.chat_template.html - 2025-08-08T06:33:45.874Z + 2025-08-08T11:49:06.706Z https://docs.axolotl.ai/docs/api/core.builders.rl.html - 2025-08-08T06:33:45.290Z + 2025-08-08T11:49:06.130Z https://docs.axolotl.ai/docs/api/utils.dict.html - 2025-08-08T06:33:46.201Z + 2025-08-08T11:49:07.029Z https://docs.axolotl.ai/docs/api/utils.schemas.integrations.html - 2025-08-08T06:33:46.313Z + 2025-08-08T11:49:07.140Z https://docs.axolotl.ai/docs/api/core.trainers.utils.html - 2025-08-08T06:33:45.613Z + 2025-08-08T11:49:06.447Z https://docs.axolotl.ai/docs/api/monkeypatch.trainer_fsdp_optim.html - 2025-08-08T06:33:46.060Z + 2025-08-08T11:49:06.891Z https://docs.axolotl.ai/docs/api/cli.evaluate.html - 2025-08-08T06:33:45.394Z + 2025-08-08T11:49:06.231Z https://docs.axolotl.ai/docs/api/core.builders.causal.html - 2025-08-08T06:33:45.285Z + 2025-08-08T11:49:06.125Z https://docs.axolotl.ai/docs/api/monkeypatch.multipack.html - 2025-08-08T06:33:46.006Z + 2025-08-08T11:49:06.838Z https://docs.axolotl.ai/docs/api/monkeypatch.llama_patch_multipack.html - 2025-08-08T06:33:46.051Z + 2025-08-08T11:49:06.882Z https://docs.axolotl.ai/docs/api/cli.delinearize_llama4.html - 2025-08-08T06:33:45.445Z + 2025-08-08T11:49:06.282Z https://docs.axolotl.ai/docs/api/utils.schemas.trl.html - 2025-08-08T06:33:46.296Z + 2025-08-08T11:49:07.123Z https://docs.axolotl.ai/docs/api/prompt_strategies.dpo.zephyr.html - 2025-08-08T06:33:45.833Z + 2025-08-08T11:49:06.666Z https://docs.axolotl.ai/docs/api/integrations.kd.trainer.html - 2025-08-08T06:33:46.480Z + 2025-08-08T11:49:07.306Z https://docs.axolotl.ai/docs/api/monkeypatch.gradient_checkpointing.offload_disk.html - 2025-08-08T06:33:46.101Z + 2025-08-08T11:49:06.931Z https://docs.axolotl.ai/docs/api/utils.optimizers.adopt.html - 2025-08-08T06:33:46.209Z + 2025-08-08T11:49:07.037Z https://docs.axolotl.ai/docs/api/monkeypatch.data.batch_dataset_fetcher.html - 2025-08-08T06:33:46.070Z + 2025-08-08T11:49:06.900Z https://docs.axolotl.ai/docs/api/cli.cloud.modal_.html - 2025-08-08T06:33:45.510Z + 2025-08-08T11:49:06.345Z https://docs.axolotl.ai/docs/api/prompt_strategies.alpaca_chat.html - 2025-08-08T06:33:45.738Z + 2025-08-08T11:49:06.571Z https://docs.axolotl.ai/docs/api/utils.freeze.html - 2025-08-08T06:33:46.131Z + 2025-08-08T11:49:06.960Z https://docs.axolotl.ai/docs/api/prompt_strategies.bradley_terry.llama3.html - 2025-08-08T06:33:45.878Z + 2025-08-08T11:49:06.710Z https://docs.axolotl.ai/docs/api/integrations.base.html - 2025-08-08T06:33:46.468Z + 2025-08-08T11:49:07.294Z https://docs.axolotl.ai/docs/api/monkeypatch.unsloth_.html - 2025-08-08T06:33:46.068Z + 2025-08-08T11:49:06.899Z https://docs.axolotl.ai/docs/api/prompt_strategies.kto.chatml.html - 2025-08-08T06:33:45.852Z + 2025-08-08T11:49:06.685Z https://docs.axolotl.ai/docs/api/cli.main.html - 2025-08-08T06:33:45.377Z + 2025-08-08T11:49:06.215Z https://docs.axolotl.ai/docs/api/common.datasets.html - 2025-08-08T06:33:46.508Z + 2025-08-08T11:49:07.333Z https://docs.axolotl.ai/docs/api/train.html - 2025-08-08T06:33:45.188Z + 2025-08-08T11:49:06.028Z https://docs.axolotl.ai/docs/api/utils.trainer.html - 2025-08-08T06:33:46.148Z + 2025-08-08T11:49:06.977Z https://docs.axolotl.ai/docs/api/prompt_strategies.llama2_chat.html - 2025-08-08T06:33:45.772Z + 2025-08-08T11:49:06.605Z https://docs.axolotl.ai/docs/api/index.html - 2025-08-08T06:33:45.126Z + 2025-08-08T11:49:05.967Z https://docs.axolotl.ai/docs/api/prompt_strategies.chat_template.html - 2025-08-08T06:33:45.724Z + 2025-08-08T11:49:06.557Z https://docs.axolotl.ai/docs/api/core.training_args.html - 2025-08-08T06:33:45.303Z + 2025-08-08T11:49:06.142Z https://docs.axolotl.ai/docs/api/kernels.quantize.html - 2025-08-08T06:33:45.995Z + 2025-08-08T11:49:06.826Z https://docs.axolotl.ai/docs/api/convert.html - 2025-08-08T06:33:45.223Z + 2025-08-08T11:49:06.063Z https://docs.axolotl.ai/docs/api/integrations.grokfast.optimizer.html - 2025-08-08T06:33:46.472Z + 2025-08-08T11:49:07.298Z https://docs.axolotl.ai/docs/api/prompt_strategies.stepwise_supervised.html - 2025-08-08T06:33:45.789Z + 2025-08-08T11:49:06.621Z https://docs.axolotl.ai/docs/api/utils.schemas.model.html - 2025-08-08T06:33:46.259Z + 2025-08-08T11:49:07.086Z https://docs.axolotl.ai/docs/api/utils.callbacks.qat.html - 2025-08-08T06:33:46.604Z + 2025-08-08T11:49:07.427Z https://docs.axolotl.ai/docs/api/loaders.constants.html - 2025-08-08T06:33:45.650Z + 2025-08-08T11:49:06.484Z https://docs.axolotl.ai/docs/api/cli.utils.sweeps.html - 2025-08-08T06:33:45.540Z + 2025-08-08T11:49:06.374Z https://docs.axolotl.ai/docs/api/prompt_strategies.dpo.llama3.html - 2025-08-08T06:33:45.822Z + 2025-08-08T11:49:06.654Z https://docs.axolotl.ai/docs/api/core.datasets.transforms.chat_builder.html - 2025-08-08T06:33:45.343Z + 2025-08-08T11:49:06.182Z https://docs.axolotl.ai/docs/api/cli.utils.fetch.html - 2025-08-08T06:33:45.528Z + 2025-08-08T11:49:06.363Z https://docs.axolotl.ai/docs/api/core.trainers.mamba.html - 2025-08-08T06:33:45.582Z + 2025-08-08T11:49:06.416Z https://docs.axolotl.ai/docs/api/utils.schemas.enums.html - 2025-08-08T06:33:46.324Z + 2025-08-08T11:49:07.150Z https://docs.axolotl.ai/docs/api/utils.callbacks.profiler.html - 2025-08-08T06:33:46.589Z + 2025-08-08T11:49:07.412Z https://docs.axolotl.ai/docs/api/prompt_strategies.metharme.html - 2025-08-08T06:33:45.795Z + 2025-08-08T11:49:06.628Z https://docs.axolotl.ai/docs/api/core.trainers.trl.html - 2025-08-08T06:33:45.576Z + 2025-08-08T11:49:06.410Z https://docs.axolotl.ai/docs/api/prompt_strategies.orcamini.html - 2025-08-08T06:33:45.799Z + 2025-08-08T11:49:06.632Z https://docs.axolotl.ai/docs/api/utils.samplers.multipack.html - 2025-08-08T06:33:46.579Z + 2025-08-08T11:49:07.402Z https://docs.axolotl.ai/docs/api/utils.schedulers.html - 2025-08-08T06:33:46.176Z + 2025-08-08T11:49:07.004Z https://docs.axolotl.ai/docs/api/core.trainers.grpo.trainer.html - 2025-08-08T06:33:45.600Z + 2025-08-08T11:49:06.434Z https://docs.axolotl.ai/docs/api/prompt_tokenizers.html - 2025-08-08T06:33:45.265Z + 2025-08-08T11:49:06.105Z https://docs.axolotl.ai/docs/config-reference.html - 2025-08-08T06:33:59.739Z + 2025-08-08T11:49:20.350Z https://docs.axolotl.ai/docs/multimodal.html - 2025-08-08T06:30:24.581Z + 2025-08-08T11:45:45.426Z https://docs.axolotl.ai/docs/mixed_precision.html - 2025-08-08T06:30:24.581Z + 2025-08-08T11:45:45.426Z https://docs.axolotl.ai/docs/unsloth.html - 2025-08-08T06:30:24.582Z + 2025-08-08T11:45:45.427Z https://docs.axolotl.ai/docs/ray-integration.html - 2025-08-08T06:30:24.581Z + 2025-08-08T11:45:45.426Z https://docs.axolotl.ai/docs/dataset-formats/stepwise_supervised.html - 2025-08-08T06:30:24.578Z + 2025-08-08T11:45:45.422Z https://docs.axolotl.ai/docs/dataset-formats/template_free.html - 2025-08-08T06:30:24.578Z + 2025-08-08T11:45:45.422Z https://docs.axolotl.ai/docs/dataset-formats/index.html - 2025-08-08T06:30:24.578Z + 2025-08-08T11:45:45.422Z https://docs.axolotl.ai/docs/dataset-formats/pretraining.html - 2025-08-08T06:30:24.578Z + 2025-08-08T11:45:45.422Z https://docs.axolotl.ai/docs/nd_parallelism.html - 2025-08-08T06:30:24.581Z + 2025-08-08T11:45:45.426Z https://docs.axolotl.ai/docs/sequence_parallelism.html - 2025-08-08T06:30:24.582Z + 2025-08-08T11:45:45.427Z https://docs.axolotl.ai/docs/inference.html - 2025-08-08T06:30:24.581Z + 2025-08-08T11:45:45.426Z https://docs.axolotl.ai/docs/fsdp_qlora.html - 2025-08-08T06:30:24.578Z + 2025-08-08T11:45:45.423Z https://docs.axolotl.ai/docs/multi-node.html - 2025-08-08T06:30:24.581Z + 2025-08-08T11:45:45.426Z https://docs.axolotl.ai/docs/lora_optims.html - 2025-08-08T06:30:24.581Z + 2025-08-08T11:45:45.426Z https://docs.axolotl.ai/docs/getting-started.html - 2025-08-08T06:30:24.578Z + 2025-08-08T11:45:45.423Z https://docs.axolotl.ai/docs/dataset_loading.html - 2025-08-08T06:30:24.578Z + 2025-08-08T11:45:45.423Z https://docs.axolotl.ai/docs/lr_groups.html - 2025-08-08T06:30:24.581Z + 2025-08-08T11:45:45.426Z https://docs.axolotl.ai/docs/input_output.html - 2025-08-08T06:30:24.581Z + 2025-08-08T11:45:45.426Z https://docs.axolotl.ai/src/axolotl/integrations/LICENSE.html - 2025-08-08T06:30:24.601Z + 2025-08-08T11:45:45.446Z https://docs.axolotl.ai/src/axolotl/integrations/cut_cross_entropy/ACKNOWLEDGEMENTS.html - 2025-08-08T06:30:24.601Z + 2025-08-08T11:45:45.446Z https://docs.axolotl.ai/docs/mac.html - 2025-08-08T06:30:24.581Z + 2025-08-08T11:45:45.426Z https://docs.axolotl.ai/docs/optimizers.html - 2025-08-08T06:30:24.581Z + 2025-08-08T11:45:45.426Z https://docs.axolotl.ai/docs/gradient_checkpointing.html - 2025-08-08T06:30:24.578Z + 2025-08-08T11:45:45.423Z https://docs.axolotl.ai/docs/qat.html - 2025-08-08T06:30:24.581Z + 2025-08-08T11:45:45.426Z https://docs.axolotl.ai/docs/faq.html - 2025-08-08T06:30:24.578Z + 2025-08-08T11:45:45.423Z https://docs.axolotl.ai/docs/dataset_preprocessing.html - 2025-08-08T06:30:24.578Z + 2025-08-08T11:45:45.423Z https://docs.axolotl.ai/docs/nccl.html - 2025-08-08T06:30:24.581Z + 2025-08-08T11:45:45.426Z https://docs.axolotl.ai/docs/cli.html - 2025-08-08T06:30:24.577Z + 2025-08-08T11:45:45.422Z https://docs.axolotl.ai/docs/torchao.html - 2025-08-08T06:30:24.582Z + 2025-08-08T11:45:45.427Z https://docs.axolotl.ai/docs/multi-gpu.html - 2025-08-08T06:30:24.581Z + 2025-08-08T11:45:45.426Z https://docs.axolotl.ai/docs/rlhf.html - 2025-08-08T06:30:24.582Z + 2025-08-08T11:45:45.426Z https://docs.axolotl.ai/docs/dataset-formats/tokenized.html - 2025-08-08T06:30:24.578Z + 2025-08-08T11:45:45.423Z https://docs.axolotl.ai/docs/dataset-formats/conversation.html - 2025-08-08T06:30:24.577Z + 2025-08-08T11:45:45.422Z https://docs.axolotl.ai/docs/dataset-formats/inst_tune.html - 2025-08-08T06:30:24.578Z + 2025-08-08T11:45:45.422Z https://docs.axolotl.ai/docs/reward_modelling.html - 2025-08-08T06:30:24.582Z + 2025-08-08T11:45:45.426Z https://docs.axolotl.ai/docs/docker.html - 2025-08-08T06:30:24.578Z + 2025-08-08T11:45:45.423Z https://docs.axolotl.ai/docs/installation.html - 2025-08-08T06:30:24.581Z + 2025-08-08T11:45:45.426Z https://docs.axolotl.ai/docs/quantize.html - 2025-08-08T06:30:24.581Z + 2025-08-08T11:45:45.426Z https://docs.axolotl.ai/docs/custom_integrations.html - 2025-08-08T06:30:24.577Z + 2025-08-08T11:45:45.422Z https://docs.axolotl.ai/docs/batch_vs_grad.html - 2025-08-08T06:30:24.577Z + 2025-08-08T11:45:45.422Z https://docs.axolotl.ai/docs/api/cli.utils.train.html - 2025-08-08T06:33:45.551Z + 2025-08-08T11:49:06.385Z https://docs.axolotl.ai/docs/api/cli.art.html - 2025-08-08T06:33:45.416Z + 2025-08-08T11:49:06.253Z https://docs.axolotl.ai/docs/api/core.trainers.grpo.sampler.html - 2025-08-08T06:33:45.612Z + 2025-08-08T11:49:06.446Z https://docs.axolotl.ai/docs/api/loaders.model.html - 2025-08-08T06:33:45.623Z + 2025-08-08T11:49:06.457Z https://docs.axolotl.ai/docs/api/cli.preprocess.html - 2025-08-08T06:33:45.488Z + 2025-08-08T11:49:06.323Z https://docs.axolotl.ai/docs/api/cli.utils.html - 2025-08-08T06:33:45.511Z + 2025-08-08T11:49:06.346Z https://docs.axolotl.ai/docs/api/cli.inference.html - 2025-08-08T06:33:45.460Z + 2025-08-08T11:49:06.295Z https://docs.axolotl.ai/docs/api/monkeypatch.btlm_attn_hijack_flash.html - 2025-08-08T06:33:46.050Z + 2025-08-08T11:49:06.880Z https://docs.axolotl.ai/docs/api/datasets.html - 2025-08-08T06:33:45.210Z + 2025-08-08T11:49:06.050Z https://docs.axolotl.ai/docs/api/monkeypatch.transformers_fa_utils.html - 2025-08-08T06:33:46.067Z + 2025-08-08T11:49:06.897Z https://docs.axolotl.ai/docs/api/monkeypatch.llama_attn_hijack_flash.html - 2025-08-08T06:33:46.002Z + 2025-08-08T11:49:06.833Z https://docs.axolotl.ai/docs/api/monkeypatch.relora.html - 2025-08-08T06:33:46.010Z + 2025-08-08T11:49:06.841Z https://docs.axolotl.ai/docs/api/monkeypatch.stablelm_attn_hijack_flash.html - 2025-08-08T06:33:46.057Z + 2025-08-08T11:49:06.888Z https://docs.axolotl.ai/docs/api/loaders.adapter.html - 2025-08-08T06:33:45.638Z + 2025-08-08T11:49:06.472Z https://docs.axolotl.ai/docs/api/core.trainers.dpo.trainer.html - 2025-08-08T06:33:45.589Z + 2025-08-08T11:49:06.423Z https://docs.axolotl.ai/docs/api/integrations.cut_cross_entropy.args.html - 2025-08-08T06:33:46.471Z + 2025-08-08T11:49:07.297Z https://docs.axolotl.ai/docs/api/monkeypatch.utils.html - 2025-08-08T06:33:46.048Z + 2025-08-08T11:49:06.879Z https://docs.axolotl.ai/docs/api/loaders.processor.html - 2025-08-08T06:33:45.633Z + 2025-08-08T11:49:06.467Z https://docs.axolotl.ai/docs/api/cli.config.html - 2025-08-08T06:33:45.441Z + 2025-08-08T11:49:06.277Z https://docs.axolotl.ai/docs/api/integrations.liger.args.html - 2025-08-08T06:33:46.483Z + 2025-08-08T11:49:07.309Z https://docs.axolotl.ai/docs/api/loaders.tokenizer.html - 2025-08-08T06:33:45.631Z + 2025-08-08T11:49:06.465Z https://docs.axolotl.ai/docs/api/utils.schemas.config.html - 2025-08-08T06:33:46.252Z + 2025-08-08T11:49:07.079Z https://docs.axolotl.ai/docs/api/utils.ctx_managers.sequence_parallel.html - 2025-08-08T06:33:45.689Z + 2025-08-08T11:49:06.522Z https://docs.axolotl.ai/docs/api/core.trainers.mixins.scheduler.html - 2025-08-08T06:33:45.665Z + 2025-08-08T11:49:06.499Z https://docs.axolotl.ai/docs/api/core.trainers.base.html - 2025-08-08T06:33:45.561Z + 2025-08-08T11:49:06.395Z https://docs.axolotl.ai/docs/api/cli.utils.args.html - 2025-08-08T06:33:45.523Z + 2025-08-08T11:49:06.357Z https://docs.axolotl.ai/docs/api/prompt_strategies.messages.chat.html - 2025-08-08T06:33:45.810Z + 2025-08-08T11:49:06.642Z https://docs.axolotl.ai/docs/api/monkeypatch.lora_kernels.html - 2025-08-08T06:33:46.040Z + 2025-08-08T11:49:06.871Z https://docs.axolotl.ai/docs/api/kernels.lora.html - 2025-08-08T06:33:45.967Z + 2025-08-08T11:49:06.798Z https://docs.axolotl.ai/docs/api/cli.vllm_serve.html - 2025-08-08T06:33:45.500Z + 2025-08-08T11:49:06.335Z https://docs.axolotl.ai/docs/api/utils.schemas.multimodal.html - 2025-08-08T06:33:46.301Z + 2025-08-08T11:49:07.128Z https://docs.axolotl.ai/docs/api/utils.schemas.utils.html - 2025-08-08T06:33:46.329Z + 2025-08-08T11:49:07.156Z https://docs.axolotl.ai/docs/api/monkeypatch.llama_attn_hijack_xformers.html - 2025-08-08T06:33:46.003Z + 2025-08-08T11:49:06.835Z https://docs.axolotl.ai/docs/api/integrations.lm_eval.args.html - 2025-08-08T06:33:46.486Z + 2025-08-08T11:49:07.312Z https://docs.axolotl.ai/docs/api/monkeypatch.mistral_attn_hijack_flash.html - 2025-08-08T06:33:46.005Z + 2025-08-08T11:49:06.836Z https://docs.axolotl.ai/docs/api/utils.collators.core.html - 2025-08-08T06:33:46.510Z + 2025-08-08T11:49:07.335Z https://docs.axolotl.ai/docs/api/core.chat.format.chatml.html - 2025-08-08T06:33:45.327Z + 2025-08-08T11:49:06.166Z https://docs.axolotl.ai/docs/api/prompt_strategies.dpo.passthrough.html - 2025-08-08T06:33:45.836Z + 2025-08-08T11:49:06.669Z https://docs.axolotl.ai/docs/api/core.datasets.chat.html - 2025-08-08T06:33:45.335Z + 2025-08-08T11:49:06.174Z https://docs.axolotl.ai/docs/api/utils.bench.html - 2025-08-08T06:33:46.123Z + 2025-08-08T11:49:06.953Z https://docs.axolotl.ai/docs/api/utils.schemas.training.html - 2025-08-08T06:33:46.266Z + 2025-08-08T11:49:07.093Z https://docs.axolotl.ai/docs/api/utils.collators.batching.html - 2025-08-08T06:33:46.530Z + 2025-08-08T11:49:07.354Z https://docs.axolotl.ai/docs/api/prompt_strategies.input_output.html - 2025-08-08T06:33:45.784Z + 2025-08-08T11:49:06.617Z https://docs.axolotl.ai/docs/api/utils.lora.html - 2025-08-08T06:33:46.114Z + 2025-08-08T11:49:06.944Z https://docs.axolotl.ai/docs/api/prompt_strategies.base.html - 2025-08-08T06:33:45.691Z + 2025-08-08T11:49:06.524Z https://docs.axolotl.ai/docs/api/prompt_strategies.alpaca_w_system.html - 2025-08-08T06:33:45.752Z + 2025-08-08T11:49:06.584Z https://docs.axolotl.ai/docs/api/utils.schemas.datasets.html - 2025-08-08T06:33:46.284Z + 2025-08-08T11:49:07.111Z https://docs.axolotl.ai/docs/api/prompt_strategies.dpo.user_defined.html - 2025-08-08T06:33:45.835Z + 2025-08-08T11:49:06.667Z https://docs.axolotl.ai/docs/api/utils.schemas.peft.html - 2025-08-08T06:33:46.292Z + 2025-08-08T11:49:07.119Z https://docs.axolotl.ai/docs/api/prompt_strategies.pygmalion.html - 2025-08-08T06:33:45.806Z + 2025-08-08T11:49:06.638Z https://docs.axolotl.ai/docs/api/common.architectures.html - 2025-08-08T06:33:46.491Z + 2025-08-08T11:49:07.317Z https://docs.axolotl.ai/docs/api/monkeypatch.gradient_checkpointing.offload_cpu.html - 2025-08-08T06:33:46.075Z + 2025-08-08T11:49:06.905Z https://docs.axolotl.ai/docs/api/utils.callbacks.comet_.html - 2025-08-08T06:33:46.598Z + 2025-08-08T11:49:07.420Z https://docs.axolotl.ai/docs/api/integrations.spectrum.args.html - 2025-08-08T06:33:46.490Z + 2025-08-08T11:49:07.315Z https://docs.axolotl.ai/docs/api/cli.quantize.html - 2025-08-08T06:33:45.493Z + 2025-08-08T11:49:06.328Z https://docs.axolotl.ai/docs/api/cli.checks.html - 2025-08-08T06:33:45.423Z + 2025-08-08T11:49:06.259Z https://docs.axolotl.ai/docs/api/prompt_strategies.kto.llama3.html - 2025-08-08T06:33:45.844Z + 2025-08-08T11:49:06.677Z https://docs.axolotl.ai/docs/api/utils.model_shard_quant.html - 2025-08-08T06:33:46.119Z + 2025-08-08T11:49:06.949Z https://docs.axolotl.ai/docs/api/utils.quantization.html - 2025-08-08T06:33:46.238Z + 2025-08-08T11:49:07.065Z https://docs.axolotl.ai/docs/api/core.trainers.mixins.rng_state_loader.html - 2025-08-08T06:33:45.659Z + 2025-08-08T11:49:06.492Z https://docs.axolotl.ai/docs/api/kernels.geglu.html - 2025-08-08T06:33:45.977Z + 2025-08-08T11:49:06.809Z https://docs.axolotl.ai/docs/api/utils.data.pretraining.html - 2025-08-08T06:33:46.210Z + 2025-08-08T11:49:07.038Z https://docs.axolotl.ai/docs/api/prompt_strategies.kto.user_defined.html - 2025-08-08T06:33:45.854Z + 2025-08-08T11:49:06.686Z https://docs.axolotl.ai/docs/api/core.builders.base.html - 2025-08-08T06:33:45.281Z + 2025-08-08T11:49:06.120Z https://docs.axolotl.ai/docs/api/cli.merge_lora.html - 2025-08-08T06:33:45.468Z + 2025-08-08T11:49:06.304Z https://docs.axolotl.ai/docs/api/cli.utils.load.html - 2025-08-08T06:33:45.534Z + 2025-08-08T11:49:06.368Z https://docs.axolotl.ai/docs/api/utils.data.sft.html - 2025-08-08T06:33:46.217Z + 2025-08-08T11:49:07.045Z https://docs.axolotl.ai/docs/api/prompt_strategies.user_defined.html - 2025-08-08T06:33:45.760Z + 2025-08-08T11:49:06.592Z https://docs.axolotl.ai/docs/api/utils.tokenization.html - 2025-08-08T06:33:46.107Z + 2025-08-08T11:49:06.938Z https://docs.axolotl.ai/docs/api/prompt_strategies.dpo.chatml.html - 2025-08-08T06:33:45.832Z + 2025-08-08T11:49:06.664Z https://docs.axolotl.ai/docs/api/models.mamba.modeling_mamba.html - 2025-08-08T06:33:46.509Z + 2025-08-08T11:49:07.334Z https://docs.axolotl.ai/docs/api/cli.args.html - 2025-08-08T06:33:45.413Z + 2025-08-08T11:49:06.250Z https://docs.axolotl.ai/docs/api/evaluate.html - 2025-08-08T06:33:45.199Z + 2025-08-08T11:49:06.039Z https://docs.axolotl.ai/docs/api/prompt_strategies.alpaca_instruct.html - 2025-08-08T06:33:45.740Z + 2025-08-08T11:49:06.572Z https://docs.axolotl.ai/docs/api/utils.distributed.html - 2025-08-08T06:33:46.196Z + 2025-08-08T11:49:07.024Z https://docs.axolotl.ai/docs/multipack.html - 2025-08-08T06:30:24.581Z + 2025-08-08T11:45:45.426Z https://docs.axolotl.ai/examples/colab-notebooks/colab-axolotl-example.html - 2025-08-08T06:30:24.586Z + 2025-08-08T11:45:45.430Z https://docs.axolotl.ai/FAQS.html - 2025-08-08T06:30:24.576Z + 2025-08-08T11:45:45.421Z