Original implementation of QKV projection without optimizations.
patch_self_attn_lora
- monkeypatch.lora_kernels.patch_self_attn_lora(cfg)
+ monkeypatch.lora_kernels.patch_self_attn_lora(cfg)
Given an axolotl config, this method patches the inferred attention class forward
pass with optimized LoRA implementations.
It modifies the attention class to use optimized QKV and output projections. The
diff --git a/search.json b/search.json
index 4a2d91753..a149b0691 100644
--- a/search.json
+++ b/search.json
@@ -2807,7 +2807,14 @@
"href": "docs/api/monkeypatch.lora_kernels.html",
"title": "monkeypatch.lora_kernels",
"section": "",
- "text": "monkeypatch.lora_kernels\nModule for patching custom LoRA Triton kernels and torch.autograd functions.\n\n\n\n\n\nName\nDescription\n\n\n\n\napply_lora_kernel_patches\nApplies optimized Triton kernel patches to a PEFT model.\n\n\nget_attention_cls_from_config\nGet the appropriate attention class by inspecting the model config.\n\n\noriginal_apply_o\nOriginal implementation of output projection without optimizations.\n\n\noriginal_apply_qkv\nOriginal implementation of QKV projection without optimizations.\n\n\npatch_self_attn_lora\nGiven an axolotl config, this method patches the inferred attention class forward\n\n\n\n\n\nmonkeypatch.lora_kernels.apply_lora_kernel_patches(model, cfg)\nApplies optimized Triton kernel patches to a PEFT model.\nPatches a PEFT model with optimized implementations for MLP and attention\ncomputations. The optimizations include custom Triton kernels for activation\nfunctions and specialized autograd functions for LoRA computations.\n\n\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\nDefault\n\n\n\n\nmodel\nPeftModelForCausalLM\nA PEFT model to be patched with optimized kernels.\nrequired\n\n\ncfg\nDictDefault\nDictionary mapping axolotl config keys to values.\nrequired\n\n\n\n\n\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\n\n\n\n\nPeftModelForCausalLM\nPeftModelForCausalLM\nThe patched model with optimized kernels.\n\n\n\n\n\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\n\n\n\n\n\nTypeError\nIf the provided model is not a PeftModelForCausalLM.\n\n\n\nNotImplementedError\nIf the model type is not supported.\n\n\n\nAssertionError\nIf multiple adapters are active (currently unsupported).\n\n\n\n\n\n\nThe optimizations require LoRA adapters with no dropout and no bias terms. The\nfunction will skip patching if these conditions aren’t met.\n\n\n\n\nmonkeypatch.lora_kernels.get_attention_cls_from_config(cfg)\nGet the appropriate attention class by inspecting the model config.\nUses dynamic import to support any model architecture that follows\nthe standard transformers naming convention.\n\n\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\nDefault\n\n\n\n\ncfg\nDictDefault\nDictionary mapping axolotl config keys to values.\nrequired\n\n\n\n\n\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\n\n\n\n\n\nType[nn.Module]\nThe appropriate attention class for the model.\n\n\n\n\n\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\n\n\n\n\n\nValueError\nIf base_model not specified or attention class cannot be imported\n\n\n\nImportError\nIf the model module or attention class doesn’t exist\n\n\n\n\n\n\n\nmonkeypatch.lora_kernels.original_apply_o(self, hidden_states)\nOriginal implementation of output projection without optimizations.\n\n\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\nDefault\n\n\n\n\nself\nnn.Module\nThe attention module instance.\nrequired\n\n\nhidden_states\ntorch.Tensor\nInput tensor of shape [batch_size, seq_len, hidden_dim]`.\nrequired\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\n\n\n\n\n\ntorch.Tensor\nThe output projection result.\n\n\n\n\n\n\n\nmonkeypatch.lora_kernels.original_apply_qkv(self, hidden_states)\nOriginal implementation of QKV projection without optimizations.\n\n\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\nDefault\n\n\n\n\nself\nnn.Module\nThe attention module instance.\nrequired\n\n\nhidden_states\ntorch.Tensor\nInput tensor of shape [batch_size, seq_len, hidden_dim].\nrequired\n\n\n\n\n\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\n\n\n\n\n\ntuple[torch.Tensor, torch.Tensor, torch.Tensor]\nA tuple (query_states, key_states, value_states) containing the projected states for query, key, and value.\n\n\n\n\n\n\n\nmonkeypatch.lora_kernels.patch_self_attn_lora(cfg)\nGiven an axolotl config, this method patches the inferred attention class forward\npass with optimized LoRA implementations.\nIt modifies the attention class to use optimized QKV and output projections. The\noriginal implementation is preserved and can be restored if needed.\n\n\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\nDefault\n\n\n\n\ncfg\nDictDefault\nDictionary mapping axolotl config keys to values.\nrequired\n\n\n\n\n\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\n\n\n\n\n\nAssertionError\nIf the required code blocks are not found in the attention implementation."
+ "text": "monkeypatch.lora_kernels\nModule for patching custom LoRA Triton kernels and torch.autograd functions.\n\n\n\n\n\nName\nDescription\n\n\n\n\nFakeMLP\nplaceholder MLP for triton patching\n\n\n\n\n\nmonkeypatch.lora_kernels.FakeMLP(self, gate_proj, up_proj, down_proj)\nplaceholder MLP for triton patching\n\n\n\n\n\n\n\nName\nDescription\n\n\n\n\napply_lora_kernel_patches\nApplies optimized Triton kernel patches to a PEFT model.\n\n\nget_attention_cls_from_config\nGet the appropriate attention class by inspecting the model config.\n\n\noriginal_apply_o\nOriginal implementation of output projection without optimizations.\n\n\noriginal_apply_qkv\nOriginal implementation of QKV projection without optimizations.\n\n\npatch_self_attn_lora\nGiven an axolotl config, this method patches the inferred attention class forward\n\n\n\n\n\nmonkeypatch.lora_kernels.apply_lora_kernel_patches(model, cfg)\nApplies optimized Triton kernel patches to a PEFT model.\nPatches a PEFT model with optimized implementations for MLP and attention\ncomputations. The optimizations include custom Triton kernels for activation\nfunctions and specialized autograd functions for LoRA computations.\n\n\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\nDefault\n\n\n\n\nmodel\nPeftModelForCausalLM\nA PEFT model to be patched with optimized kernels.\nrequired\n\n\ncfg\nDictDefault\nDictionary mapping axolotl config keys to values.\nrequired\n\n\n\n\n\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\n\n\n\n\nPeftModelForCausalLM\nPeftModelForCausalLM\nThe patched model with optimized kernels.\n\n\n\n\n\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\n\n\n\n\n\nTypeError\nIf the provided model is not a PeftModelForCausalLM.\n\n\n\nNotImplementedError\nIf the model type is not supported.\n\n\n\nAssertionError\nIf multiple adapters are active (currently unsupported).\n\n\n\n\n\n\nThe optimizations require LoRA adapters with no dropout and no bias terms. The\nfunction will skip patching if these conditions aren’t met.\n\n\n\n\nmonkeypatch.lora_kernels.get_attention_cls_from_config(cfg)\nGet the appropriate attention class by inspecting the model config.\nUses dynamic import to support any model architecture that follows\nthe standard transformers naming convention.\n\n\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\nDefault\n\n\n\n\ncfg\nDictDefault\nDictionary mapping axolotl config keys to values.\nrequired\n\n\n\n\n\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\n\n\n\n\n\nType[nn.Module]\nThe appropriate attention class for the model.\n\n\n\n\n\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\n\n\n\n\n\nValueError\nIf base_model not specified or attention class cannot be imported\n\n\n\nImportError\nIf the model module or attention class doesn’t exist\n\n\n\n\n\n\n\nmonkeypatch.lora_kernels.original_apply_o(self, hidden_states)\nOriginal implementation of output projection without optimizations.\n\n\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\nDefault\n\n\n\n\nself\nnn.Module\nThe attention module instance.\nrequired\n\n\nhidden_states\ntorch.Tensor\nInput tensor of shape [batch_size, seq_len, hidden_dim]`.\nrequired\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\n\n\n\n\n\ntorch.Tensor\nThe output projection result.\n\n\n\n\n\n\n\nmonkeypatch.lora_kernels.original_apply_qkv(self, hidden_states)\nOriginal implementation of QKV projection without optimizations.\n\n\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\nDefault\n\n\n\n\nself\nnn.Module\nThe attention module instance.\nrequired\n\n\nhidden_states\ntorch.Tensor\nInput tensor of shape [batch_size, seq_len, hidden_dim].\nrequired\n\n\n\n\n\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\n\n\n\n\n\ntuple[torch.Tensor, torch.Tensor, torch.Tensor]\nA tuple (query_states, key_states, value_states) containing the projected states for query, key, and value.\n\n\n\n\n\n\n\nmonkeypatch.lora_kernels.patch_self_attn_lora(cfg)\nGiven an axolotl config, this method patches the inferred attention class forward\npass with optimized LoRA implementations.\nIt modifies the attention class to use optimized QKV and output projections. The\noriginal implementation is preserved and can be restored if needed.\n\n\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\nDefault\n\n\n\n\ncfg\nDictDefault\nDictionary mapping axolotl config keys to values.\nrequired\n\n\n\n\n\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\n\n\n\n\n\nAssertionError\nIf the required code blocks are not found in the attention implementation."
+ },
+ {
+ "objectID": "docs/api/monkeypatch.lora_kernels.html#classes",
+ "href": "docs/api/monkeypatch.lora_kernels.html#classes",
+ "title": "monkeypatch.lora_kernels",
+ "section": "",
+ "text": "Name\nDescription\n\n\n\n\nFakeMLP\nplaceholder MLP for triton patching\n\n\n\n\n\nmonkeypatch.lora_kernels.FakeMLP(self, gate_proj, up_proj, down_proj)\nplaceholder MLP for triton patching"
},
{
"objectID": "docs/api/monkeypatch.lora_kernels.html#functions",
diff --git a/sitemap.xml b/sitemap.xml
index 508f3d178..2b0fe63ee 100644
--- a/sitemap.xml
+++ b/sitemap.xml
@@ -2,682 +2,682 @@
https://axolotl-ai-cloud.github.io/axolotl/examples/colab-notebooks/colab-axolotl-example.html
- 2025-04-07T21:12:39.574Z
+ 2025-04-09T06:53:29.055Z
https://axolotl-ai-cloud.github.io/axolotl/docs/dataset-formats/stepwise_supervised.html
- 2025-04-07T21:12:39.569Z
+ 2025-04-09T06:53:29.050Z
https://axolotl-ai-cloud.github.io/axolotl/docs/dataset-formats/template_free.html
- 2025-04-07T21:12:39.569Z
+ 2025-04-09T06:53:29.050Z
https://axolotl-ai-cloud.github.io/axolotl/docs/dataset-formats/tokenized.html
- 2025-04-07T21:12:39.569Z
+ 2025-04-09T06:53:29.050Z
https://axolotl-ai-cloud.github.io/axolotl/docs/nccl.html
- 2025-04-07T21:12:39.573Z
+ 2025-04-09T06:53:29.054Z
https://axolotl-ai-cloud.github.io/axolotl/docs/amd_hpc.html
- 2025-04-07T21:12:39.569Z
+ 2025-04-09T06:53:29.050Z
https://axolotl-ai-cloud.github.io/axolotl/docs/config.html
- 2025-04-07T21:12:39.569Z
+ 2025-04-09T06:53:29.050Z
https://axolotl-ai-cloud.github.io/axolotl/docs/multi-gpu.html
- 2025-04-07T21:12:39.573Z
+ 2025-04-09T06:53:29.054Z
https://axolotl-ai-cloud.github.io/axolotl/docs/installation.html
- 2025-04-07T21:12:39.572Z
+ 2025-04-09T06:53:29.053Z
https://axolotl-ai-cloud.github.io/axolotl/docs/torchao.html
- 2025-04-07T21:12:39.573Z
+ 2025-04-09T06:53:29.054Z
https://axolotl-ai-cloud.github.io/axolotl/docs/reward_modelling.html
- 2025-04-07T21:12:39.573Z
+ 2025-04-09T06:53:29.054Z
https://axolotl-ai-cloud.github.io/axolotl/docs/input_output.html
- 2025-04-07T21:12:39.572Z
+ 2025-04-09T06:53:29.053Z
https://axolotl-ai-cloud.github.io/axolotl/docs/multimodal.html
- 2025-04-07T21:12:39.573Z
+ 2025-04-09T06:53:29.054Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.callbacks.mlflow_.html
- 2025-04-07T21:13:08.528Z
+ 2025-04-09T06:54:15.412Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/monkeypatch.trainer_fsdp_optim.html
- 2025-04-07T21:13:08.128Z
+ 2025-04-09T06:54:15.007Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/monkeypatch.data.batch_dataset_fetcher.html
- 2025-04-07T21:13:08.144Z
+ 2025-04-09T06:54:15.023Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/prompt_strategies.stepwise_supervised.html
- 2025-04-07T21:13:07.839Z
+ 2025-04-09T06:54:14.713Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/monkeypatch.mistral_attn_hijack_flash.html
- 2025-04-07T21:13:08.077Z
+ 2025-04-09T06:54:14.953Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/prompt_strategies.dpo.user_defined.html
- 2025-04-07T21:13:07.885Z
+ 2025-04-09T06:54:14.759Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/integrations.liger.args.html
- 2025-04-07T21:13:08.446Z
+ 2025-04-09T06:54:15.329Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.schemas.training.html
- 2025-04-07T21:13:08.311Z
+ 2025-04-09T06:54:15.193Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/datasets.html
- 2025-04-07T21:13:07.344Z
+ 2025-04-09T06:54:14.214Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/kernels.geglu.html
- 2025-04-07T21:13:08.018Z
+ 2025-04-09T06:54:14.894Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/monkeypatch.llama_attn_hijack_flash.html
- 2025-04-07T21:13:08.061Z
+ 2025-04-09T06:54:14.938Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/cli.sweeps.html
- 2025-04-07T21:13:07.675Z
+ 2025-04-09T06:54:14.548Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.freeze.html
- 2025-04-07T21:13:08.214Z
+ 2025-04-09T06:54:15.095Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/monkeypatch.multipack.html
- 2025-04-07T21:13:08.078Z
+ 2025-04-09T06:54:14.955Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/cli.main.html
- 2025-04-07T21:13:07.572Z
+ 2025-04-09T06:54:14.444Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/core.trainers.trl.html
- 2025-04-07T21:13:07.750Z
+ 2025-04-09T06:54:14.623Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/prompt_strategies.dpo.passthrough.html
- 2025-04-07T21:13:07.886Z
+ 2025-04-09T06:54:14.761Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/core.chat.format.llama3x.html
- 2025-04-07T21:13:07.527Z
+ 2025-04-09T06:54:14.399Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/core.datasets.transforms.chat_builder.html
- 2025-04-07T21:13:07.541Z
+ 2025-04-09T06:54:14.414Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/prompt_strategies.kto.user_defined.html
- 2025-04-07T21:13:07.904Z
+ 2025-04-09T06:54:14.778Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.collators.mamba.html
- 2025-04-07T21:13:08.500Z
+ 2025-04-09T06:54:15.385Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/integrations.base.html
- 2025-04-07T21:13:08.431Z
+ 2025-04-09T06:54:15.314Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.bench.html
- 2025-04-07T21:13:08.207Z
+ 2025-04-09T06:54:15.087Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/kernels.swiglu.html
- 2025-04-07T21:13:08.027Z
+ 2025-04-09T06:54:14.904Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/core.chat.format.shared.html
- 2025-04-07T21:13:07.528Z
+ 2025-04-09T06:54:14.401Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/integrations.cut_cross_entropy.args.html
- 2025-04-07T21:13:08.434Z
+ 2025-04-09T06:54:15.318Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/core.datasets.chat.html
- 2025-04-07T21:13:07.534Z
+ 2025-04-09T06:54:14.406Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.callbacks.lisa.html
- 2025-04-07T21:13:08.524Z
+ 2025-04-09T06:54:15.409Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/integrations.grokfast.optimizer.html
- 2025-04-07T21:13:08.435Z
+ 2025-04-09T06:54:15.319Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/prompt_strategies.alpaca_chat.html
- 2025-04-07T21:13:07.789Z
+ 2025-04-09T06:54:14.663Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/prompt_strategies.alpaca_instruct.html
- 2025-04-07T21:13:07.791Z
+ 2025-04-09T06:54:14.664Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/prompt_strategies.kto.chatml.html
- 2025-04-07T21:13:07.902Z
+ 2025-04-09T06:54:14.777Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.schemas.integrations.html
- 2025-04-07T21:13:08.357Z
+ 2025-04-09T06:54:15.239Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.schemas.trl.html
- 2025-04-07T21:13:08.339Z
+ 2025-04-09T06:54:15.222Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/prompt_tokenizers.html
- 2025-04-07T21:13:07.398Z
+ 2025-04-09T06:54:14.269Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.data.sft.html
- 2025-04-07T21:13:08.288Z
+ 2025-04-09T06:54:15.170Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.schedulers.html
- 2025-04-07T21:13:08.255Z
+ 2025-04-09T06:54:15.136Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.chat_templates.html
- 2025-04-07T21:13:08.190Z
+ 2025-04-09T06:54:15.070Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.models.html
- 2025-04-07T21:13:08.174Z
+ 2025-04-09T06:54:15.053Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/prompt_strategies.dpo.chatml.html
- 2025-04-07T21:13:07.882Z
+ 2025-04-09T06:54:14.757Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.distributed.html
- 2025-04-07T21:13:08.275Z
+ 2025-04-09T06:54:15.157Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/monkeypatch.utils.html
- 2025-04-07T21:13:08.116Z
+ 2025-04-09T06:54:14.996Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.schemas.utils.html
- 2025-04-07T21:13:08.369Z
+ 2025-04-09T06:54:15.252Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/monkeypatch.llama_expand_mask.html
- 2025-04-07T21:13:08.086Z
+ 2025-04-09T06:54:14.963Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/common.datasets.html
- 2025-04-07T21:13:08.471Z
+ 2025-04-09T06:54:15.355Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/logging_config.html
- 2025-04-07T21:13:07.403Z
+ 2025-04-09T06:54:14.274Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/kernels.quantize.html
- 2025-04-07T21:13:08.035Z
+ 2025-04-09T06:54:14.911Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/monkeypatch.llama_patch_multipack.html
- 2025-04-07T21:13:08.119Z
+ 2025-04-09T06:54:14.999Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.schemas.model.html
- 2025-04-07T21:13:08.306Z
+ 2025-04-09T06:54:15.188Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/monkeypatch.stablelm_attn_hijack_flash.html
- 2025-04-07T21:13:08.125Z
+ 2025-04-09T06:54:15.004Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/monkeypatch.mixtral.html
- 2025-04-07T21:13:08.145Z
+ 2025-04-09T06:54:15.025Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.tokenization.html
- 2025-04-07T21:13:08.180Z
+ 2025-04-09T06:54:15.060Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/integrations.kd.trainer.html
- 2025-04-07T21:13:08.442Z
+ 2025-04-09T06:54:15.326Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.schemas.datasets.html
- 2025-04-07T21:13:08.328Z
+ 2025-04-09T06:54:15.211Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.collators.core.html
- 2025-04-07T21:13:08.473Z
+ 2025-04-09T06:54:15.358Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/monkeypatch.btlm_attn_hijack_flash.html
- 2025-04-07T21:13:08.118Z
+ 2025-04-09T06:54:14.997Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.optimizers.adopt.html
- 2025-04-07T21:13:08.285Z
+ 2025-04-09T06:54:15.167Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/prompt_strategies.input_output.html
- 2025-04-07T21:13:07.835Z
+ 2025-04-09T06:54:14.709Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/index.html
- 2025-04-07T21:13:07.267Z
+ 2025-04-09T06:54:14.136Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/cli.cloud.modal_.html
- 2025-04-07T21:13:07.720Z
+ 2025-04-09T06:54:14.592Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/prompt_strategies.dpo.llama3.html
- 2025-04-07T21:13:07.872Z
+ 2025-04-09T06:54:14.747Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/cli.train.html
- 2025-04-07T21:13:07.580Z
+ 2025-04-09T06:54:14.452Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/core.trainer_builder.html
- 2025-04-07T21:13:07.418Z
+ 2025-04-09T06:54:14.289Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.callbacks.perplexity.html
- 2025-04-07T21:13:08.519Z
+ 2025-04-09T06:54:15.404Z
https://axolotl-ai-cloud.github.io/axolotl/docs/getting-started.html
- 2025-04-07T21:12:39.570Z
+ 2025-04-09T06:53:29.051Z
https://axolotl-ai-cloud.github.io/axolotl/docs/dataset_loading.html
- 2025-04-07T21:12:39.569Z
+ 2025-04-09T06:53:29.050Z
https://axolotl-ai-cloud.github.io/axolotl/docs/batch_vs_grad.html
- 2025-04-07T21:12:39.569Z
+ 2025-04-09T06:53:29.050Z
https://axolotl-ai-cloud.github.io/axolotl/docs/faq.html
- 2025-04-07T21:12:39.570Z
+ 2025-04-09T06:53:29.051Z
https://axolotl-ai-cloud.github.io/axolotl/docs/debugging.html
- 2025-04-07T21:12:39.570Z
+ 2025-04-09T06:53:29.051Z
https://axolotl-ai-cloud.github.io/axolotl/docs/lr_groups.html
- 2025-04-07T21:12:39.573Z
+ 2025-04-09T06:53:29.053Z
https://axolotl-ai-cloud.github.io/axolotl/TODO.html
- 2025-04-07T21:12:39.568Z
+ 2025-04-09T06:53:29.049Z
https://axolotl-ai-cloud.github.io/axolotl/src/axolotl/integrations/LICENSE.html
- 2025-04-07T21:12:39.588Z
+ 2025-04-09T06:53:29.069Z
https://axolotl-ai-cloud.github.io/axolotl/index.html
- 2025-04-07T21:12:39.585Z
+ 2025-04-09T06:53:29.066Z
https://axolotl-ai-cloud.github.io/axolotl/src/axolotl/integrations/cut_cross_entropy/ACKNOWLEDGEMENTS.html
- 2025-04-07T21:12:39.589Z
+ 2025-04-09T06:53:29.070Z
https://axolotl-ai-cloud.github.io/axolotl/FAQS.html
- 2025-04-07T21:12:39.568Z
+ 2025-04-09T06:53:29.048Z
https://axolotl-ai-cloud.github.io/axolotl/docs/multi-node.html
- 2025-04-07T21:12:39.573Z
+ 2025-04-09T06:53:29.054Z
https://axolotl-ai-cloud.github.io/axolotl/docs/sequence_parallelism.html
- 2025-04-07T21:12:39.573Z
+ 2025-04-09T06:53:29.054Z
https://axolotl-ai-cloud.github.io/axolotl/docs/multipack.html
- 2025-04-07T21:12:39.573Z
+ 2025-04-09T06:53:29.054Z
https://axolotl-ai-cloud.github.io/axolotl/docs/inference.html
- 2025-04-07T21:12:39.572Z
+ 2025-04-09T06:53:29.053Z
https://axolotl-ai-cloud.github.io/axolotl/docs/lora_optims.html
- 2025-04-07T21:12:39.572Z
+ 2025-04-09T06:53:29.053Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.lora_embeddings.html
- 2025-04-07T21:13:08.198Z
+ 2025-04-09T06:54:15.078Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/kernels.utils.html
- 2025-04-07T21:13:08.036Z
+ 2025-04-09T06:54:14.913Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/prompt_strategies.chat_template.html
- 2025-04-07T21:13:07.776Z
+ 2025-04-09T06:54:14.649Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/convert.html
- 2025-04-07T21:13:07.358Z
+ 2025-04-09T06:54:14.228Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/common.const.html
- 2025-04-07T21:13:08.455Z
+ 2025-04-09T06:54:15.339Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/cli.cloud.base.html
- 2025-04-07T21:13:07.714Z
+ 2025-04-09T06:54:14.586Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/monkeypatch.relora.html
- 2025-04-07T21:13:08.085Z
+ 2025-04-09T06:54:14.962Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.lora.html
- 2025-04-07T21:13:08.194Z
+ 2025-04-09T06:54:15.075Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/cli.merge_lora.html
- 2025-04-07T21:13:07.650Z
+ 2025-04-09T06:54:14.522Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/prompt_strategies.bradley_terry.llama3.html
- 2025-04-07T21:13:07.927Z
+ 2025-04-09T06:54:14.802Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/cli.merge_sharded_fsdp_weights.html
- 2025-04-07T21:13:07.661Z
+ 2025-04-09T06:54:14.534Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/integrations.spectrum.args.html
- 2025-04-07T21:13:08.452Z
+ 2025-04-09T06:54:15.336Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/models.mamba.modeling_mamba.html
- 2025-04-07T21:13:08.472Z
+ 2025-04-09T06:54:15.356Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/common.architectures.html
- 2025-04-07T21:13:08.453Z
+ 2025-04-09T06:54:15.337Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.trainer.html
- 2025-04-07T21:13:08.231Z
+ 2025-04-09T06:54:15.111Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.callbacks.comet_.html
- 2025-04-07T21:13:08.531Z
+ 2025-04-09T06:54:15.416Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/cli.vllm_serve.html
- 2025-04-07T21:13:07.710Z
+ 2025-04-09T06:54:14.583Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.schemas.multimodal.html
- 2025-04-07T21:13:08.344Z
+ 2025-04-09T06:54:15.228Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.gradient_checkpointing.unsloth.html
- 2025-04-07T21:13:08.292Z
+ 2025-04-09T06:54:15.174Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/core.trainers.base.html
- 2025-04-07T21:13:07.733Z
+ 2025-04-09T06:54:14.606Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/monkeypatch.unsloth_.html
- 2025-04-07T21:13:08.135Z
+ 2025-04-09T06:54:15.015Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.samplers.multipack.html
- 2025-04-07T21:13:08.513Z
+ 2025-04-09T06:54:15.397Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.callbacks.profiler.html
- 2025-04-07T21:13:08.523Z
+ 2025-04-09T06:54:15.407Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/integrations.lm_eval.args.html
- 2025-04-07T21:13:08.449Z
+ 2025-04-09T06:54:15.333Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.data.pretraining.html
- 2025-04-07T21:13:08.287Z
+ 2025-04-09T06:54:15.169Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/evaluate.html
- 2025-04-07T21:13:07.337Z
+ 2025-04-09T06:54:14.207Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.dict.html
- 2025-04-07T21:13:08.278Z
+ 2025-04-09T06:54:15.160Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/cli.utils.html
- 2025-04-07T21:13:07.706Z
+ 2025-04-09T06:54:14.578Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/prompt_strategies.pygmalion.html
- 2025-04-07T21:13:07.856Z
+ 2025-04-09T06:54:14.731Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/core.training_args.html
- 2025-04-07T21:13:07.502Z
+ 2025-04-09T06:54:14.374Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/cli.inference.html
- 2025-04-07T21:13:07.641Z
+ 2025-04-09T06:54:14.514Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/kernels.lora.html
- 2025-04-07T21:13:08.007Z
+ 2025-04-09T06:54:14.883Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/cli.evaluate.html
- 2025-04-07T21:13:07.588Z
+ 2025-04-09T06:54:14.460Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.collators.batching.html
- 2025-04-07T21:13:08.497Z
+ 2025-04-09T06:54:15.381Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/prompt_strategies.completion.html
- 2025-04-07T21:13:07.829Z
+ 2025-04-09T06:54:14.703Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/prompt_strategies.dpo.zephyr.html
- 2025-04-07T21:13:07.883Z
+ 2025-04-09T06:54:14.758Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/prompt_strategies.metharme.html
- 2025-04-07T21:13:07.846Z
+ 2025-04-09T06:54:14.720Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/prompt_strategies.orpo.chat_template.html
- 2025-04-07T21:13:07.923Z
+ 2025-04-09T06:54:14.799Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/prompt_strategies.alpaca_w_system.html
- 2025-04-07T21:13:07.803Z
+ 2025-04-09T06:54:14.676Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.model_shard_quant.html
- 2025-04-07T21:13:08.203Z
+ 2025-04-09T06:54:15.083Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/cli.config.html
- 2025-04-07T21:13:07.628Z
+ 2025-04-09T06:54:14.500Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.schemas.enums.html
- 2025-04-07T21:13:08.363Z
+ 2025-04-09T06:54:15.246Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/cli.preprocess.html
- 2025-04-07T21:13:07.669Z
+ 2025-04-09T06:54:14.542Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/core.chat.messages.html
- 2025-04-07T21:13:07.524Z
+ 2025-04-09T06:54:14.396Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/prompt_strategies.dpo.chat_template.html
- 2025-04-07T21:13:07.862Z
+ 2025-04-09T06:54:14.736Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.schemas.peft.html
- 2025-04-07T21:13:08.336Z
+ 2025-04-09T06:54:15.219Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/train.html
- 2025-04-07T21:13:07.327Z
+ 2025-04-09T06:54:14.197Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/prompt_strategies.messages.chat.html
- 2025-04-07T21:13:07.860Z
+ 2025-04-09T06:54:14.735Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/prompt_strategies.orcamini.html
- 2025-04-07T21:13:07.850Z
+ 2025-04-09T06:54:14.724Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.collators.mm_chat.html
- 2025-04-07T21:13:08.505Z
+ 2025-04-09T06:54:15.389Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/prompt_strategies.kto.llama3.html
- 2025-04-07T21:13:07.894Z
+ 2025-04-09T06:54:14.769Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/monkeypatch.attention.mllama.html
- 2025-04-07T21:13:08.142Z
+ 2025-04-09T06:54:15.022Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/cli.checks.html
- 2025-04-07T21:13:07.611Z
+ 2025-04-09T06:54:14.483Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/monkeypatch.transformers_fa_utils.html
- 2025-04-07T21:13:08.134Z
+ 2025-04-09T06:54:15.014Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/monkeypatch.llama_attn_hijack_xformers.html
- 2025-04-07T21:13:08.062Z
+ 2025-04-09T06:54:14.939Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/core.trainers.dpo.trainer.html
- 2025-04-07T21:13:07.757Z
+ 2025-04-09T06:54:14.630Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/prompt_strategies.user_defined.html
- 2025-04-07T21:13:07.811Z
+ 2025-04-09T06:54:14.684Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/cli.args.html
- 2025-04-07T21:13:07.604Z
+ 2025-04-09T06:54:14.477Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/prompt_strategies.llama2_chat.html
- 2025-04-07T21:13:07.823Z
+ 2025-04-09T06:54:14.697Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/utils.schemas.config.html
- 2025-04-07T21:13:08.299Z
+ 2025-04-09T06:54:15.182Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/core.trainers.grpo.trainer.html
- 2025-04-07T21:13:07.760Z
+ 2025-04-09T06:54:14.633Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/core.chat.format.chatml.html
- 2025-04-07T21:13:07.526Z
+ 2025-04-09T06:54:14.398Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/monkeypatch.lora_kernels.html
- 2025-04-07T21:13:08.108Z
+ 2025-04-09T06:54:14.988Z
https://axolotl-ai-cloud.github.io/axolotl/docs/api/prompt_strategies.base.html
- 2025-04-07T21:13:07.761Z
+ 2025-04-09T06:54:14.634Z
https://axolotl-ai-cloud.github.io/axolotl/docs/rlhf.html
- 2025-04-07T21:12:39.573Z
+ 2025-04-09T06:53:29.054Z
https://axolotl-ai-cloud.github.io/axolotl/docs/cli.html
- 2025-04-07T21:12:39.569Z
+ 2025-04-09T06:53:29.050Z
https://axolotl-ai-cloud.github.io/axolotl/docs/unsloth.html
- 2025-04-07T21:12:39.573Z
+ 2025-04-09T06:53:29.054Z
https://axolotl-ai-cloud.github.io/axolotl/docs/fsdp_qlora.html
- 2025-04-07T21:12:39.570Z
+ 2025-04-09T06:53:29.051Z
https://axolotl-ai-cloud.github.io/axolotl/docs/dataset_preprocessing.html
- 2025-04-07T21:12:39.570Z
+ 2025-04-09T06:53:29.050Z
https://axolotl-ai-cloud.github.io/axolotl/docs/custom_integrations.html
- 2025-04-07T21:12:39.569Z
+ 2025-04-09T06:53:29.050Z
https://axolotl-ai-cloud.github.io/axolotl/docs/mac.html
- 2025-04-07T21:12:39.573Z
+ 2025-04-09T06:53:29.054Z
https://axolotl-ai-cloud.github.io/axolotl/docs/docker.html
- 2025-04-07T21:12:39.570Z
+ 2025-04-09T06:53:29.051Z
https://axolotl-ai-cloud.github.io/axolotl/docs/ray-integration.html
- 2025-04-07T21:12:39.573Z
+ 2025-04-09T06:53:29.054Z
https://axolotl-ai-cloud.github.io/axolotl/docs/dataset-formats/index.html
- 2025-04-07T21:12:39.569Z
+ 2025-04-09T06:53:29.050Z
https://axolotl-ai-cloud.github.io/axolotl/docs/dataset-formats/conversation.html
- 2025-04-07T21:12:39.569Z
+ 2025-04-09T06:53:29.050Z
https://axolotl-ai-cloud.github.io/axolotl/docs/dataset-formats/pretraining.html
- 2025-04-07T21:12:39.569Z
+ 2025-04-09T06:53:29.050Z
https://axolotl-ai-cloud.github.io/axolotl/docs/dataset-formats/inst_tune.html
- 2025-04-07T21:12:39.569Z
+ 2025-04-09T06:53:29.050Z