diff --git a/.nojekyll b/.nojekyll index 2ae66d1f5..7bb6da628 100644 --- a/.nojekyll +++ b/.nojekyll @@ -1 +1 @@ -488ee49b \ No newline at end of file +fd501483 \ No newline at end of file diff --git a/docs/api/loaders.model.html b/docs/api/loaders.model.html index c8a872fe9..58a3f26a4 100644 --- a/docs/api/loaders.model.html +++ b/docs/api/loaders.model.html @@ -605,9 +605,9 @@ adapter integration, and various optimizations.

Returns
---+++ @@ -619,7 +619,7 @@ adapter integration, and various optimizations.

- + diff --git a/search.json b/search.json index b3371e9dd..92fbe8ac9 100644 --- a/search.json +++ b/search.json @@ -2626,14 +2626,14 @@ "href": "docs/api/loaders.model.html", "title": "loaders.model", "section": "", - "text": "loaders.model\nModel loader class implementation for loading, configuring, and patching various\nmodels.\n\n\n\n\n\nName\nDescription\n\n\n\n\nModelLoader\nManages model configuration, initialization and application of patches during\n\n\n\n\n\nloaders.model.ModelLoader(\n self,\n cfg,\n tokenizer,\n *,\n inference=False,\n reference_model=False,\n **kwargs,\n)\nManages model configuration, initialization and application of patches during\nmodel loading.\nThis class orchestrates the entire process of loading a model from configuration to\nfinal preparation. It handles device mapping, quantization, attention mechanisms,\nadapter integration, and various optimizations.\n\n\n\nLoading and validating model configuration\nApplying monkey patches for optimizations / fixes\nSetting up device mapping (including multi-GPU configurations)\nConfiguring quantization\nSetting attention mechanisms (Flash Attention, SDPA, etc.)\nLoading and initializing the model\nApplying adapters (LoRA, QLoRA, etc.)\n\n\n\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\n\n\n\n\nmodel\nPreTrainedModel | PeftModel | PeftMixedModel\nThe loaded model instance (available after load() is called).\n\n\nmodel_kwargs\ndict[str, Any]\nDictionary of keyword arguments passed to model initialization.\n\n\nbase_model\n\nName or path of the base model to load.\n\n\nmodel_type\n\nType of model to load (e.g., AutoModelForCausalLM).\n\n\nmodel_config\n\nConfiguration object for the model.\n\n\nauto_model_loader\n\nclass used for loading the model (default: AutoModelForCausalLM).\n\n\n\n\n\n\n\n\n\nName\nDescription\n\n\n\n\nload\nLoad and prepare the model with all configurations and patches.\n\n\n\n\n\nloaders.model.ModelLoader.load()\nLoad and prepare the model with all configurations and patches.\n\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\n\n\n\n\n\ntuple[PreTrainedModel, PeftConfig | None]\nA tuple with the loaded model and its LoRA configuration (if applicable)." + "text": "loaders.model\nModel loader class implementation for loading, configuring, and patching various\nmodels.\n\n\n\n\n\nName\nDescription\n\n\n\n\nModelLoader\nManages model configuration, initialization and application of patches during\n\n\n\n\n\nloaders.model.ModelLoader(\n self,\n cfg,\n tokenizer,\n *,\n inference=False,\n reference_model=False,\n **kwargs,\n)\nManages model configuration, initialization and application of patches during\nmodel loading.\nThis class orchestrates the entire process of loading a model from configuration to\nfinal preparation. It handles device mapping, quantization, attention mechanisms,\nadapter integration, and various optimizations.\n\n\n\nLoading and validating model configuration\nApplying monkey patches for optimizations / fixes\nSetting up device mapping (including multi-GPU configurations)\nConfiguring quantization\nSetting attention mechanisms (Flash Attention, SDPA, etc.)\nLoading and initializing the model\nApplying adapters (LoRA, QLoRA, etc.)\n\n\n\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\n\n\n\n\nmodel\nPreTrainedModel | PeftModel | PeftMixedModel\nThe loaded model instance (available after load() is called).\n\n\nmodel_kwargs\ndict[str, Any]\nDictionary of keyword arguments passed to model initialization.\n\n\nbase_model\n\nName or path of the base model to load.\n\n\nmodel_type\n\nType of model to load (e.g., AutoModelForCausalLM).\n\n\nmodel_config\n\nConfiguration object for the model.\n\n\nauto_model_loader\n\nclass used for loading the model (default: AutoModelForCausalLM).\n\n\n\n\n\n\n\n\n\nName\nDescription\n\n\n\n\nload\nLoad and prepare the model with all configurations and patches.\n\n\n\n\n\nloaders.model.ModelLoader.load()\nLoad and prepare the model with all configurations and patches.\n\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\n\n\n\n\n\ntuple[PreTrainedModel | PeftModelForCausalLM, PeftConfig | None]\nA tuple with the loaded model and its LoRA configuration (if applicable)." }, { "objectID": "docs/api/loaders.model.html#classes", "href": "docs/api/loaders.model.html#classes", "title": "loaders.model", "section": "", - "text": "Name\nDescription\n\n\n\n\nModelLoader\nManages model configuration, initialization and application of patches during\n\n\n\n\n\nloaders.model.ModelLoader(\n self,\n cfg,\n tokenizer,\n *,\n inference=False,\n reference_model=False,\n **kwargs,\n)\nManages model configuration, initialization and application of patches during\nmodel loading.\nThis class orchestrates the entire process of loading a model from configuration to\nfinal preparation. It handles device mapping, quantization, attention mechanisms,\nadapter integration, and various optimizations.\n\n\n\nLoading and validating model configuration\nApplying monkey patches for optimizations / fixes\nSetting up device mapping (including multi-GPU configurations)\nConfiguring quantization\nSetting attention mechanisms (Flash Attention, SDPA, etc.)\nLoading and initializing the model\nApplying adapters (LoRA, QLoRA, etc.)\n\n\n\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\n\n\n\n\nmodel\nPreTrainedModel | PeftModel | PeftMixedModel\nThe loaded model instance (available after load() is called).\n\n\nmodel_kwargs\ndict[str, Any]\nDictionary of keyword arguments passed to model initialization.\n\n\nbase_model\n\nName or path of the base model to load.\n\n\nmodel_type\n\nType of model to load (e.g., AutoModelForCausalLM).\n\n\nmodel_config\n\nConfiguration object for the model.\n\n\nauto_model_loader\n\nclass used for loading the model (default: AutoModelForCausalLM).\n\n\n\n\n\n\n\n\n\nName\nDescription\n\n\n\n\nload\nLoad and prepare the model with all configurations and patches.\n\n\n\n\n\nloaders.model.ModelLoader.load()\nLoad and prepare the model with all configurations and patches.\n\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\n\n\n\n\n\ntuple[PreTrainedModel, PeftConfig | None]\nA tuple with the loaded model and its LoRA configuration (if applicable)." + "text": "Name\nDescription\n\n\n\n\nModelLoader\nManages model configuration, initialization and application of patches during\n\n\n\n\n\nloaders.model.ModelLoader(\n self,\n cfg,\n tokenizer,\n *,\n inference=False,\n reference_model=False,\n **kwargs,\n)\nManages model configuration, initialization and application of patches during\nmodel loading.\nThis class orchestrates the entire process of loading a model from configuration to\nfinal preparation. It handles device mapping, quantization, attention mechanisms,\nadapter integration, and various optimizations.\n\n\n\nLoading and validating model configuration\nApplying monkey patches for optimizations / fixes\nSetting up device mapping (including multi-GPU configurations)\nConfiguring quantization\nSetting attention mechanisms (Flash Attention, SDPA, etc.)\nLoading and initializing the model\nApplying adapters (LoRA, QLoRA, etc.)\n\n\n\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\n\n\n\n\nmodel\nPreTrainedModel | PeftModel | PeftMixedModel\nThe loaded model instance (available after load() is called).\n\n\nmodel_kwargs\ndict[str, Any]\nDictionary of keyword arguments passed to model initialization.\n\n\nbase_model\n\nName or path of the base model to load.\n\n\nmodel_type\n\nType of model to load (e.g., AutoModelForCausalLM).\n\n\nmodel_config\n\nConfiguration object for the model.\n\n\nauto_model_loader\n\nclass used for loading the model (default: AutoModelForCausalLM).\n\n\n\n\n\n\n\n\n\nName\nDescription\n\n\n\n\nload\nLoad and prepare the model with all configurations and patches.\n\n\n\n\n\nloaders.model.ModelLoader.load()\nLoad and prepare the model with all configurations and patches.\n\n\n\n\n\n\n\n\n\n\nName\nType\nDescription\n\n\n\n\n\ntuple[PreTrainedModel | PeftModelForCausalLM, PeftConfig | None]\nA tuple with the loaded model and its LoRA configuration (if applicable)." }, { "objectID": "docs/api/utils.bench.html", diff --git a/sitemap.xml b/sitemap.xml index 399670ef3..1c043ba6b 100644 --- a/sitemap.xml +++ b/sitemap.xml @@ -2,754 +2,754 @@ https://docs.axolotl.ai/TODO.html - 2025-05-28T13:57:40.151Z + 2025-05-28T14:03:55.090Z https://docs.axolotl.ai/docs/debugging.html - 2025-05-28T13:57:40.154Z + 2025-05-28T14:03:55.092Z https://docs.axolotl.ai/docs/rlhf.html - 2025-05-28T13:57:40.157Z + 2025-05-28T14:03:55.095Z https://docs.axolotl.ai/docs/input_output.html - 2025-05-28T13:57:40.156Z + 2025-05-28T14:03:55.095Z https://docs.axolotl.ai/docs/lr_groups.html - 2025-05-28T13:57:40.157Z + 2025-05-28T14:03:55.095Z https://docs.axolotl.ai/docs/dataset-formats/template_free.html - 2025-05-28T13:57:40.153Z + 2025-05-28T14:03:55.092Z https://docs.axolotl.ai/docs/dataset-formats/pretraining.html - 2025-05-28T13:57:40.153Z + 2025-05-28T14:03:55.091Z https://docs.axolotl.ai/docs/dataset-formats/conversation.html - 2025-05-28T13:57:40.153Z + 2025-05-28T14:03:55.091Z https://docs.axolotl.ai/docs/dataset-formats/inst_tune.html - 2025-05-28T13:57:40.153Z + 2025-05-28T14:03:55.091Z https://docs.axolotl.ai/docs/fsdp_qlora.html - 2025-05-28T13:57:40.154Z + 2025-05-28T14:03:55.092Z https://docs.axolotl.ai/docs/torchao.html - 2025-05-28T13:57:40.157Z + 2025-05-28T14:03:55.095Z https://docs.axolotl.ai/docs/dataset_preprocessing.html - 2025-05-28T13:57:40.153Z + 2025-05-28T14:03:55.092Z https://docs.axolotl.ai/docs/batch_vs_grad.html - 2025-05-28T13:57:40.153Z + 2025-05-28T14:03:55.091Z https://docs.axolotl.ai/docs/nccl.html - 2025-05-28T13:57:40.157Z + 2025-05-28T14:03:55.095Z https://docs.axolotl.ai/docs/lora_optims.html - 2025-05-28T13:57:40.157Z + 2025-05-28T14:03:55.095Z https://docs.axolotl.ai/docs/multipack.html - 2025-05-28T13:57:40.157Z + 2025-05-28T14:03:55.095Z https://docs.axolotl.ai/docs/faq.html - 2025-05-28T13:57:40.154Z + 2025-05-28T14:03:55.092Z https://docs.axolotl.ai/docs/quantize.html - 2025-05-28T13:57:40.157Z + 2025-05-28T14:03:55.095Z https://docs.axolotl.ai/docs/sequence_parallelism.html - 2025-05-28T13:57:40.157Z + 2025-05-28T14:03:55.095Z https://docs.axolotl.ai/docs/multi-gpu.html - 2025-05-28T13:57:40.157Z + 2025-05-28T14:03:55.095Z https://docs.axolotl.ai/docs/api/utils.collators.core.html - 2025-05-28T13:58:10.754Z + 2025-05-28T14:04:28.116Z https://docs.axolotl.ai/docs/api/utils.collators.mm_chat.html - 2025-05-28T13:58:10.782Z + 2025-05-28T14:04:28.144Z https://docs.axolotl.ai/docs/api/prompt_strategies.pygmalion.html - 2025-05-28T13:58:10.042Z + 2025-05-28T14:04:27.427Z https://docs.axolotl.ai/docs/api/cli.cloud.base.html - 2025-05-28T13:58:09.763Z + 2025-05-28T14:04:27.177Z https://docs.axolotl.ai/docs/api/common.architectures.html - 2025-05-28T13:58:10.733Z + 2025-05-28T14:04:28.095Z https://docs.axolotl.ai/docs/api/utils.optimizers.adopt.html - 2025-05-28T13:58:10.483Z + 2025-05-28T14:04:27.851Z https://docs.axolotl.ai/docs/api/utils.callbacks.lisa.html - 2025-05-28T13:58:10.834Z + 2025-05-28T14:04:28.195Z https://docs.axolotl.ai/docs/api/monkeypatch.gradient_checkpointing.offload_cpu.html - 2025-05-28T13:58:10.346Z + 2025-05-28T14:04:27.719Z https://docs.axolotl.ai/docs/api/integrations.lm_eval.args.html - 2025-05-28T13:58:10.728Z + 2025-05-28T14:04:28.090Z https://docs.axolotl.ai/docs/api/utils.model_shard_quant.html - 2025-05-28T13:58:10.399Z + 2025-05-28T14:04:27.771Z https://docs.axolotl.ai/docs/api/utils.schemas.utils.html - 2025-05-28T13:58:10.590Z + 2025-05-28T14:04:27.955Z https://docs.axolotl.ai/docs/api/core.trainers.trl.html - 2025-05-28T13:58:09.806Z + 2025-05-28T14:04:27.219Z https://docs.axolotl.ai/docs/api/utils.collators.batching.html - 2025-05-28T13:58:10.773Z + 2025-05-28T14:04:28.135Z https://docs.axolotl.ai/docs/api/prompt_strategies.alpaca_w_system.html - 2025-05-28T13:58:09.986Z + 2025-05-28T14:04:27.373Z https://docs.axolotl.ai/docs/api/integrations.base.html - 2025-05-28T13:58:10.709Z + 2025-05-28T14:04:28.072Z https://docs.axolotl.ai/docs/api/core.chat.messages.html - 2025-05-28T13:58:09.565Z + 2025-05-28T14:04:26.982Z https://docs.axolotl.ai/docs/api/kernels.lora.html - 2025-05-28T13:58:10.198Z + 2025-05-28T14:04:27.576Z https://docs.axolotl.ai/docs/api/utils.callbacks.perplexity.html - 2025-05-28T13:58:10.829Z + 2025-05-28T14:04:28.190Z https://docs.axolotl.ai/docs/api/utils.schemas.training.html - 2025-05-28T13:58:10.527Z + 2025-05-28T14:04:27.894Z https://docs.axolotl.ai/docs/api/prompt_strategies.dpo.user_defined.html - 2025-05-28T13:58:10.071Z + 2025-05-28T14:04:27.456Z https://docs.axolotl.ai/docs/api/utils.samplers.multipack.html - 2025-05-28T13:58:10.822Z + 2025-05-28T14:04:28.183Z https://docs.axolotl.ai/docs/api/utils.collators.mamba.html - 2025-05-28T13:58:10.777Z + 2025-05-28T14:04:28.139Z https://docs.axolotl.ai/docs/api/cli.merge_sharded_fsdp_weights.html - 2025-05-28T13:58:09.710Z + 2025-05-28T14:04:27.124Z https://docs.axolotl.ai/docs/api/utils.callbacks.profiler.html - 2025-05-28T13:58:10.832Z + 2025-05-28T14:04:28.193Z https://docs.axolotl.ai/docs/api/loaders.tokenizer.html - 2025-05-28T13:58:09.874Z + 2025-05-28T14:04:27.275Z https://docs.axolotl.ai/docs/api/core.datasets.transforms.chat_builder.html - 2025-05-28T13:58:09.582Z + 2025-05-28T14:04:26.999Z https://docs.axolotl.ai/docs/api/cli.vllm_serve.html - 2025-05-28T13:58:09.760Z + 2025-05-28T14:04:27.174Z https://docs.axolotl.ai/docs/api/cli.config.html - 2025-05-28T13:58:09.675Z + 2025-05-28T14:04:27.090Z https://docs.axolotl.ai/docs/api/utils.schemas.trl.html - 2025-05-28T13:58:10.557Z + 2025-05-28T14:04:27.923Z https://docs.axolotl.ai/docs/api/core.trainers.mamba.html - 2025-05-28T13:58:09.811Z + 2025-05-28T14:04:27.224Z https://docs.axolotl.ai/docs/api/integrations.grokfast.optimizer.html - 2025-05-28T13:58:10.713Z + 2025-05-28T14:04:28.077Z https://docs.axolotl.ai/docs/api/prompt_strategies.chat_template.html - 2025-05-28T13:58:09.959Z + 2025-05-28T14:04:27.347Z https://docs.axolotl.ai/docs/api/utils.schemas.multimodal.html - 2025-05-28T13:58:10.562Z + 2025-05-28T14:04:27.928Z https://docs.axolotl.ai/docs/api/monkeypatch.utils.html - 2025-05-28T13:58:10.313Z + 2025-05-28T14:04:27.687Z https://docs.axolotl.ai/docs/api/prompt_strategies.kto.user_defined.html - 2025-05-28T13:58:10.091Z + 2025-05-28T14:04:27.474Z https://docs.axolotl.ai/docs/api/core.trainers.mixins.rng_state_loader.html - 2025-05-28T13:58:09.910Z + 2025-05-28T14:04:27.298Z https://docs.axolotl.ai/docs/api/loaders.adapter.html - 2025-05-28T13:58:09.886Z + 2025-05-28T14:04:27.282Z https://docs.axolotl.ai/docs/api/integrations.liger.args.html - 2025-05-28T13:58:10.724Z + 2025-05-28T14:04:28.087Z https://docs.axolotl.ai/docs/api/utils.data.sft.html - 2025-05-28T13:58:10.486Z + 2025-05-28T14:04:27.854Z https://docs.axolotl.ai/docs/api/utils.freeze.html - 2025-05-28T13:58:10.410Z + 2025-05-28T14:04:27.782Z https://docs.axolotl.ai/docs/api/cli.preprocess.html - 2025-05-28T13:58:09.718Z + 2025-05-28T14:04:27.132Z https://docs.axolotl.ai/docs/api/index.html - 2025-05-28T13:58:09.298Z + 2025-05-28T14:04:26.722Z https://docs.axolotl.ai/docs/api/cli.main.html - 2025-05-28T13:58:09.614Z + 2025-05-28T14:04:27.030Z https://docs.axolotl.ai/docs/api/prompt_strategies.dpo.chatml.html - 2025-05-28T13:58:10.068Z + 2025-05-28T14:04:27.453Z https://docs.axolotl.ai/docs/api/core.trainer_builder.html - 2025-05-28T13:58:09.454Z + 2025-05-28T14:04:26.874Z https://docs.axolotl.ai/docs/api/train.html - 2025-05-28T13:58:09.359Z + 2025-05-28T14:04:26.782Z https://docs.axolotl.ai/docs/api/utils.callbacks.qat.html - 2025-05-28T13:58:10.848Z + 2025-05-28T14:04:28.209Z https://docs.axolotl.ai/docs/api/monkeypatch.stablelm_attn_hijack_flash.html - 2025-05-28T13:58:10.322Z + 2025-05-28T14:04:27.695Z https://docs.axolotl.ai/docs/api/utils.schemas.enums.html - 2025-05-28T13:58:10.584Z + 2025-05-28T14:04:27.950Z https://docs.axolotl.ai/docs/api/loaders.constants.html - 2025-05-28T13:58:09.901Z + 2025-05-28T14:04:27.292Z https://docs.axolotl.ai/docs/api/prompt_strategies.metharme.html - 2025-05-28T13:58:10.031Z + 2025-05-28T14:04:27.417Z https://docs.axolotl.ai/docs/api/kernels.swiglu.html - 2025-05-28T13:58:10.219Z + 2025-05-28T14:04:27.596Z https://docs.axolotl.ai/docs/api/core.trainers.base.html - 2025-05-28T13:58:09.789Z + 2025-05-28T14:04:27.202Z https://docs.axolotl.ai/docs/api/monkeypatch.relora.html - 2025-05-28T13:58:10.278Z + 2025-05-28T14:04:27.653Z https://docs.axolotl.ai/docs/api/cli.cloud.modal_.html - 2025-05-28T13:58:09.770Z + 2025-05-28T14:04:27.184Z https://docs.axolotl.ai/docs/api/core.training_args.html - 2025-05-28T13:58:09.542Z + 2025-05-28T14:04:26.959Z https://docs.axolotl.ai/docs/api/utils.callbacks.comet_.html - 2025-05-28T13:58:10.841Z + 2025-05-28T14:04:28.202Z https://docs.axolotl.ai/docs/api/core.chat.format.shared.html - 2025-05-28T13:58:09.569Z + 2025-05-28T14:04:26.986Z https://docs.axolotl.ai/docs/api/monkeypatch.lora_kernels.html - 2025-05-28T13:58:10.305Z + 2025-05-28T14:04:27.679Z https://docs.axolotl.ai/docs/api/cli.merge_lora.html - 2025-05-28T13:58:09.698Z + 2025-05-28T14:04:27.112Z https://docs.axolotl.ai/docs/api/utils.trainer.html - 2025-05-28T13:58:10.427Z + 2025-05-28T14:04:27.799Z https://docs.axolotl.ai/docs/api/utils.dict.html - 2025-05-28T13:58:10.475Z + 2025-05-28T14:04:27.844Z https://docs.axolotl.ai/docs/api/kernels.quantize.html - 2025-05-28T13:58:10.226Z + 2025-05-28T14:04:27.604Z https://docs.axolotl.ai/docs/api/core.trainers.utils.html - 2025-05-28T13:58:09.845Z + 2025-05-28T14:04:27.257Z https://docs.axolotl.ai/docs/api/monkeypatch.data.batch_dataset_fetcher.html - 2025-05-28T13:58:10.341Z + 2025-05-28T14:04:27.714Z https://docs.axolotl.ai/docs/api/utils.ctx_managers.sequence_parallel.html - 2025-05-28T13:58:09.939Z + 2025-05-28T14:04:27.327Z https://docs.axolotl.ai/docs/api/core.trainers.grpo.sampler.html - 2025-05-28T13:58:09.843Z + 2025-05-28T14:04:27.256Z https://docs.axolotl.ai/docs/api/prompt_strategies.user_defined.html - 2025-05-28T13:58:09.994Z + 2025-05-28T14:04:27.381Z https://docs.axolotl.ai/docs/api/monkeypatch.llama_attn_hijack_flash.html - 2025-05-28T13:58:10.253Z + 2025-05-28T14:04:27.629Z https://docs.axolotl.ai/docs/api/cli.train.html - 2025-05-28T13:58:09.622Z + 2025-05-28T14:04:27.038Z https://docs.axolotl.ai/docs/api/cli.quantize.html - 2025-05-28T13:58:09.775Z + 2025-05-28T14:04:27.188Z https://docs.axolotl.ai/docs/api/cli.utils.html - 2025-05-28T13:58:09.755Z + 2025-05-28T14:04:27.169Z https://docs.axolotl.ai/src/axolotl/integrations/LICENSE.html - 2025-05-28T13:57:40.174Z + 2025-05-28T14:03:55.111Z https://docs.axolotl.ai/FAQS.html - 2025-05-28T13:57:40.151Z + 2025-05-28T14:03:55.090Z https://docs.axolotl.ai/examples/colab-notebooks/colab-axolotl-example.html - 2025-05-28T13:57:40.158Z + 2025-05-28T14:03:55.096Z https://docs.axolotl.ai/index.html - 2025-05-28T13:57:40.170Z + 2025-05-28T14:03:55.108Z https://docs.axolotl.ai/src/axolotl/integrations/cut_cross_entropy/ACKNOWLEDGEMENTS.html - 2025-05-28T13:57:40.174Z + 2025-05-28T14:03:55.112Z https://docs.axolotl.ai/docs/api/datasets.html - 2025-05-28T13:58:09.377Z + 2025-05-28T14:04:26.799Z https://docs.axolotl.ai/docs/api/core.trainers.mixins.optimizer.html - 2025-05-28T13:58:09.906Z + 2025-05-28T14:04:27.295Z https://docs.axolotl.ai/docs/api/prompt_strategies.orpo.chat_template.html - 2025-05-28T13:58:10.111Z + 2025-05-28T14:04:27.494Z https://docs.axolotl.ai/docs/api/prompt_strategies.completion.html - 2025-05-28T13:58:10.013Z + 2025-05-28T14:04:27.400Z https://docs.axolotl.ai/docs/api/evaluate.html - 2025-05-28T13:58:09.369Z + 2025-05-28T14:04:26.792Z https://docs.axolotl.ai/docs/api/loaders.processor.html - 2025-05-28T13:58:09.876Z + 2025-05-28T14:04:27.277Z https://docs.axolotl.ai/docs/api/common.const.html - 2025-05-28T13:58:10.734Z + 2025-05-28T14:04:28.097Z https://docs.axolotl.ai/docs/api/prompt_strategies.bradley_terry.llama3.html - 2025-05-28T13:58:10.115Z + 2025-05-28T14:04:27.497Z https://docs.axolotl.ai/docs/api/utils.distributed.html - 2025-05-28T13:58:10.472Z + 2025-05-28T14:04:27.841Z https://docs.axolotl.ai/docs/api/utils.tokenization.html - 2025-05-28T13:58:10.379Z + 2025-05-28T14:04:27.751Z https://docs.axolotl.ai/docs/api/utils.schemas.integrations.html - 2025-05-28T13:58:10.574Z + 2025-05-28T14:04:27.940Z https://docs.axolotl.ai/docs/api/utils.schedulers.html - 2025-05-28T13:58:10.452Z + 2025-05-28T14:04:27.823Z https://docs.axolotl.ai/docs/api/core.trainers.mixins.scheduler.html - 2025-05-28T13:58:09.917Z + 2025-05-28T14:04:27.305Z https://docs.axolotl.ai/docs/api/core.datasets.chat.html - 2025-05-28T13:58:09.575Z + 2025-05-28T14:04:26.991Z https://docs.axolotl.ai/docs/api/utils.callbacks.mlflow_.html - 2025-05-28T13:58:10.838Z + 2025-05-28T14:04:28.198Z https://docs.axolotl.ai/docs/api/monkeypatch.mistral_attn_hijack_flash.html - 2025-05-28T13:58:10.269Z + 2025-05-28T14:04:27.644Z https://docs.axolotl.ai/docs/api/core.chat.format.llama3x.html - 2025-05-28T13:58:09.568Z + 2025-05-28T14:04:26.985Z https://docs.axolotl.ai/docs/api/cli.checks.html - 2025-05-28T13:58:09.657Z + 2025-05-28T14:04:27.073Z https://docs.axolotl.ai/docs/api/monkeypatch.transformers_fa_utils.html - 2025-05-28T13:58:10.331Z + 2025-05-28T14:04:27.705Z https://docs.axolotl.ai/docs/api/prompt_strategies.llama2_chat.html - 2025-05-28T13:58:10.007Z + 2025-05-28T14:04:27.394Z https://docs.axolotl.ai/docs/api/convert.html - 2025-05-28T13:58:09.390Z + 2025-05-28T14:04:26.813Z https://docs.axolotl.ai/docs/api/utils.schemas.datasets.html - 2025-05-28T13:58:10.545Z + 2025-05-28T14:04:27.911Z https://docs.axolotl.ai/docs/api/prompt_strategies.dpo.llama3.html - 2025-05-28T13:58:10.058Z + 2025-05-28T14:04:27.443Z https://docs.axolotl.ai/docs/api/cli.args.html - 2025-05-28T13:58:09.651Z + 2025-05-28T14:04:27.066Z https://docs.axolotl.ai/docs/api/monkeypatch.llama_attn_hijack_xformers.html - 2025-05-28T13:58:10.255Z + 2025-05-28T14:04:27.631Z https://docs.axolotl.ai/docs/api/prompt_strategies.dpo.chat_template.html - 2025-05-28T13:58:10.047Z + 2025-05-28T14:04:27.433Z https://docs.axolotl.ai/docs/api/models.mamba.modeling_mamba.html - 2025-05-28T13:58:10.752Z + 2025-05-28T14:04:28.115Z https://docs.axolotl.ai/docs/api/prompt_strategies.messages.chat.html - 2025-05-28T13:58:10.046Z + 2025-05-28T14:04:27.431Z https://docs.axolotl.ai/docs/api/monkeypatch.llama_patch_multipack.html - 2025-05-28T13:58:10.316Z + 2025-05-28T14:04:27.690Z https://docs.axolotl.ai/docs/api/loaders.patch_manager.html - 2025-05-28T13:58:09.899Z + 2025-05-28T14:04:27.290Z https://docs.axolotl.ai/docs/api/utils.schemas.peft.html - 2025-05-28T13:58:10.553Z + 2025-05-28T14:04:27.919Z https://docs.axolotl.ai/docs/api/monkeypatch.llama_expand_mask.html - 2025-05-28T13:58:10.279Z + 2025-05-28T14:04:27.654Z https://docs.axolotl.ai/docs/api/core.trainers.grpo.trainer.html - 2025-05-28T13:58:09.831Z + 2025-05-28T14:04:27.244Z https://docs.axolotl.ai/docs/api/integrations.spectrum.args.html - 2025-05-28T13:58:10.731Z + 2025-05-28T14:04:28.094Z https://docs.axolotl.ai/docs/api/core.trainers.relora.html - 2025-05-28T13:58:09.815Z + 2025-05-28T14:04:27.228Z https://docs.axolotl.ai/docs/api/loaders.model.html - 2025-05-28T13:58:09.859Z + 2025-05-28T14:04:27.267Z https://docs.axolotl.ai/docs/api/utils.bench.html - 2025-05-28T13:58:10.402Z + 2025-05-28T14:04:27.774Z https://docs.axolotl.ai/docs/api/prompt_strategies.input_output.html - 2025-05-28T13:58:10.019Z + 2025-05-28T14:04:27.406Z https://docs.axolotl.ai/docs/api/cli.inference.html - 2025-05-28T13:58:09.689Z + 2025-05-28T14:04:27.104Z https://docs.axolotl.ai/docs/api/core.trainers.dpo.trainer.html - 2025-05-28T13:58:09.822Z + 2025-05-28T14:04:27.235Z https://docs.axolotl.ai/docs/api/monkeypatch.gradient_checkpointing.offload_disk.html - 2025-05-28T13:58:10.372Z + 2025-05-28T14:04:27.744Z https://docs.axolotl.ai/docs/api/utils.lora.html - 2025-05-28T13:58:10.394Z + 2025-05-28T14:04:27.765Z https://docs.axolotl.ai/docs/api/monkeypatch.btlm_attn_hijack_flash.html - 2025-05-28T13:58:10.314Z + 2025-05-28T14:04:27.688Z https://docs.axolotl.ai/docs/api/utils.chat_templates.html - 2025-05-28T13:58:10.389Z + 2025-05-28T14:04:27.761Z https://docs.axolotl.ai/docs/api/integrations.cut_cross_entropy.args.html - 2025-05-28T13:58:10.712Z + 2025-05-28T14:04:28.076Z https://docs.axolotl.ai/docs/api/monkeypatch.trainer_fsdp_optim.html - 2025-05-28T13:58:10.325Z + 2025-05-28T14:04:27.699Z https://docs.axolotl.ai/docs/api/prompt_strategies.stepwise_supervised.html - 2025-05-28T13:58:10.024Z + 2025-05-28T14:04:27.410Z https://docs.axolotl.ai/docs/api/common.datasets.html - 2025-05-28T13:58:10.751Z + 2025-05-28T14:04:28.114Z https://docs.axolotl.ai/docs/api/prompt_strategies.alpaca_instruct.html - 2025-05-28T13:58:09.974Z + 2025-05-28T14:04:27.362Z https://docs.axolotl.ai/docs/api/core.chat.format.chatml.html - 2025-05-28T13:58:09.566Z + 2025-05-28T14:04:26.983Z https://docs.axolotl.ai/docs/api/monkeypatch.attention.mllama.html - 2025-05-28T13:58:10.340Z + 2025-05-28T14:04:27.713Z https://docs.axolotl.ai/docs/api/prompt_strategies.dpo.zephyr.html - 2025-05-28T13:58:10.070Z + 2025-05-28T14:04:27.454Z https://docs.axolotl.ai/docs/api/utils.schemas.config.html - 2025-05-28T13:58:10.515Z + 2025-05-28T14:04:27.882Z https://docs.axolotl.ai/docs/api/monkeypatch.unsloth_.html - 2025-05-28T13:58:10.333Z + 2025-05-28T14:04:27.706Z https://docs.axolotl.ai/docs/api/prompt_strategies.dpo.passthrough.html - 2025-05-28T13:58:10.073Z + 2025-05-28T14:04:27.457Z https://docs.axolotl.ai/docs/api/prompt_strategies.orcamini.html - 2025-05-28T13:58:10.035Z + 2025-05-28T14:04:27.420Z https://docs.axolotl.ai/docs/api/cli.evaluate.html - 2025-05-28T13:58:09.631Z + 2025-05-28T14:04:27.046Z https://docs.axolotl.ai/docs/api/monkeypatch.multipack.html - 2025-05-28T13:58:10.271Z + 2025-05-28T14:04:27.646Z https://docs.axolotl.ai/docs/api/cli.sweeps.html - 2025-05-28T13:58:09.724Z + 2025-05-28T14:04:27.138Z https://docs.axolotl.ai/docs/api/prompt_strategies.kto.chatml.html - 2025-05-28T13:58:10.089Z + 2025-05-28T14:04:27.472Z https://docs.axolotl.ai/docs/api/prompt_strategies.base.html - 2025-05-28T13:58:09.941Z + 2025-05-28T14:04:27.329Z https://docs.axolotl.ai/docs/api/kernels.utils.html - 2025-05-28T13:58:10.227Z + 2025-05-28T14:04:27.605Z https://docs.axolotl.ai/docs/api/utils.schemas.model.html - 2025-05-28T13:58:10.522Z + 2025-05-28T14:04:27.889Z https://docs.axolotl.ai/docs/api/utils.data.pretraining.html - 2025-05-28T13:58:10.484Z + 2025-05-28T14:04:27.853Z https://docs.axolotl.ai/docs/api/monkeypatch.mixtral.html - 2025-05-28T13:58:10.343Z + 2025-05-28T14:04:27.716Z https://docs.axolotl.ai/docs/api/logging_config.html - 2025-05-28T13:58:09.438Z + 2025-05-28T14:04:26.859Z https://docs.axolotl.ai/docs/api/prompt_strategies.alpaca_chat.html - 2025-05-28T13:58:09.973Z + 2025-05-28T14:04:27.360Z https://docs.axolotl.ai/docs/api/prompt_strategies.kto.llama3.html - 2025-05-28T13:58:10.081Z + 2025-05-28T14:04:27.465Z https://docs.axolotl.ai/docs/api/integrations.kd.trainer.html - 2025-05-28T13:58:10.721Z + 2025-05-28T14:04:28.084Z https://docs.axolotl.ai/docs/api/utils.quantization.html - 2025-05-28T13:58:10.507Z + 2025-05-28T14:04:27.874Z https://docs.axolotl.ai/docs/api/prompt_tokenizers.html - 2025-05-28T13:58:09.433Z + 2025-05-28T14:04:26.854Z https://docs.axolotl.ai/docs/api/kernels.geglu.html - 2025-05-28T13:58:10.208Z + 2025-05-28T14:04:27.586Z https://docs.axolotl.ai/docs/custom_integrations.html - 2025-05-28T13:57:40.153Z + 2025-05-28T14:03:55.091Z https://docs.axolotl.ai/docs/multimodal.html - 2025-05-28T13:57:40.157Z + 2025-05-28T14:03:55.095Z https://docs.axolotl.ai/docs/multi-node.html - 2025-05-28T13:57:40.157Z + 2025-05-28T14:03:55.095Z https://docs.axolotl.ai/docs/docker.html - 2025-05-28T13:57:40.154Z + 2025-05-28T14:03:55.092Z https://docs.axolotl.ai/docs/amd_hpc.html - 2025-05-28T13:57:40.153Z + 2025-05-28T14:03:55.091Z https://docs.axolotl.ai/docs/dataset_loading.html - 2025-05-28T13:57:40.153Z + 2025-05-28T14:03:55.092Z https://docs.axolotl.ai/docs/qat.html - 2025-05-28T13:57:40.157Z + 2025-05-28T14:03:55.095Z https://docs.axolotl.ai/docs/cli.html - 2025-05-28T13:57:40.153Z + 2025-05-28T14:03:55.091Z https://docs.axolotl.ai/docs/config.html - 2025-05-28T13:57:40.153Z + 2025-05-28T14:03:55.091Z https://docs.axolotl.ai/docs/unsloth.html - 2025-05-28T13:57:40.157Z + 2025-05-28T14:03:55.095Z https://docs.axolotl.ai/docs/ray-integration.html - 2025-05-28T13:57:40.157Z + 2025-05-28T14:03:55.095Z https://docs.axolotl.ai/docs/dataset-formats/stepwise_supervised.html - 2025-05-28T13:57:40.153Z + 2025-05-28T14:03:55.091Z https://docs.axolotl.ai/docs/dataset-formats/index.html - 2025-05-28T13:57:40.153Z + 2025-05-28T14:03:55.091Z https://docs.axolotl.ai/docs/dataset-formats/tokenized.html - 2025-05-28T13:57:40.153Z + 2025-05-28T14:03:55.092Z https://docs.axolotl.ai/docs/installation.html - 2025-05-28T13:57:40.156Z + 2025-05-28T14:03:55.095Z https://docs.axolotl.ai/docs/inference.html - 2025-05-28T13:57:40.156Z + 2025-05-28T14:03:55.095Z https://docs.axolotl.ai/docs/mac.html - 2025-05-28T13:57:40.157Z + 2025-05-28T14:03:55.095Z https://docs.axolotl.ai/docs/getting-started.html - 2025-05-28T13:57:40.154Z + 2025-05-28T14:03:55.092Z https://docs.axolotl.ai/docs/reward_modelling.html - 2025-05-28T13:57:40.157Z + 2025-05-28T14:03:55.095Z
tuple[PreTrainedModel, PeftConfig | None]tuple[PreTrainedModel | PeftModelForCausalLM, PeftConfig | None] A tuple with the loaded model and its LoRA configuration (if applicable).