axolotl/docs at dd660c2ed046e8715cdf73c23cf14066ac165ce7 - axolotl - Gitea

tocmo0nlord/axolotl

Files

History

Wing Lian c67910fa6f bump hf deps (#2735 ) [skip ci]

* bump hf deps

* upgrade liger-kernel too

* install cce from fork for transformers fix

* fix reference to vocab size in gemma3 patch

* use padding_idx instead of pad_token_id

* remove fixed gemma3 patch

* use updated cce fork

* fix local mllama cce patches w docstring

* add test for multipack with trainer setup and fix trainer for trainer refactor upstream

* bump modal version

* guard for iterable datasetS

* mllama model arch layout changed in latest transformers

* fix batch sampler with drop_last

* fix: address upstream vlm changes for lora

* fix: update references to old lora target path

* fix: remove mllama fa2 patch due to upstream fix

* fix: lora kernel patch path for multimodal models

* fix: removed mllama from quarto

* run test for came optim on 2.6.0+

* fix fsdp2 patch and remove deprecated patch

* make sure to set sequence_parallel_degree for grpo

* Add SP test for GRPO

* add sp to grpo config for trainer

* use reward_funcs as kwarg to grpo trainer

* fix the comprehension for reward funcs

* reward funcs already passed in as args

* init sp_group right before training

* fix check for adding models to SP context

* make sure to pass args to super

* upgrade deepspeed

* use updated trl and add reasoning flags for vllm

* patch the worker

---------

Co-authored-by: NanoCode012 <nano@axolotl.ai>

2025-06-05 07:20:33 -07:00

..

dataset-formats

Fix(doc): clarify data loading for local datasets and splitting samples (#2726 ) [skip ci]

2025-05-28 15:48:22 +07:00

Ray Train Axolotl Integration (#2251 )

2025-01-29 00:10:19 -05:00

.gitignore

Autodoc generation with quartodoc (#2419 )

2025-03-21 12:26:47 -04:00

amd_hpc.qmd

Feat(doc): Reorganize documentation, fix broken syntax, update notes (#2348 )

2025-02-25 16:09:37 +07:00

batch_vs_grad.qmd

Feat: update doc (#1475 ) [skip ci]

2024-04-04 13:43:40 +09:00

cli.qmd

QAT (#2590 )

2025-05-28 12:35:47 +01:00

config.qmd

Fix: RL base feature parity (#2133 )

2025-05-30 11:21:47 +07:00

custom_integrations.qmd

Add: Sparse Finetuning Integration with llmcompressor (#2479 )

2025-05-01 12:25:16 -04:00

dataset_loading.qmd

Fix(doc): clarify data loading for local datasets and splitting samples (#2726 ) [skip ci]

2025-05-28 15:48:22 +07:00

dataset_preprocessing.qmd

Autodoc generation with quartodoc (#2419 )

2025-03-21 12:26:47 -04:00

debugging.qmd

Feat(doc): Reorganize documentation, fix broken syntax, update notes (#2348 )

2025-02-25 16:09:37 +07:00

docker.qmd

add support for base image with uv (#2691 )

2025-06-02 12:48:55 -07:00

faq.qmd

Add a few items to faq (#2734 )

2025-05-28 16:20:19 -04:00

fsdp_qlora.qmd

github urls (#1734 )

2024-07-11 09:19:29 -04:00

getting-started.qmd

Fix quarto (#2717 )

2025-05-23 21:16:51 -04:00

inference.qmd

Feat: minor docs improvements for RLHF and faq on embeddings (#2401 ) [skip ci]

2025-03-17 08:39:04 -04:00

input_output.qmd

Feat(doc): Reorganize documentation, fix broken syntax, update notes (#2348 )

2025-02-25 16:09:37 +07:00

installation.qmd

add support for base image with uv (#2691 )

2025-06-02 12:48:55 -07:00

lora_optims.qmd

feat(doc): note lora kernel incompat with RLHF (#2706 ) [skip ci]

2025-05-28 15:48:40 +07:00

lr_groups.qmd

support for custom lr groups for non-embedding modules (#2213 )

2025-01-24 12:56:28 -05:00

mac.qmd

Feat(doc): Reorganize documentation, fix broken syntax, update notes (#2348 )

2025-02-25 16:09:37 +07:00

multi-gpu.qmd

SP dataloader patching + removing custom sampler / dataloader logic (#2686 )

2025-05-21 11:20:20 -04:00

multi-node.qmd

Feat(doc): Reorganize documentation, fix broken syntax, update notes (#2348 )

2025-02-25 16:09:37 +07:00

multimodal.qmd

bump hf deps (#2735 ) [skip ci]

2025-06-05 07:20:33 -07:00

multipack.qmd

Bootstrap Hosted Axolotl Docs w/Quarto (#1429 )

2024-03-21 22:28:36 -07:00

nccl.qmd

Feat(doc): Reorganize documentation, fix broken syntax, update notes (#2348 )

2025-02-25 16:09:37 +07:00

qat.qmd

QAT (#2590 )

2025-05-28 12:35:47 +01:00

quantize.qmd

QAT (#2590 )

2025-05-28 12:35:47 +01:00

ray-integration.qmd

Feat(doc): Reorganize documentation, fix broken syntax, update notes (#2348 )

2025-02-25 16:09:37 +07:00

reward_modelling.qmd

chore(docs): add cookbook/blog link to docs (#2410 ) [skip ci]

2025-03-17 08:38:19 -04:00

rlhf.qmd

fix abbriviatation spelling error

2025-06-03 21:30:40 +02:00

sequence_parallelism.qmd

SP dataloader patching + removing custom sampler / dataloader logic (#2686 )

2025-05-21 11:20:20 -04:00

torchao.qmd

Feat(doc): Reorganize documentation, fix broken syntax, update notes (#2348 )

2025-02-25 16:09:37 +07:00

unsloth.qmd

Feat(doc): Reorganize documentation, fix broken syntax, update notes (#2348 )

2025-02-25 16:09:37 +07:00