axolotl/docs at 243620394a2576db507b1f6ab033c4183a18233e - axolotl - Gitea

tocmo0nlord/axolotl

Files

History

NanoCode012 243620394a fix: force train split for json,csv,txt for test_datasets and misc doc changes (#3226 )

* fix: force train split for json,csv,txt for test_datasets

* feat(doc): add info on mixing datasets for VLM

* feat(doc): max memory

* fix(doc): clarify lr groups

* fix: add info on vision not being dropped

* feat: add qwen3-vl to multimodal docs

* fix: add moe blocks to arch list

* feat(doc): improve mistral docs

* chore: add helpful link [skip-e2e]

* fix: add vram usage for mistral small

* Update link in docs/faq.qmd

Co-authored-by: salman <salman.mohammadi@outlook.com>

---------

Co-authored-by: Wing Lian <wing@axolotl.ai>
Co-authored-by: salman <salman.mohammadi@outlook.com>

2025-10-22 15:23:20 -07:00

..

dataset-formats

feat: support training with JSON string tool arguments (#3136 )

2025-09-25 12:06:21 +07:00

Ray Train Axolotl Integration (#2251 )

2025-01-29 00:10:19 -05:00

Add ruff, remove black, isort, flake8, pylint (#3092 )

2025-08-23 23:37:33 -04:00

.gitignore

Config doc autogen (#2718 )

2025-06-18 15:36:53 -04:00

amd_hpc.qmd

Feat(doc): Reorganize documentation, fix broken syntax, update notes (#2348 )

2025-02-25 16:09:37 +07:00

batch_vs_grad.qmd

Feat: update doc (#1475 ) [skip ci]

2024-04-04 13:43:40 +09:00

cli.qmd

CLI: add --launcher option, support launcher args, cleanup, refactor (#2924 )

2025-07-30 15:46:56 -04:00

custom_integrations.qmd

densemixer plugin integration (#2868 )

2025-07-07 17:05:19 -04:00

dataset_loading.qmd

Config doc autogen (#2718 )

2025-06-18 15:36:53 -04:00

dataset_preprocessing.qmd

Autodoc generation with quartodoc (#2419 )

2025-03-21 12:26:47 -04:00

debugging.qmd

feat:add support dataset_num_processes (#3129 ) [skip ci]

2025-10-13 17:18:12 +07:00

docker.qmd

feat(doc): re-add docker 2.7.0 tag back (#2902 ) [skip ci]

2025-07-12 11:40:01 -04:00

faq.qmd

fix: force train split for json,csv,txt for test_datasets and misc doc changes (#3226 )

2025-10-22 15:23:20 -07:00

fsdp_qlora.qmd

Add FSDP v2 swap memory support + QLoRA compatibility fixes (#3167 )

2025-09-26 10:23:59 +01:00

getting-started.qmd

Config doc autogen (#2718 )

2025-06-18 15:36:53 -04:00

gradient_checkpointing.qmd

Activation Offloading w CUDA Streams (#2900 ) [skip ci]

2025-07-14 20:10:20 -04:00

inference.qmd

Feat: minor docs improvements for RLHF and faq on embeddings (#2401 ) [skip ci]

2025-03-17 08:39:04 -04:00

input_output.qmd

Feat(doc): Reorganize documentation, fix broken syntax, update notes (#2348 )

2025-02-25 16:09:37 +07:00

installation.qmd

feat(doc): improve visibility for colab notebooks (#3110 ) [skip ci]

2025-09-03 01:40:53 -04:00

lora_optims.qmd

doc fix (#3187 )

2025-09-26 09:55:15 -04:00

lr_groups.qmd

fix: force train split for json,csv,txt for test_datasets and misc doc changes (#3226 )

2025-10-22 15:23:20 -07:00

mac.qmd

Feat(doc): Reorganize documentation, fix broken syntax, update notes (#2348 )

2025-02-25 16:09:37 +07:00

mixed_precision.qmd

basic torchao fp8 mixed precision training (#2926 )

2025-07-22 16:27:47 -04:00

multi-gpu.qmd

fix(doc): add act checkpointing migration to fsdp2 docs (#3193 ) [skip ci]

2025-10-10 10:57:50 +07:00

multi-node.qmd

CLI: add --launcher option, support launcher args, cleanup, refactor (#2924 )

2025-07-30 15:46:56 -04:00

multimodal.qmd

fix: force train split for json,csv,txt for test_datasets and misc doc changes (#3226 )

2025-10-22 15:23:20 -07:00

multipack.qmd

Bootstrap Hosted Axolotl Docs w/Quarto (#1429 )

2024-03-21 22:28:36 -07:00

nccl.qmd

Feat(doc): Reorganize documentation, fix broken syntax, update notes (#2348 )

2025-02-25 16:09:37 +07:00

nd_parallelism.qmd

feat: update nd parallelism readme (#3039 )

2025-08-08 12:45:36 +01:00

optimizations.qmd

feat(doc): add optimizations table of content to our improvements (#3175 ) [skip ci]

2025-09-24 16:13:49 -04:00

optimizers.qmd

feat: add complete optimizer docs (#3017 ) [skip ci]

2025-08-06 08:01:51 -04:00

qat.qmd

feat(doc): add optimizations table of content to our improvements (#3175 ) [skip ci]

2025-09-24 16:13:49 -04:00

quantize.qmd

qat doc updates (#3162 ) [skip-ci]

2025-09-17 10:38:15 +01:00

ray-integration.qmd

Feat(doc): Reorganize documentation, fix broken syntax, update notes (#2348 )

2025-02-25 16:09:37 +07:00

reward_modelling.qmd

Center rewards coefficient (#3124 )

2025-09-03 16:22:37 -04:00

rlhf.qmd

Add chat_template.argilla_chat support for DPO datasets (#3202 )

2025-10-17 17:00:26 +07:00

sequence_parallelism.qmd

Distributed/ND-Parallel (#2977 )

2025-07-31 15:25:02 -04:00

streaming.qmd

Streaming SFT support (#3101 )

2025-09-02 12:08:44 -04:00

torchao.qmd

Feat(doc): Reorganize documentation, fix broken syntax, update notes (#2348 )

2025-02-25 16:09:37 +07:00

unsloth.qmd

Feat(doc): Reorganize documentation, fix broken syntax, update notes (#2348 )

2025-02-25 16:09:37 +07:00