axolotl/examples at c2bd75aff610f257e00a7f4e7e6649f39d770757 - axolotl - Gitea

tocmo0nlord/axolotl

Wing Lian c2bd75aff6 Nemo gym integration (#3516) [skip ci]

* nemo gym integration with grpo wip

* mostly working

* cleanup

* simplify

* update docs

* nemo gym support wip

* cleanup

* chore: lint

* address PR review and add more tests

* chore: lint

* post merge lora fixes for CI (#3536) [skip ci]

* post merge lora fixes for CI

* handle lora kernel auto-enable for moe without grouped_mm

* prefer not to import torch in schema validation

* address pr comments, add timeout, add tests

* roundup_power2_divisions not needed with newer pytorch versions (#3540)

* roundup_power2_divisions not needed with newer pytorch versions

* remove typo

* update qwen3.5 moe 35b-a3b yaml for 5090

* more bug fixes

* fix tests to match updated trainer

* don't use fa2 for hooks test

* reset plugins on the instance

* retry download

* fix references to renamed axolotl_cfg property on trainer

* Fix ref to trainer cfg

* fix: robust handling of race condition on patching check (#3543) [skip ci]

* EBFT: Matching Features, Not Tokens: Energy-Based Fine-Tuning of Language Models (#3527) [skip ci]

* EBFT wip

* fixes

* more fixes

* add missing strided module

* ebft fixes for multi-turn

* make ebft work with async

* add example for ebft w qwen3.5

* fix for split thinking and update yaml for lora over linear attention only

* enforce_eager for vllm arg in schema

* fix sync weights

* fix multi-gpu

* handle updated sig for mm

* ddp fixes

* improve multi-gpu handling, don't calculate logits, adaptive completion length

* chore: lint

* chore: lint

* support completion_mean

* Address code review feedback

* clamp min IS ratio

* Address PR code review

* more fixes identified

* address code review

* Fix property from rebase conflict

* fix for ebft sync and update docs

* make trainer loss patch check a solo test

---------

Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

2026-03-25 07:38:06 -04:00

feat(doc): add optimizations table of content to our improvements (#3175) [skip ci]

2025-09-24 16:13:49 -04:00

upgrade vllm to v0.14.0 (#3345)

2026-01-21 20:00:18 -05:00

upgrade vllm to v0.14.0 (#3345)

2026-01-21 20:00:18 -05:00

fix: unify default for conversations_field [skip-e2e] (#3070)

2025-09-23 21:22:15 +07:00

deploy training jobs to baseten w truss in axolotl cli (#3086) [skip ci]

2025-08-26 09:29:50 -04:00

make pad_to_sequence_len default to the same value as sample_packing (#2941) [skip ci]

2025-07-21 11:40:56 -04:00

colab-notebooks

Scattermoe LoRA optimizations (#3513)

2026-03-19 23:07:42 -04:00

fix: unify default for conversations_field [skip-e2e] (#3070)

2025-09-23 21:22:15 +07:00

use warmup_ratio as a better default than warmup steps since it's data dependent (#2897) [skip ci]

2025-07-30 06:44:06 -04:00

upgrade vllm to v0.14.0 (#3345)

2026-01-21 20:00:18 -05:00

distributed-parallel

feat: update nd parallelism readme (#3039)

2025-08-08 12:45:36 +01:00

EAFT (#3366) [skip ci]

2026-01-28 06:44:15 -05:00

Nemo gym integration (#3516) [skip ci]

2026-03-25 07:38:06 -04:00

make pad_to_sequence_len default to the same value as sample_packing (#2941) [skip ci]

2025-07-21 11:40:56 -04:00

make pad_to_sequence_len default to the same value as sample_packing (#2941) [skip ci]

2025-07-21 11:40:56 -04:00

fix: gemma3 configs (#3500) [skip ci]

2026-03-20 16:14:06 +07:00

upgrade vllm to v0.14.0 (#3345)

2026-01-21 20:00:18 -05:00

use warmup_ratio as a better default than warmup steps since it's data dependent (#2897) [skip ci]

2025-07-30 06:44:06 -04:00

feat: add doc for expert quantization, glm45 air example configs, and update readme for release (#3452) [skip ci]

2026-03-05 09:58:09 -05:00

add glm support + patch (#3329) [skip ci]

2026-02-10 17:43:53 +07:00

feat: add doc for expert quantization, glm45 air example configs, and update readme for release (#3452) [skip ci]

2026-03-05 09:58:09 -05:00

upgrade vllm to v0.14.0 (#3345)

2026-01-21 20:00:18 -05:00

upgrade vllm to v0.14.0 (#3345)

2026-01-21 20:00:18 -05:00

upgrade vllm to v0.14.0 (#3345)

2026-01-21 20:00:18 -05:00

feat: add internvl3_5 (#3141) [skip-ci]

2025-12-25 18:07:59 +07:00

transformers v5 upgrade (#3272)

2026-01-27 17:08:24 -05:00

Feat: add kimi linear support (#3257)

2025-12-25 17:53:52 +07:00

feat: add lfm2 family and latest moe model (#3208)

2025-10-09 10:47:41 -04:00

Add FSDP v2 swap memory support + QLoRA compatibility fixes (#3167)

2025-09-26 10:23:59 +01:00

add: support mxfp4 axo (#3375)

2026-03-05 13:40:45 -05:00

fix: revert changing default optimizer to muon (#2965) [skip ci]

2025-07-22 10:00:30 -04:00

fix: unify default for conversations_field [skip-e2e] (#3070)

2025-09-23 21:22:15 +07:00

fix: revert changing default optimizer to muon (#2965) [skip ci]

2025-07-22 10:00:30 -04:00

upgrade vllm to v0.14.0 (#3345)

2026-01-21 20:00:18 -05:00

transformers v5 upgrade (#3272)

2026-01-27 17:08:24 -05:00

Feat: add MiMo and Plano (#3332) [skip-ci]

2025-12-25 18:09:03 +07:00

fix: improve ministral3 docs to be clearer (#3300)

2025-12-04 21:44:44 +07:00

feat : scaled softmax support (#3338)

2026-01-13 14:33:11 +07:00

build examples readmes with quarto (#3046)

2025-12-25 19:17:25 +07:00

fix: add dequant bf16 repo (#3507) [skip ci]

2026-03-20 17:11:46 +07:00

build examples readmes with quarto (#3046)

2025-12-25 19:17:25 +07:00

feat: add nemotron config (#3506)

2026-03-20 16:23:42 +07:00

feat: cleanup old flex mask patch, suppress Matmul bnb warn, and misc (#3330) [skip-ci]

2025-12-25 17:56:20 +07:00

use warmup_ratio as a better default than warmup steps since it's data dependent (#2897) [skip ci]

2025-07-30 06:44:06 -04:00

fix: unify default for conversations_field [skip-e2e] (#3070)

2025-09-23 21:22:15 +07:00

Feat: add Magistral Small 2509 and native mistral3 tokenizer support (#3165)

2025-09-18 15:42:20 +07:00

Feat: add MiMo and Plano (#3332) [skip-ci]

2025-12-25 18:09:03 +07:00

Add QAT NVFP4 configs for blogpost (#3280) [skip ci]

2025-12-17 09:35:22 -05:00

Distributed Muon Optimizer (#3264)

2025-12-19 10:43:47 -05:00

fix: unify default for conversations_field [skip-e2e] (#3070)

2025-09-23 21:22:15 +07:00

fix: unify default for conversations_field [skip-e2e] (#3070)

2025-09-23 21:22:15 +07:00

fix: update qwen3 jinja tokenization off a few tokens (#3295)

2025-12-09 14:31:03 +07:00

fix: qwen3-next to use fla causal-conv1d to support packing (#3437

2026-03-03 09:26:46 -05:00

roundup_power2_divisions not needed with newer pytorch versions (#3540)

2026-03-24 15:40:05 -04:00

Feat: add Olmo3 (BC with Olmo and Olmo2) (#3275)

2025-11-24 10:21:31 +07:00

Example for Slurm and various fixes (#3038) [skip ci]

2025-08-08 08:02:03 -04:00

Feat: add Olmo3 (BC with Olmo and Olmo2) (#3275)

2025-11-24 10:21:31 +07:00

Streaming SFT support (#3101)

2025-09-02 12:08:44 -04:00

feat: Add SwanLab integration for experiment tracking (#3334)

2026-01-06 09:19:18 -05:00

Fix: quantize and target moe layers in transformers v5 for adapters and many misc fixes (#3439)

2026-03-03 10:06:23 -05:00

upgrade vllm to v0.14.0 (#3345)

2026-01-21 20:00:18 -05:00