* feat: support dot-notation CLI args for nested config options
Add support for overriding nested config fields (like TRL config) via
CLI using dot-notation, e.g.:
axolotl train grpo.yaml --trl.vllm-server-host=10.0.0.1 --trl.beta=0.1
Changes:
- args.py: Detect BaseModel subclass fields and generate dot-notation
CLI options (--parent.child) that map to double-underscore kwargs
(parent__child). Also fix _strip_optional_type for Python 3.10+
union syntax (X | None).
- config.py: Handle double-underscore kwargs in load_cfg by setting
nested dict values on the config.
- Add tests for nested option handling (see the sketch below).
Fixes #2702
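A minimal sketch of the mapping described above, assuming hypothetical helper names (`cli_option_to_kwarg`, `apply_nested_kwargs`); only `_strip_optional_type` is named in the actual change, and the real code in args.py / config.py may differ:

```python
import types
import typing


def _strip_optional_type(field_type):
    """Unwrap Optional[X] / X | None to X.

    Handles both typing.Optional and the Python 3.10+ union syntax.
    """
    origin = typing.get_origin(field_type)
    if origin in (typing.Union, types.UnionType):  # UnionType: 3.10+ only
        args = [a for a in typing.get_args(field_type) if a is not type(None)]
        if len(args) == 1:
            return args[0]
    return field_type


def cli_option_to_kwarg(option: str) -> str:
    """Map a dot-notation CLI option to a double-underscore kwarg,
    e.g. "--trl.vllm-server-host" -> "trl__vllm_server_host"."""
    return option.lstrip("-").replace(".", "__").replace("-", "_")


def apply_nested_kwargs(cfg: dict, kwargs: dict) -> None:
    """Set nested values for double-underscore kwargs,
    e.g. {"trl__beta": 0.1} -> cfg["trl"]["beta"] = 0.1."""
    for key, value in kwargs.items():
        if "__" in key:
            parent, child = key.split("__", 1)
            cfg.setdefault(parent, {})[child] = value
        else:
            cfg[key] = value
```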
* Address CodeRabbit review: fix string parent bug, add type hints and docstring
Signed-off-by: Manas Vardhan <manasvardhan@gmail.com>
* Add type coercion for CLI kwargs and fix pre-commit issues
- Add _coerce_value() for YAML-style type inference on string CLI args
- When existing config value has a type (int/float/bool), cast to match
- When no existing value, infer type from string (true/false, ints, floats, null)
- Apply coercion to both flat and nested (dot-notation) kwargs
- Remove unused pytest import (pre-commit/ruff)
- Update tests to pass string values (matching real CLI behavior)
- Add dedicated TestCoerceValue test class
Addresses maintainer feedback on type casting for nested kwargs; a sketch of the coercion follows.
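A minimal sketch of the coercion described above; the shape of `_coerce_value` here is an assumption, and the real implementation in args.py may differ:

```python
def _coerce_value(value: str, existing=None):
    """YAML-style type inference for string CLI args (illustrative).

    If the config already holds a typed value, cast the string to match;
    otherwise infer bool/int/float/None from the string, else keep str.
    """
    if existing is not None and not isinstance(existing, str):
        if isinstance(existing, bool):
            return value.lower() in ("true", "1", "yes")
        return type(existing)(value)  # e.g. int("3"), float("0.1")
    lowered = value.lower()
    if lowered in ("true", "false"):
        return lowered == "true"
    if lowered in ("null", "none"):
        return None
    for cast in (int, float):  # int first, so "3" stays an int
        try:
            return cast(value)
        except ValueError:
            continue
    return value
```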
---------
Signed-off-by: Manas Vardhan <manasvardhan@gmail.com>
* upgrade transformers to 5.1.0 and torchao to 0.16.0
* upgrade trl for parity
* handle trl api changes
* orpo doesn't have max_prompt_len to check anymore
* CPOConfig doesn't take max_prompt_length; fix CPU offload
* mark fsdp1 test as slow
* bump triton min to 3.4.0 and liger to 0.7.0
* use transformers main for now for zero3 fix
* handle group_by_length change
* fix changes upstream
* mark flaky test as skipped
* use transformers latest release 5.2.0
* fix: redact trackio and data_files
* fix: add new orgs to whitelist
* feat: add run id to logs for users to easily share
* fix: update to add more metrics
* fix: add missed experiment tracker
* chore: formatting in main
* feat: add sageattention
* feat: call patch on pre model load
* fix: patch to register the correct var
* fix: add strict import check at start
* chore: fix comments
* chore: refactor
* feat: add capability check
* fix: missed underscore
* fix: let sageattention use FA backend in transformers
* feat: update sage attention for attention mask and position ids
* feat: allow sample packing but warn when packing is not used
* fix: loss hitting 0 with packing and attention mask note
* feat: downcast embeds if sage attention too
* feat: add config validation
* feat: add attention docs
* chore: docs
* Prepare for transformers v5 upgrade
* fix hf cli
* update for hf hub changes
* fix tokenizer apply_chat_template args
* remap include_tokens_per_second
* fix tps
* handle migration for warmup
* use latest hf hub
* Fix scan -> ls
* fix import
* fix for renaming of mistral common tokenizer -> backend
* update for fixed tokenization for llama
* Skip phi35 tests for now
* remove mistral patch fixed upstream in huggingface/transformers#41439
* use namespacing for patch
* don't rely on sdist for e2e tests for now
* run modal ci without waiting too
* Fix dep for ci
* fix imports
* Fix fp8 check
* fsdp2 fixes
* fix version handling
* update fsdp version tests for new v5 behavior
* Fail multigpu tests after 3 failures
* skip known v5 broken tests for now and cleanup
* bump deps
* unmark skipped test
* re-enable test_fsdp_qlora_prequant_packed test
* increase multigpu ci timeout
* skip broken gemma3 test
* reduce timeout back to original 120min now that the hanging test is skipped
* fix for unnecessary collator for pretraining with bsz=1
* fix: safe_serialization deprecated in transformers v5 rc01 (#3318)
* torch_dtype deprecated
* load model in float32 for consistency with tests
* revert some test fixtures back
* use hf cache ls instead of scan
* don't strip fsdp_version
* more fsdp_version fixes for v5
* fix version in fsdp_config
* fix aliasing
* fix fsdp_version check
* check fsdp_version is 2 in both places
* Transformers v5 rc2 (#3347)
* bump dep
* use latest fbgemm, grab model config as part of fixture, un-skip test
* import AutoConfig
* no need for the problematic AutoConfig when specifying config.json manually
* add fixtures for argilla ultrafeedback datasets
* download phi4-reasoning
* fix arg
* update tests for phi fast tokenizer changes
* use explicit model types for gemma3
---------
Co-authored-by: Wing Lian <wing@axolotl.ai>
* fix: AutoModelForVision2Seq -> AutoModelForImageTextToText
* chore: remove duplicate
* fix: attempt fix gemma3 text mode
* chore: lint
* GA release of v5
* need property setter for name_or_path for mistral tokenizer
* vllm not compatible with transformers v5
* setter for chat_template w mistral too
---------
Co-authored-by: NanoCode012 <nano@axolotl.ai>
Co-authored-by: salman <salman.mohammadi@outlook.com>
* upgrade transformers to 4.57.5
* explicitly set versions for fbgemm-gpu
* handle index url for cuda version
* explicitly set cu version for fbgemm deps, skip for cu130
* cu suffix not needed on version if using whl subpath
* install xformers in the base docker image
* install numba and numpy first
* set CUDA_HOME for xformers install
* Set cuda home env
* don't install xformers by default on aarch64/arm64
* fix syntax for secrets in gha yaml
* setup env for uv too
* arm64 for base uv too
* don't build causal-conv1d or mamba for arm64 and use arm64 wheels
* fix dockerfile syntax
* fix shell syntax