axolotl

Author	SHA1	Message	Date
Salman Mohammadi	7d8e8c9ac2	nit [skip-e2e]	2025-08-13 12:58:30 +01:00
Salman Mohammadi	7c2466b739	nit	2025-08-13 12:58:13 +01:00
Salman Mohammadi	3146cb56dd	docs	2025-08-13 12:53:58 +01:00
Salman Mohammadi	c09b0a3bbf	reverting change	2025-08-13 11:27:15 +01:00
Salman Mohammadi	e05acccd77	linting	2025-08-13 11:24:22 +01:00
Salman Mohammadi	c44abad531	debugging CI	2025-08-13 11:24:05 +01:00
Salman Mohammadi	817d70e669	debugging CI	2025-08-13 10:45:41 +01:00
Salman Mohammadi	03f5a7fd16	adding back	2025-08-12 18:35:12 +01:00
Salman Mohammadi	3d9b96a94f	testing revert	2025-08-12 15:53:43 +01:00
Salman Mohammadi	42c16024a2	docs	2025-08-12 15:34:46 +01:00
Salman Mohammadi	ec94d632f3	docs	2025-08-12 14:07:55 +01:00
Salman Mohammadi	e8bd3b0b3b	Merge branch 'fix-preview' of github.com:axolotl-ai-cloud/axolotl into fix-preview	2025-08-12 13:42:56 +01:00
Salman Mohammadi	5a08b94668	update workflow	2025-08-12 12:29:09 +01:00
salman	ecb8c1f4b3	Merge branch 'main' into fix-preview	2025-08-12 09:43:39 +01:00
Salman Mohammadi	ab57be6526	render docs on python file change to preview api ref	2025-08-12 09:43:23 +01:00
Wing Lian	3d45620008	remove prepare-from-posids patch (#3052 ) [skip ci]	2025-08-11 09:34:41 -04:00
github-actions[bot]	ce20e838b5	chore: update pre-commit hooks (#3050 ) [skip ci] Co-authored-by: djsaunde <1245942+djsaunde@users.noreply.github.com>	2025-08-11 09:32:21 -04:00
Wing Lian	d4d84d48af	fix ray train and add fsdp2 smoke test for ray trainer (#3053 ) * add fsdp2 smokle test for ray trainer * fix raytrain with fsdp2	2025-08-11 09:31:54 -04:00
Wing Lian	c9640bca2c	attempt to fix quartodoc render for yields	2025-08-10 22:23:09 -04:00
Wing Lian	9b12c05660	use exec instead of subprocess to make ctrl+c nicer for cli (#3044 ) * use exec instead of subprocess to make ctrl+c nicer for cli * change var name to use_exec * simplify to bool * flush std* * patch subprocess as mock in test * fix tests * more test fixes	2025-08-10 20:22:20 -04:00
Wing Lian	686933194e	fix vllm tagging and add cloud images w/o tmux (#3049 ) [skip ci]	2025-08-10 20:21:56 -04:00
Wing Lian	d12b461d19	follow up fix for plugin registration (#3054 ) [skip ci]	2025-08-10 20:21:38 -04:00
Wing Lian	d6b81b3683	update training args check for new defaults (#3051 ) [skip ci] * update training args check for new defaults * skip check for now	2025-08-10 11:26:22 -04:00
Wing Lian	05f1b4b2e8	run monkeypatch tests in seperate runner (#3047 )	2025-08-09 14:34:07 -04:00
Wing Lian	7cfc80ec77	set dev version (#3045 ) [skip ci]	2025-08-08 13:56:53 -04:00
salman	0da6a95efa	Add citation.tff (#3043 ) [skip ci]	2025-08-08 16:18:42 +01:00
Wing Lian	2c8497e489	tag for v0.12.0 release (#3041 ) Some checks failed ci-cd / build-axolotl (<nil>, 126, 12.6.3, 3.11, 2.6.0) (push) Has been cancelled Details ci-cd / build-axolotl (<nil>, 126, 12.6.3, 3.11, 2.7.0) (push) Has been cancelled Details ci-cd / build-axolotl (<nil>, 128, 12.8.1, 3.11, 2.7.1) (push) Has been cancelled Details ci-cd / build-axolotl (vllm, 126, 12.6.3, true, 3.11, 2.7.1) (push) Has been cancelled Details publish pypi / Create Release (push) Has been cancelled Details ci-cd / build-axolotl-cloud (<nil>, 126, 12.6.3, 3.11, 2.6.0) (push) Has been cancelled Details ci-cd / build-axolotl-cloud (<nil>, 126, 12.6.3, 3.11, 2.7.0) (push) Has been cancelled Details ci-cd / build-axolotl-cloud (<nil>, 126, 12.6.3, true, 3.11, 2.7.1) (push) Has been cancelled Details ci-cd / build-axolotl-cloud (<nil>, 128, 12.8.1, 3.11, 2.7.1) (push) Has been cancelled Details ci-cd / build-axolotl-cloud-no-tmux (<nil>, 126, 12.6.3, 3.11, 2.6.0) (push) Has been cancelled Details publish pypi / Upload release to PyPI (push) Has been cancelled Details v0.12.0	2025-08-08 08:24:09 -04:00
NanoCode012	f70d4de8c7	feat(doc): add links to new features on README (#2980 ) [skip ci] * feat(doc): add links to new features on README * fix merge error * remove blurb about older FSDP2 integration * update blog link * chore: update cce commit * feat: update model support into readme * Update README.md Co-authored-by: salman <salman.mohammadi@outlook.com> * chore: lint num spaces --------- Co-authored-by: Wing Lian <wing@axolotl.ai> Co-authored-by: salman <salman.mohammadi@outlook.com>	2025-08-08 08:16:43 -04:00
Dan Saunders	0ae06d756d	use nanmean for loss aggregation (CP fix) (#3033 ) * use nanmena for loss aggregation (CP fix) * use regular asserts * small changes to make tests isolate * combining evaluation_loop patches * fix * delete unused * fix check	2025-08-08 08:15:17 -04:00
NanoCode012	2974670bf8	Feat: add arcee (#3028 ) * feat: add arcee * feat: add latest models supported by cce * feat: add arcee example config * chore: lint * fix: typo * feat: change to instruct * feat: add vram usage * Update README.md	2025-08-08 08:09:11 -04:00
Wing Lian	50f2b94d50	add 120b and deepspeed zero3 examples (#3035 ) [skip ci] * add 120b and deepspeed zero3 examples * add a bit of flavor and cleanup gpt oss readme * fix: remove expert vram usage * fix: remove redundant EOS token from eot_tokens * feat: add 120B to docs --------- Co-authored-by: NanoCode012 <nano@axolotl.ai>	2025-08-08 08:04:56 -04:00
Wing Lian	eb2c87b525	Example for Slurm and various fixes (#3038 ) [skip ci] * slurm example and make preprocess play nicely * start slurm if it init file exists * remove incorrect comment * feat: add slurm docs --------- Co-authored-by: NanoCode012 <nano@axolotl.ai>	2025-08-08 08:02:03 -04:00
NanoCode012	4db7f023c6	feat(doc): standardize the axolotl install to a release (#3040 ) [skip ci]	2025-08-08 08:00:26 -04:00
NanoCode012	4273d5cf7e	feat: update nd parallelism readme (#3039 ) Co-authored-by: salman <salman.mohammadi@outlook.com>	2025-08-08 12:45:36 +01:00
Wing Lian	c5e5aba547	Add 2.8.0 base images and uv images (#3034 )	2025-08-08 02:30:16 -04:00
Wing Lian	9d5c95db6f	Add support for Accelerate CP, ND examples, and fix for parallel config w fsdp (#3019 ) * fix for parallelism config from trainer * fix handling of parallelism_config w accelerate * add todo for removal * update to latest axolotl-contribs-mit for optimizer fix too * synchronize training after checkpoint save * dir spelling * use latest accelerate main * fix to not use partial state parallelism_config * more fixeS * use most recent accelerate fix * fix cpu_ram_efficient_loading to meta devices from rank 0 to prevent CPU RAM oom * improve handling of broadcasting fsdp2 state dict * support for openai chat template with thinking key as the reasoning trace * address PR feedback * refactor to remove dependency on PartialState for parallelism config * bump accelerate, gptoss fixes * limit meta fixes to fsdp2 for now * fixes for gpt oss * fixup examples, don't use cpu-ram-efficient-loading for now * remove problematic barrier * patch parallelism config * reorder comparison * device mesh fixes * make pure CP work * lint	2025-08-07 21:22:15 -04:00
NanoCode012	ca796fb56e	feat(doc): update gpt-oss readme (#3029 ) [skip ci] * feat(doc): update gpt-oss readme * fix: caps * feat: add toolcalling section * feat: add example tool dataset to docs * chore: update	2025-08-07 09:26:42 -04:00
VED	597953bef0	clear cache before clean up (#3031 ) [skip ci] * clear chahe before save_model * chore: lint --------- Co-authored-by: Wing Lian <wing@axolotl.ai>	2025-08-07 09:25:58 -04:00
NanoCode012	39fbd3b2b5	fix: lora kernels for mistral3 (#3027 ) [skip ci]	2025-08-07 09:25:37 -04:00
salman	46dfacf255	ND Parallel Doc Nits (#3032 )	2025-08-07 10:34:26 +01:00
Wing Lian	4bce713b39	allow custom trainer_cls to be defined as a module reference in the YAML (#3024 ) [skip ci] * allow custom trainer_cls to be defined as a module reference in the YAML * address PR feedback and add test * add tests	2025-08-06 22:49:19 -04:00
Dan Saunders	d09290f2f4	Lora kernels bias support (#3025 ) * lora kernels bias support * revert rename * nit * lint, tests * satisfying the rabbit	2025-08-06 20:20:08 -04:00
Wing Lian	e442ff22aa	fix keyerror on load_in_8bit/load_in_4bit access in _set_quantization_config (#3023 ) * set load_in_8bit/load_in_4bit in _set_quantization_config to prevent keyerror * use dict.get instead	2025-08-06 14:28:52 -04:00
Wing Lian	ba3dba3e4f	add kernels for gpt oss models (#3020 ) * add kernels for gpt oss models * add support for gpt-oss * typo incorrect package * fix: layout for configs and added wandb/epochs * add gptoss example w offload and set moe leaf for z3 * add support for Mxfp4Config from yaml * update yaml to use official model * fix lora and don't allow triton to go above 3.3.1 * fix lr and tweak vram use * fix range for triton since pinned wasn't compatible with toch 2.6.0 * update cce with gpt oss patches --------- Co-authored-by: NanoCode012 <nano@axolotl.ai>	2025-08-06 09:47:55 -04:00
Wing Lian	97e86c6d47	drop old patches and code that are no longer needed (#3007 ) [skip ci]	2025-08-06 08:02:39 -04:00
VED	784f8c0e95	fix:kd_distillation key_error logprobs (#2990 ) * fix:kd_distillation key_error logprobs * style * fix: leave handling of pop logprobs to parent --------- Co-authored-by: NanoCode012 <nano@axolotl.ai>	2025-08-06 08:02:07 -04:00
NanoCode012	e3177c3210	feat: add complete optimizer docs (#3017 ) [skip ci] * feat: add complete optimizer docs * fix: deprecate old torchao adamw low bit	2025-08-06 08:01:51 -04:00
Wing Lian	70faea331f	add support for connecting via prime-intellect (#3021 )	2025-08-06 01:06:52 -04:00
Wing Lian	8021c718ce	use skip_move_to_device for all cases (#3015 ) * use skip_move_to_device for all cases * use experimental option for skip move	2025-08-06 00:13:12 -04:00
Wing Lian	42f5e6f9e9	upgrade transformers==4.55.0 (#3018 )	2025-08-05 16:29:12 -04:00

1 2 3 4 5 ...

2369 Commits