Commit Graph

2170 Commits

Author SHA1 Message Date
Dan Saunders
345a159796 coderabbit comments 2025-06-07 04:50:29 +00:00
Dan Saunders
657bffd85f update posthog dep 2025-06-05 23:46:20 +00:00
Dan Saunders
f0dde8e2d5 lint 2025-06-05 23:41:46 +00:00
Dan Saunders
25fa4df70f fix 2025-06-05 23:33:46 +00:00
Dan Saunders
e735f4270b slight changes 2025-06-05 23:33:46 +00:00
Dan Saunders
035e7a2f4c simplifying 2025-06-05 23:33:46 +00:00
Dan Saunders
2d36c11264 minor fixes 2025-06-05 23:33:46 +00:00
Dan Saunders
b8ec5bdccf doc update 2025-06-05 23:33:44 +00:00
Dan Saunders
249405b46e docs fix 2025-06-05 23:31:44 +00:00
Dan Saunders
d3be84fec2 enable / disable logic update 2025-06-05 23:31:44 +00:00
Dan Saunders
1c74ab175f opt-in version of telemetry 2025-06-05 23:31:44 +00:00
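Commits 1c74ab175f and d3be84fec2 switch telemetry to an opt-in model with explicit enable/disable logic. A minimal sketch of env-var-gated opt-in telemetry follows; the `AXOLOTL_TELEMETRY` flag and `TelemetryManager` name are assumptions for illustration, not the repo's actual API:

```python
import os


class TelemetryManager:
    """Hypothetical sketch of opt-in telemetry gating (not axolotl's actual API)."""

    def __init__(self) -> None:
        # Opt-in: telemetry stays off unless the user explicitly enables it.
        self.enabled = os.environ.get("AXOLOTL_TELEMETRY", "0") == "1"

    def send_event(self, name: str, properties: dict) -> None:
        if not self.enabled:
            return  # no-op when the user has not opted in
        # ... forward the event to the analytics backend (e.g. PostHog) ...
```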
Dan Saunders
b2f1fc109a distributed fix 2025-06-05 23:31:44 +00:00
Dan Saunders
5a2a80cc48 fix issue with tests in ci 2025-06-05 23:31:44 +00:00
Dan Saunders
4033fe74f8 fixes 2025-06-05 23:31:44 +00:00
Dan Saunders
e9df4444be remove duplicate info 2025-06-05 23:31:44 +00:00
Dan Saunders
ffd2985750 adding runtime metrics / system info, additional accelerator support, etc. 2025-06-05 23:31:44 +00:00
Dan Saunders
17310f9acc adding runtime metrics / system info, additional accelerator support, etc. 2025-06-05 23:31:44 +00:00
Dan Saunders
71ae6f9f87 improved redaction, send system info during model config load telemetry, etc. 2025-06-05 23:31:08 +00:00
Dan Saunders
9dd1092f8f doc update 2025-06-05 23:27:29 +00:00
Dan Saunders
2c2f2647a9 fix 2025-06-05 23:27:29 +00:00
Dan Saunders
98313a6b3f adding back in base_model redaction w/ whitelist 2025-06-05 23:27:29 +00:00
Dan Saunders
8b75205d3b sleep on all ranks in distributed setting 2025-06-05 23:27:29 +00:00
Dan Saunders
ef4990f304 simplifying path redaction 2025-06-05 23:27:29 +00:00
Dan Saunders
db3297b090 small update / fix 2025-06-05 23:27:27 +00:00
Dan Saunders
86ed554bda tests for runtime metrics telemetry and assoc. callback 2025-06-05 23:26:07 +00:00
Dan Saunders
f254d7d5a2 adding runtime metrics (cpu + gpu memory, steps/s, etc.) 2025-06-05 23:26:05 +00:00
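Commit f254d7d5a2 collects runtime metrics (CPU + GPU memory, steps/s) via a trainer callback, tested in 86ed554bda. A sketch of what such a callback could look like with transformers' `TrainerCallback`; the class name and metric keys are assumptions:

```python
import time

import psutil
import torch
from transformers import TrainerCallback


class RuntimeMetricsCallback(TrainerCallback):
    """Hypothetical sketch: sample CPU/GPU memory and steps/s during training."""

    def on_train_begin(self, args, state, control, **kwargs):
        self.start_time = time.monotonic()

    def on_log(self, args, state, control, logs=None, **kwargs):
        elapsed = time.monotonic() - self.start_time
        metrics = {
            "cpu_mem_gb": psutil.Process().memory_info().rss / 1e9,
            "steps_per_s": state.global_step / max(elapsed, 1e-9),
        }
        if torch.cuda.is_available():
            metrics["gpu_mem_gb"] = torch.cuda.max_memory_allocated() / 1e9
        # ... hand metrics off to the telemetry manager ...
```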
Dan Saunders
d8b0522ea0 updated sanitization logic, tests 2025-06-05 23:20:51 +00:00
Dan Saunders
1edd6b9524 update error file path sanitization function; adding more error tracking 2025-06-05 23:20:49 +00:00
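Commits d8b0522ea0 and 1edd6b9524 rework how file paths in error messages are sanitized before being reported. A regex-based sketch of the general idea, assuming Unix- and Windows-style home directories (the actual sanitization logic may differ):

```python
import re

# Hypothetical sketch: replace user-specific filesystem paths in tracebacks
# with a placeholder so error telemetry never leaks usernames or local layout.
_PATH_RE = re.compile(r"(/(?:home|Users)/[^/\s]+|[A-Z]:\\Users\\[^\\\s]+)")


def sanitize_error(text: str) -> str:
    return _PATH_RE.sub("<redacted-path>", text)


assert sanitize_error('File "/home/alice/proj/train.py", line 3') == \
    'File "<redacted-path>/proj/train.py", line 3'
```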
Dan Saunders
66c6fb56cb progress on telemetry: config load, process, model load, train start / end, error tracking 2025-06-05 22:59:50 +00:00
Dan Saunders
90b39ce112 updates 2025-06-05 22:49:15 +00:00
Dan Saunders
5afab46cc6 updates 2025-06-05 22:49:15 +00:00
Dan Saunders
bd152c6115 adding todo 2025-06-05 22:49:15 +00:00
Dan Saunders
76336743ff initial telemetry manager impl 2025-06-05 22:49:14 +00:00
Wing Lian
7909bfb076 add manual seed for flaky test_geglu_backward test (#2763) [skip ci] 2025-06-05 09:23:17 -07:00
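Commit 7909bfb076 pins a manual seed so `test_geglu_backward` stops failing intermittently on unlucky random draws. A sketch of the general pattern; the test body here is an assumption, not the repo's actual test:

```python
import torch


def test_geglu_backward():
    # Seeding makes the random inputs deterministic, so tolerance-based
    # comparisons no longer flake on borderline values.
    torch.manual_seed(0)
    x = torch.randn(8, 16, requires_grad=True)
    a, b = x.chunk(2, dim=-1)
    out = a * torch.nn.functional.gelu(b)  # GeGLU activation
    out.sum().backward()
    assert torch.isfinite(x.grad).all()
```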
Wing Lian
cb03c765a1 add uv tooling for e2e gpu tests (#2750)
* add uv tooling for e2e gpu tests

* fixes from PR feedback

* simplify check

* fix env var

* make sure to use uv for other install

* use raw_dockerfile_image

* Fix import

* fix args to experimental dockerfile image call

* use updated modal versions
2025-06-05 07:25:06 -07:00
Timofey Klyubin
4440b4a1ce remove unused field for chat_template.default for DPO training (#2755) [skip ci]
* remove unused field for chat_template.default

"messages" field present in final dataset causes issues with DPO
training otherwise

* lint and fix tests for new return value

* fix for updated expected fields for dpo

* fix test still expecting "messages" field

* chore: lint

---------

Co-authored-by: Wing Lian <wing@axolotl.ai>
2025-06-05 07:22:58 -07:00
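Commit 4440b4a1ce drops the leftover "messages" column because DPO-style trainers choke on extra fields in the final dataset. A sketch with the datasets library; the column names other than "messages" are assumptions:

```python
from datasets import Dataset

ds = Dataset.from_dict({
    "messages": [[{"role": "user", "content": "hi"}]],
    "prompt": ["hi"],
    "chosen": ["hello!"],
    "rejected": ["go away"],
})

# DPO-style trainers expect exactly prompt/chosen/rejected; a stray
# "messages" column left over from chat-template preprocessing breaks them.
ds = ds.remove_columns(["messages"])
print(ds.column_names)  # ['prompt', 'chosen', 'rejected']
```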
NanoCode012
e8e45b3441 fix: remove hqq (#2759) [skip ci] 2025-06-05 07:22:23 -07:00
Wing Lian
c67910fa6f bump hf deps (#2735) [skip ci]
* bump hf deps

* upgrade liger-kernel too

* install cce from fork for transformers fix

* fix reference to vocab size in gemma3 patch

* use padding_idx instead of pad_token_id

* remove fixed gemma3 patch

* use updated cce fork

* fix local mllama cce patches w docstring

* add test for multipack with trainer setup and fix trainer for trainer refactor upstream

* bump modal version

* guard for iterable datasets

* mllama model arch layout changed in latest transformers

* fix batch sampler with drop_last

* fix: address upstream vlm changes for lora

* fix: update references to old lora target path

* fix: remove mllama fa2 patch due to upstream fix

* fix: lora kernel patch path for multimodal models

* fix: removed mllama from quarto

* run test for came optim on 2.6.0+

* fix fsdp2 patch and remove deprecated patch

* make sure to set sequence_parallel_degree for grpo

* Add SP test for GRPO

* add sp to grpo config for trainer

* use reward_funcs as kwarg to grpo trainer

* fix the comprehension for reward funcs

* reward funcs already passed in as args

* init sp_group right before training

* fix check for adding models to SP context

* make sure to pass args to super

* upgrade deepspeed

* use updated trl and add reasoning flags for vllm

* patch the worker

---------

Co-authored-by: NanoCode012 <nano@axolotl.ai>
2025-06-05 07:20:33 -07:00
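Among its many dependency bumps, c67910fa6f wires sequence parallelism into GRPO, including initializing the SP process group right before training. A sketch of subgroup creation with torch.distributed; the grouping scheme (contiguous ranks, world size divisible by the SP degree) is an assumption:

```python
import torch.distributed as dist


def init_sp_group(sp_degree: int):
    """Hypothetical sketch: carve the world into sequence-parallel subgroups."""
    world = dist.get_world_size()
    rank = dist.get_rank()
    sp_group = None
    for start in range(0, world, sp_degree):
        ranks = list(range(start, start + sp_degree))
        # new_group must be called collectively by every rank for every group
        group = dist.new_group(ranks)
        if rank in ranks:
            sp_group = group
    return sp_group
```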
NanoCode012
787880215b fix(deepspeed): deepspeed config not being set for z3 (#2754)
* fix(deepspeed): deepspeed config not being set for z3

* fix: comments
2025-06-03 14:27:09 -07:00
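Commit 787880215b fixes the DeepSpeed config not being applied for ZeRO stage 3. One plausible reading, sketched below: the config has to reach `TrainingArguments` for ZeRO-3 to activate at all. The JSON path follows axolotl's `deepspeed_configs/` convention but is an assumption here:

```python
from transformers import TrainingArguments

# Passing the DeepSpeed JSON here is what actually activates ZeRO-3;
# if the path is dropped somewhere upstream, training silently runs
# without stage-3 sharding.
args = TrainingArguments(
    output_dir="./out",
    deepspeed="deepspeed_configs/zero3_bf16.json",  # assumed path
    bf16=True,
)
```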
NanoCode012
4b1a29c694 feat(modal): update docker tag to use torch2.6 from torch2.5 (#2749) [skip ci] 2025-06-03 14:26:07 -07:00
NanoCode012
d7fa60662e feat: add chat_template kwargs (#2694) [skip ci] 2025-06-03 14:25:26 -07:00
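Commit d7fa60662e adds support for passing extra kwargs through to the chat template. Transformers forwards unrecognized keyword arguments of `apply_chat_template` into the Jinja template context, so template-specific toggles can be set per call. A sketch; the model and the `enable_thinking` kwarg are assumptions:

```python
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("Qwen/Qwen3-0.6B")  # assumed model

# Extra keyword args are exposed to the Jinja chat template, e.g. to
# toggle template features like reasoning blocks.
text = tok.apply_chat_template(
    [{"role": "user", "content": "hi"}],
    tokenize=False,
    add_generation_prompt=True,
    enable_thinking=False,  # template-specific kwarg, an assumption here
)
```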
Dan Saunders
1d91d905c9 remove deprecated wandb env var (#2751)
* remove deprecated wandb env var

* remove os.environ wandb setting; unused loggers

* remove os.environ wandb setting; unused loggers
2025-06-03 14:04:15 -07:00
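Commit 1d91d905c9 removes a deprecated wandb env var and stops mutating `os.environ` for wandb settings. A sketch of one modern alternative, configuring reporting explicitly through `TrainingArguments` instead of env vars; the run name is hypothetical:

```python
from transformers import TrainingArguments

# Prefer explicit arguments over mutating os.environ["WANDB_..."]:
args = TrainingArguments(
    output_dir="./out",
    report_to="wandb",         # or "none" to disable reporting
    run_name="my-experiment",  # assumed run name
)
```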
mhenrhcsen
2bf61d8e25 fix abbreviation spelling error 2025-06-03 21:30:40 +02:00
mhenrhcsen
68788e419e feat: add Group Relative Policy Optimization (GRPO) to RLHF documentation 2025-06-03 21:30:40 +02:00
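For context on the GRPO docs added in 68788e419e: GRPO scores each sampled completion against the mean and standard deviation of its prompt's group of samples, rather than against a learned value function. A minimal sketch of the group-relative advantage, with made-up reward values:

```python
import torch

# Rewards for one prompt's group of 4 sampled completions (toy numbers).
rewards = torch.tensor([[1.0, 0.0, 0.5, 0.5]])  # shape: (num_prompts, group_size)

# Group-relative advantage: normalize within each group; the epsilon
# guards against zero variance when all samples score alike.
adv = (rewards - rewards.mean(dim=-1, keepdim=True)) / (
    rewards.std(dim=-1, keepdim=True) + 1e-4
)
```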
github-actions[bot]
94219f6ee8 chore: update pre-commit hooks (#2745)
* chore: update pre-commit hooks

* trigger linter when pre commit hooks are updated

* fix type checks from upgraded pre-commit

---------

Co-authored-by: djsaunde <1245942+djsaunde@users.noreply.github.com>
Co-authored-by: Wing Lian <wing@axolotl.ai>
2025-06-02 15:54:29 -07:00
Wing Lian
ecc719f5c7 add support for base image with uv (#2691) 2025-06-02 12:48:55 -07:00
NanoCode012
d5d0dc5938 fix: suppress non-axolotl logs unless it's warning or higher (#2724)
* fix: increase log level for root loggers and axolotl's

* fix: BasePlugin using wrong logger

* fix: update logger to take name from module

* feat: change logger class to AxolotlLogger to filter non-axolotl infos or below

* fix: change behavior to not disable existing loggers

* fix: update logging to respect correct env

* chore: fix comment

* fix: suppress accelerate log to LOG_LEVEL if not set

---------

Co-authored-by: salman <salman.mohammadi@outlook.com>
2025-05-31 12:13:43 +07:00
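Commit d5d0dc5938 quiets third-party loggers so only axolotl's own messages show below WARNING. A minimal stdlib sketch of the same idea; the filter class name is an assumption, not the PR's actual `AxolotlLogger`:

```python
import logging


class AxolotlOnlyFilter(logging.Filter):
    """Hypothetical sketch: drop INFO/DEBUG records from non-axolotl loggers."""

    def filter(self, record: logging.LogRecord) -> bool:
        if record.name.startswith("axolotl"):
            return True  # axolotl's own logs pass at any level
        return record.levelno >= logging.WARNING  # others: warnings and up only


handler = logging.StreamHandler()
handler.addFilter(AxolotlOnlyFilter())
logging.basicConfig(level=logging.DEBUG, handlers=[handler])
```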
NanoCode012
5e86c35322 fix(log): remove duplicate merge_lora param (#2742) [skip ci] 2025-05-31 12:13:31 +07:00
NanoCode012
6778856804 Fix: RL base feature parity (#2133)
* feat: add num_proc and load from cache for rl mapping

* fix: refactor sft and rl trainer to set same base args

* feat: add report_to to set run name

* fix: consolidate handling of fp16, bf16, tf32 kwarg

* chore: consolidate eval_strat, loraplus, lr sched, max_length

* fix: deprecate old types

* fix: adding missing Any

* fix: max_steps incorrectly set

* fix: remove unnecessary datacollator kwarg insert and pop

* fix: update default max_steps

* fix: add missing weight_decay handling

* fix: ignore max_length for grpo

* feat: update CI on trainer_builder

* fix: comments

* improve handling of warmup/logging steps

* use transformers default for logging steps, not None

* fix: remove redundant override

* fix: lint

* feat: allow custom optim for rl methods

* fix: duplicate optim setting

* fix(test): set sequence_parallel_degree default in base cfg

* feat: add handling for seed and SP/ring-attn config

* chore: add back return typing from rebase

* fix(test): use RLType directly to skip needing to validate

* feat: split training builder into sub modules

* fix: remove deprecated clause

* chore: add missing config to doc

* fix: update quarto autodoc

* fix: import path for trainer builder and submodules

* fix: remove redundant configs from rebase mistake

* chore: simplify dynamo check

* fix: optimizer_cls_and_kwargs to be passed into trainer_kwargs

* fix: add missing rex from rebase

* fix: move pop optimizer_cls_and_kwargs

* fix: pop optimizer cls in rl too

* fix: leftover bug from rebase

* fix: update handling of trainer_cls in RL

* fix: address pr feedback

* feat: call hook_pre_create_trainer for rl

* chore: lint

* fix: return notimplemented for ppo

* feat: moved torch compile to base and refactor collator setting

* chore: remove unused importlib.util import

* fix: optimizer cls not being popped

* feat: move epoch setting to base

* fix: catch unhandled custom optimizer

* fix: remove duplicate lora plus setting

* chore: refactor if condition

* chore: refactor set_base_training_args into smaller modules

* fix: address TrainerBuilderBase class variables to instance var

* fix: add handling for beta3 and epsilon2

* fix: change to pass dict via arg instead of updating dict

* chore: simplify if condition

* fix: force access to lr & weight decay so missing values raise an early error

* fix: remove log sweep

* chore: refactor if condition

* fix: address renamed cfg

* fix: improve handling of cosine hyp

* fix: remove unused params

* chore: refactor

* chore: clarify doc safetensors

* fix: update import path to be unified following comments

* fix: duplicate kwargs passed

* feat: return separate trainer_kwargs

* chore: refactor

* chore: refactor based on comments

* chore: refactor based on comments

* fix: move gpustats callback to base

* chore: create trainer_cls_args first based on comments

* fix: ipo label smoothing passed incorrectly

* feat: add optimizer parity for RL methods with test

* feat: add parity for optimizer in RM/PRM and add test

* fix: remove redundant function override for orpo/cpo batch metrics

* fix: improve handling of dpo_label_smoothing and merge issue

* fix: test fixture returning wrong field

* fix: address avoid direct modify fixture

* chore: minor refactor

* Revert "chore: refactor"

This reverts commit 99c8859eb0.

* feat: rename trainer_builder to builders

---------

Co-authored-by: Wing Lian <wing@axolotl.ai>
2025-05-30 11:21:47 +07:00
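Commit 6778856804 refactors the SFT and RL trainer builders so shared arguments (fp16/bf16/tf32, schedulers, max_length, optimizers) are set once in a base class, later renamed from trainer_builder to builders. A structural sketch of the shared-args pattern; all class names and config keys here are hypothetical:

```python
class BaseTrainerBuilder:
    """Hypothetical sketch of the shared-args pattern (not axolotl's real classes)."""

    def __init__(self, cfg: dict):
        self.cfg = cfg

    def set_base_training_args(self) -> dict:
        # Precision, scheduler, and optimizer handling live here once,
        # instead of being duplicated across the SFT and RL builders.
        return {
            "bf16": bool(self.cfg.get("bf16")),
            "fp16": bool(self.cfg.get("fp16")),
            "lr_scheduler_type": self.cfg.get("lr_scheduler", "cosine"),
        }


class SFTTrainerBuilder(BaseTrainerBuilder):
    def build_args(self) -> dict:
        return {**self.set_base_training_args(), "group_by_length": True}


class RLTrainerBuilder(BaseTrainerBuilder):
    def build_args(self) -> dict:
        # RL methods layer their own knobs (e.g. beta) on the shared base.
        return {**self.set_base_training_args(), "beta": self.cfg.get("beta", 0.1)}
```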
Wing Lian
ec4ebfd997 Add a few items to faq (#2734)
* Add a few items to faq

* formatting

* chore: lint
2025-05-28 16:20:19 -04:00