axolotl

Author	SHA1	Message	Date
Sunny Liu	cb121ab91b	update test_mixtral [skip e2e]	2025-04-21 16:27:26 -04:00
Sunny Liu	b59640a4c7	amend model loading for hqq + fix hqq version	2025-04-21 15:53:43 -04:00
Sunny Liu	f0a189131b	amend model loading for hqq + fix hqq version	2025-04-21 15:53:29 -04:00
Sunny Liu	c8fb5baad6	amend unittests pt2	2025-04-21 13:28:52 -04:00
Sunny Liu	9be971d47c	update test_models.py to conform to new quantization config	2025-04-21 11:34:37 -04:00
Sunny Liu	ffd4ef1ece	nit	2025-04-21 11:28:59 -04:00
Sunny Liu	320aff1867	update config doc	2025-04-21 10:59:04 -04:00
Sunny Liu	ac24eba2ac	include HQQLinear in find target_linear	2025-04-21 10:36:39 -04:00
Sunny Liu	8a5ad8aee3	typo	2025-04-21 10:36:39 -04:00
Sunny Liu	843b50fdaa	rigorous qlora validation	2025-04-21 10:36:39 -04:00
Sunny Liu	098ffcc5a2	removed redundant hqq config validation	2025-04-21 10:36:39 -04:00
Sunny Liu	ba8e29c841	quantization config refactoring - better integration	2025-04-21 10:36:39 -04:00
Sunny Liu	143b2e082c	nit [skip e2e]	2025-04-21 10:36:39 -04:00
Sunny Liu	aba484de97	WIP quant config refactor	2025-04-21 10:36:39 -04:00
Sunny Liu	f6f5f89c6d	fix more typo	2025-04-21 10:36:39 -04:00
Sunny Liu	8926fe9981	lax config requirement - qlora + hqq	2025-04-21 10:36:39 -04:00
Sunny Liu	987c5217a0	fix typos	2025-04-21 10:36:39 -04:00
Sunny Liu	feaef03cb9	didn't realise model_config.quantization_config is just a regular dict	2025-04-21 10:36:39 -04:00
Sunny Liu	ba5d917845	add e2e test for hqq training	2025-04-21 10:36:39 -04:00
Sunny Liu	0e9b060b4d	add doc + requirement for hqq	2025-04-21 10:36:39 -04:00
Sunny Liu	0c40d12a18	more comprehensive hqq config options	2025-04-21 10:36:39 -04:00
Sunny Liu	f55b3c805b	hqq_nbits triggers prepare_model_for_kbit_training	2025-04-21 10:36:39 -04:00
Sunny Liu	a64601f957	fix wrong variable name	2025-04-21 10:36:39 -04:00
Sunny Liu	eb7bc70b99	fix dumb mistake	2025-04-21 10:36:39 -04:00
Sunny Liu	db6c76b147	forgot to return data in check	2025-04-21 10:36:39 -04:00
Sunny Liu	99730ce40a	hqq integration	2025-04-21 10:36:39 -04:00
Wing Lian	7651550850	make sure to download fixtures for kd test (#2541) * make sure to download fixtures for kd test * use same alpaca dataset	2025-04-21 10:31:50 -04:00
Wing Lian	341e95aac9	prevent rate limiting to hf when using dispatch batches (#2536) [skip ci]	2025-04-21 10:31:35 -04:00
Catgat	b882dfb63f	Fixed Rex Scheduler Warm Up (#2535) [skip ci] * Fixed Rex Scheduler Warm Up * chore: lint --------- Co-authored-by: Wing Lian <wing@axolotl.ai>	2025-04-21 10:30:55 -04:00
Wing Lian	b640db1dbc	don't run multigpu tests twice, run SP in separate test (#2542) * don't run multigpu tests twice, run SP in separate test * fix multiline	2025-04-21 10:24:13 -04:00
Chiwan Park	4ce469d32e	fix: upgrade liger to 0.5.8 and use native Gemma3 patches (#2527) * fix: upgrade liger to 0.5.8 and use native Gemma3 patches * fix: make lint happy * doc: update Liger Kernel FLCE support for Gemma 3	2025-04-18 09:57:40 -07:00
Wing Lian	60a8f0958d	zero val fix for beta (#2538)	2025-04-17 17:27:19 -07:00
NanoCode012	9da730d6a4	fix(doc): cut cross entropy installation instructions broken in qmd (#2532)	2025-04-16 15:02:51 -07:00
NanoCode012	32637fad00	fix: preprocess yielding whole dataset to each worker (#2503) [skip ci]	2025-04-16 15:02:35 -07:00
Dan Saunders	f776f889a1	adding codecov reporting (#2372) [skip ci] * adding codecov reporting * update codecov-action to v5 * fix --------- Co-authored-by: Dan Saunders <dan@axolotl.ai>	2025-04-16 15:02:17 -07:00
Wing Lian	69eda209a6	re-enable DS zero3 ci with updated transformers (#2533)	2025-04-16 14:48:40 -07:00
Dan Saunders	b8c633aa97	batch api HF adapter for ring-flash-attn; cleanup and improvements (#2520) * batch api HF adapter for ring-flash-attn; cleanup and improvements * update * adding all batch ring-flash-attn methods via single adapter * removing pad_to_sequence_len=False for now * fix * updating docs to include batch SP * review comments * fixes for batch API funcs, simplify * fixes * fix * updates * add batch_zigzag smoke test	2025-04-16 13:50:48 -04:00
NanoCode012	682a9cf79b	Fix: add delinearization and make qlora work with fsdp2 (#2515) * fixes for delinearization, and make qlora work with fsdp2 * Add back mistakenly removed lm_eval * typo [skip ci] * patch evals for torch.compile + fsdp2 * also check torch_compile w fsdp2 * lots of fixes for flex attn with llama4 * fix patch check and patch llama4 too * attempt to make the patches stick * use transformers 4.51.2 * update configs and README for llama4 * remove torch.compile for CI test * cleanup any existing singletons * set singleton cache to None instead of deleting * use importlib reload with monkeypatch * don't worry about transformers version, mark inputs with grads, fix regex * make sure embeds aren't on cpu * logging and mem improvements * vllm version and add to docker, make sure to save processor on conversion * fix ambiguous tensor bool check * fix vllm to not use v1, upgrade hf transformers * fix tests * make flex_attn_compile_kwargs configurable, since this depends on model params --------- Co-authored-by: Wing Lian <wing@axolotl.ai> Co-authored-by: Salman Mohammadi <salman.mohammadi@outlook.com>	2025-04-15 23:31:39 -07:00
NanoCode012	271b24cccc	feat: update cce to latest (#2521)	2025-04-15 22:17:10 -07:00
Wing Lian	198d775d6d	make sure the all of the model is on the same device, so this test will pass on multigpu (#2524) [skip ci]	2025-04-15 22:15:42 -07:00
NanoCode012	e4307fb7d7	feat: add examples for deepcoder (#2517)	2025-04-12 07:25:23 -07:00
Wing Lian	dd8bad06d0	remove strict=false from example yamls [skip ci] (#2523) [skip ci]	2025-04-12 07:25:11 -07:00
Wing Lian	de8a625dd7	make e2e tests a bit faster by reducing test split size (#2522) [skip ci] * [ci] make e2e tests a bit faster by reducing test split size * use 10% split of alpaca dataset to speed up dataset loading/tokenization * reduce gas 4->2 for most e2e tests * increase val set size for packing	2025-04-12 07:24:43 -07:00
NanoCode012	51267ded04	chore: update doc links (#2509) * chore: update doc links * fix: address pr feedback	2025-04-11 09:53:18 -04:00
NanoCode012	756a0559c1	feat(doc): explain deepspeed configs (#2514) [skip ci] * feat(doc): explain deepspeed configs * fix: add fetch configs	2025-04-11 09:52:43 -04:00
NanoCode012	9a8e3e9c7b	Feat(examples): add deepcogito (#2516) [skip ci] * feat: add examples for deepcogito * fix: reduce num evals per epoch * fix: reduce num epochs	2025-04-11 09:52:23 -04:00
Wing Lian	7e7180fa10	add mocks for loading datasets in cli train tests (#2497) [skip ci] * add mocks for loading datasets in cli train tests * Apply suggestions from code review to fix patched module for preprocess Co-authored-by: NanoCode012 <nano@axolotl.ai> --------- Co-authored-by: NanoCode012 <nano@axolotl.ai>	2025-04-11 09:51:59 -04:00
Sung Ching Liu	22c562533d	Update rlhf.qmd (#2519) Fix typo in command that spawns a vllm server, should be `axolotl vllm-serve` not `axolotl vllm_serve`	2025-04-10 11:33:09 -04:00
NanoCode012	16823e1de6	feat: add CNAME (#2513)	2025-04-10 12:34:25 +07:00
NanoCode012	e0420b3528	fix: allow merge lora on pre-quantized model (#2511) * fix: allow merge lora on pre-quantized model * fix: remove unused sections per comment	2025-04-09 14:01:42 -04:00

2027 Commits