Wing Lian
82b16bd040
split llmcompressor from vllm checks
2025-04-30 17:21:53 -04:00
Wing Lian
fd5c985038
additional fixes for docker and saving compressed
2025-04-30 17:21:53 -04:00
Rahul Tuli
5246aebc04
Fix: Test
...
Signed-off-by: Rahul Tuli <rtuli@redhat.com >
2025-04-30 17:21:53 -04:00
Rahul Tuli
f4bcc71c86
Apply patch from @winglian
...
Signed-off-by: Rahul Tuli <rtuli@redhat.com >
2025-04-30 17:21:53 -04:00
Rahul Tuli
3a9e172272
Add: line about further optimizations using llmcompressor
...
Signed-off-by: Rahul Tuli <rtuli@redhat.com >
2025-04-30 17:21:53 -04:00
Rahul Tuli
372f0e137b
Address Review Comments:
...
* deleted redundant docs/llm_compressor.qmd
* incorporated feedback in integration README.md
* added llmcompressor integration to docs/custom_integrations.qmd
Signed-off-by: Rahul Tuli <rtuli@redhat.com >
2025-04-30 17:21:52 -04:00
Rahul Tuli
17dffec71d
Add: .qmd file
2025-04-30 17:21:52 -04:00
Rahul Tuli
3a8b637598
Tests, Style, Updates
2025-04-30 17:21:52 -04:00
Rahul Tuli
12cd09e6f5
Rebase and updates!
2025-04-30 17:21:52 -04:00
Rahul Tuli
fe82f62248
Add: llm_compressor integration documentation
2025-04-30 17:21:52 -04:00
Rahul Tuli
db31d7ad22
Move: LLMCompressorPlugin into it's own submodule
2025-04-30 17:21:52 -04:00
Rahul Tuli
eb7f2aa4b9
Update model config
2025-04-30 17:21:51 -04:00
Rahul Tuli
f80e36ddd2
Use: absolute import
2025-04-30 17:21:51 -04:00
Rahul Tuli
412d2ec6d0
Rename: sft.yaml to sparse-finetuning.yaml
2025-04-30 17:21:51 -04:00
Rahul Tuli
50fc5e6984
Add: llcompressor installable
2025-04-30 17:21:51 -04:00
Rahul Tuli
83a88b745f
Address review comments from @markurtz
2025-04-30 17:21:51 -04:00
Rahul Tuli
8855bb115f
Apply suggestions from @markurtz
...
Co-authored-by: Mark Kurtz <mark.j.kurtz@gmail.com >
2025-04-30 17:21:51 -04:00
Rahul Tuli
ef9543b371
Update llmcompressor version to latest
2025-04-30 17:21:51 -04:00
Rahul Tuli
25e701e885
Revert: TODO's
2025-04-30 17:21:50 -04:00
Rahul Tuli
891a21e599
Use: warning over warn
2025-04-30 17:21:50 -04:00
Rahul Tuli
8beb2f27ad
pre commit hooks
2025-04-30 17:21:50 -04:00
Rahul Tuli
56ba66b60f
Add:llmcompressor instalable
2025-04-30 17:21:50 -04:00
Rahul Tuli
13d4b865d6
Update: review comments!
2025-04-30 17:21:50 -04:00
Rahul Tuli
3da866b2b9
Add: SFTPlugin with llmcompressor
2025-04-30 17:21:50 -04:00
Wing Lian
24ff5f53f8
additional args for grpo config/trainer ( #2598 )
2025-04-30 13:11:12 -04:00
Wing Lian
5e949eaa07
replace zero_only with simpler if statement ( #2592 )
2025-04-30 13:11:03 -04:00
Wing Lian
89ca14d9a0
ensure we pass axolotl extras to the Dockerfile so vllm is included in shipped images ( #2599 )
2025-04-30 11:35:45 -04:00
Wing Lian
8446b4ad28
don't automatically enable lora kernels for RL training ( #2600 )
2025-04-30 11:06:50 -04:00
Wing Lian
fc79606b6d
only import vllm serve cli if its being called ( #2597 ) [skip ci]
2025-04-30 09:11:25 -04:00
Wing Lian
baeb00231b
Handle other reasoning trace dataset formats ( #2591 )
...
* Handle other reasoning trace dataset formats
* rename var to improve readability
* chore: refactor with comments
---------
Co-authored-by: NanoCode012 <nano@axolotl.ai >
2025-04-30 03:32:55 -04:00
Wing Lian
2413688b08
upload the deepspeed json to wandb ( #2593 ) [skip ci]
2025-04-30 03:32:44 -04:00
NanoCode012
5bb1f3da56
feat: add qwen3 moe block for ds3 ( #2596 ) [skip ci]
2025-04-30 03:32:23 -04:00
Wing Lian
a21b9cc472
patch to convert LR from tensor to float when using DS ( #2595 ) [skip ci]
2025-04-30 03:31:57 -04:00
Aleksandr Dremov
41a1ec0c95
Plugins create_lr_scheduler support ( #2584 )
...
* lr_scheduler support
* fix
* Update scheduler.py
* Update scheduler.py
* cfg handling
* black
* remove debug
* remove adding the axolotl cfg to the scheduler mixin
---------
Co-authored-by: Wing Lian <wing@axolotl.ai >
2025-04-29 17:08:30 -04:00
Dan Saunders
ecac731922
auto-enable lora kernels where possible ( #2589 )
...
* auto-enable lora kernels where possible
* test
* revert change to example yaml
* naming
* remove print
* slight logic change
2025-04-29 16:18:49 -04:00
NanoCode012
742fef4200
fix(doc): key used to point to url in multimodal doc ( #2575 ) [skip ci]
2025-04-29 15:10:59 -04:00
Wing Lian
a39caf8824
bump vllm==0.8.5 for qwen3 support ( #2583 ) [skip ci]
2025-04-29 15:10:40 -04:00
Wing Lian
07e4f2e25b
support for qwen3 with lora kernels ( #2588 )
...
* support for qwen3 with lora kernels
* fix patch
* typo
2025-04-29 15:02:49 -04:00
Dan Saunders
c7d07de6b4
Fix eval + add smoke test ( #2586 )
...
* fix evaluate CLI
* add smoke test
* fix naming
* lint
2025-04-29 12:58:54 -04:00
Wing Lian
6565ae85d8
set config on the PluginManager for callback access ( #2587 )
2025-04-29 12:05:44 -04:00
Wing Lian
80b4edb4a7
Post release fixes ( #2581 )
...
* fix missing kwarg on child
* make the runpod test shorter
* update docs
* rename runpod test json file
* typing fixes and ordering of doc
2025-04-29 10:01:38 -04:00
Wing Lian
fedbcc0254
remove torch 2.4.1 CI as part of support deprecation ( #2582 )
2025-04-29 08:28:32 -04:00
Wing Lian
8175896ada
add dev tag for v0.10.0.dev0 ( #2580 )
2025-04-28 20:30:14 -04:00
Wing Lian
14d670dbf0
v0.9.0 release ( #2578 )
ci-cd / build-axolotl (<nil>, 124, 12.4.1, 3.11, 2.4.1) (push) Has been cancelled
ci-cd / build-axolotl (<nil>, 124, 12.4.1, 3.11, 2.5.1) (push) Has been cancelled
ci-cd / build-axolotl (vllm, 124, 12.4.1, true, 3.11, 2.6.0) (push) Has been cancelled
ci-cd / build-axolotl (vllm, 126, 12.6.3, 3.11, 2.7.0) (push) Has been cancelled
publish pypi / Create Release (push) Has been cancelled
ci-cd / build-axolotl-cloud (<nil>, 124, 12.4.1, 3.11, 2.4.1) (push) Has been cancelled
ci-cd / build-axolotl-cloud (<nil>, 124, 12.4.1, 3.11, 2.5.1) (push) Has been cancelled
ci-cd / build-axolotl-cloud (<nil>, 124, 12.4.1, true, 3.11, 2.6.0) (push) Has been cancelled
ci-cd / build-axolotl-cloud (<nil>, 126, 12.6.3, 3.11, 2.7.0) (push) Has been cancelled
ci-cd / build-axolotl-cloud-no-tmux (<nil>, 124, 12.4.1, 3.11, 2.6.0) (push) Has been cancelled
publish pypi / Upload release to PyPI (push) Has been cancelled
v0.9.0
2025-04-28 18:23:17 -04:00
Wing Lian
2d77165dc0
automatically split out reasoning trace from dataset ( #2579 )
...
* automatically split out reasoning trace from dataset
* chore: lint
* fix import
2025-04-28 18:23:03 -04:00
Wing Lian
63b17e3109
chat template and example for qwen3 ( #2577 )
2025-04-28 15:09:41 -04:00
NanoCode012
1178a15ede
Feat: Add qwen3 and CCE for qwen family ( #2518 )
2025-04-28 12:18:46 -04:00
Wing Lian
c513487d1a
support val_set_size for splitting test split from train with DPO ( #2572 )
2025-04-28 12:12:15 -04:00
Dan Saunders
dda95e6c40
add preview-docs workflow ( #2432 )
...
* add preview-docs workflow
* update preview-docs workflow
* use correct publish-dir
* install deps prior to docs build
* use correct publish-dir
* use quarto publish with netlify target
* adding _publish.yml
* fix
* fix
* fix
* remove unused file
* fix naming
---------
Co-authored-by: Dan Saunders <dan@axolotl.ai >
2025-04-28 11:20:46 -04:00
NanoCode012
7099343c56
feat: add eos_tokens and train_on_eot for chat_template EOT parsing ( #2364 )
...
* feat: add eos_tokens and train_on_eot for chat_template EOT parsing
* fix: comments
* chore: add some examples of tokens
* feat: add new potential errors for chat_template to faq
* feat: add examples for EOT handling
* fix: change error to warning for missing EOS
* fix: warning typo
* feat: add tests for eot token handling
* fix: remove broken caplog capture in test
* fix: chattemplate strategy with kd missing eot changes
2025-04-28 10:11:20 -04:00