Wing Lian
|
6affbb1f85
|
move import of llmcompressor to reset session inside test
|
2025-04-30 18:10:44 -04:00 |
|
Wing Lian
|
0ed4b4c310
|
make sure to reset the session after each test
|
2025-04-30 17:21:53 -04:00 |
|
Wing Lian
|
f4a0f496a0
|
move decorator to test method instead of class
|
2025-04-30 17:21:53 -04:00 |
|
Wing Lian
|
82b16bd040
|
split llmcompressor from vllm checks
|
2025-04-30 17:21:53 -04:00 |
|
Wing Lian
|
fd5c985038
|
additional fixes for docker and saving compressed
|
2025-04-30 17:21:53 -04:00 |
|
Rahul Tuli
|
5246aebc04
|
Fix: Test
Signed-off-by: Rahul Tuli <rtuli@redhat.com>
|
2025-04-30 17:21:53 -04:00 |
|
Rahul Tuli
|
f4bcc71c86
|
Apply patch from @winglian
Signed-off-by: Rahul Tuli <rtuli@redhat.com>
|
2025-04-30 17:21:53 -04:00 |
|
Rahul Tuli
|
3a9e172272
|
Add: line about further optimizations using llmcompressor
Signed-off-by: Rahul Tuli <rtuli@redhat.com>
|
2025-04-30 17:21:53 -04:00 |
|
Rahul Tuli
|
372f0e137b
|
Address Review Comments:
* deleted redundant docs/llm_compressor.qmd
* incorporated feedback in integration README.md
* added llmcompressor integration to docs/custom_integrations.qmd
Signed-off-by: Rahul Tuli <rtuli@redhat.com>
|
2025-04-30 17:21:52 -04:00 |
|
Rahul Tuli
|
17dffec71d
|
Add: .qmd file
|
2025-04-30 17:21:52 -04:00 |
|
Rahul Tuli
|
3a8b637598
|
Tests, Style, Updates
|
2025-04-30 17:21:52 -04:00 |
|
Rahul Tuli
|
12cd09e6f5
|
Rebase and updates!
|
2025-04-30 17:21:52 -04:00 |
|
Rahul Tuli
|
fe82f62248
|
Add: llm_compressor integration documentation
|
2025-04-30 17:21:52 -04:00 |
|
Rahul Tuli
|
db31d7ad22
|
Move: LLMCompressorPlugin into it's own submodule
|
2025-04-30 17:21:52 -04:00 |
|
Rahul Tuli
|
eb7f2aa4b9
|
Update model config
|
2025-04-30 17:21:51 -04:00 |
|
Rahul Tuli
|
f80e36ddd2
|
Use: absolute import
|
2025-04-30 17:21:51 -04:00 |
|
Rahul Tuli
|
412d2ec6d0
|
Rename: sft.yaml to sparse-finetuning.yaml
|
2025-04-30 17:21:51 -04:00 |
|
Rahul Tuli
|
50fc5e6984
|
Add: llcompressor installable
|
2025-04-30 17:21:51 -04:00 |
|
Rahul Tuli
|
83a88b745f
|
Address review comments from @markurtz
|
2025-04-30 17:21:51 -04:00 |
|
Rahul Tuli
|
8855bb115f
|
Apply suggestions from @markurtz
Co-authored-by: Mark Kurtz <mark.j.kurtz@gmail.com>
|
2025-04-30 17:21:51 -04:00 |
|
Rahul Tuli
|
ef9543b371
|
Update llmcompressor version to latest
|
2025-04-30 17:21:51 -04:00 |
|
Rahul Tuli
|
25e701e885
|
Revert: TODO's
|
2025-04-30 17:21:50 -04:00 |
|
Rahul Tuli
|
891a21e599
|
Use: warning over warn
|
2025-04-30 17:21:50 -04:00 |
|
Rahul Tuli
|
8beb2f27ad
|
pre commit hooks
|
2025-04-30 17:21:50 -04:00 |
|
Rahul Tuli
|
56ba66b60f
|
Add:llmcompressor instalable
|
2025-04-30 17:21:50 -04:00 |
|
Rahul Tuli
|
13d4b865d6
|
Update: review comments!
|
2025-04-30 17:21:50 -04:00 |
|
Rahul Tuli
|
3da866b2b9
|
Add: SFTPlugin with llmcompressor
|
2025-04-30 17:21:50 -04:00 |
|
Wing Lian
|
24ff5f53f8
|
additional args for grpo config/trainer (#2598)
|
2025-04-30 13:11:12 -04:00 |
|
Wing Lian
|
5e949eaa07
|
replace zero_only with simpler if statement (#2592)
|
2025-04-30 13:11:03 -04:00 |
|
Wing Lian
|
89ca14d9a0
|
ensure we pass axolotl extras to the Dockerfile so vllm is included in shipped images (#2599)
|
2025-04-30 11:35:45 -04:00 |
|
Wing Lian
|
8446b4ad28
|
don't automatically enable lora kernels for RL training (#2600)
|
2025-04-30 11:06:50 -04:00 |
|
Wing Lian
|
fc79606b6d
|
only import vllm serve cli if its being called (#2597) [skip ci]
|
2025-04-30 09:11:25 -04:00 |
|
Wing Lian
|
baeb00231b
|
Handle other reasoning trace dataset formats (#2591)
* Handle other reasoning trace dataset formats
* rename var to improve readability
* chore: refactor with comments
---------
Co-authored-by: NanoCode012 <nano@axolotl.ai>
|
2025-04-30 03:32:55 -04:00 |
|
Wing Lian
|
2413688b08
|
upload the deepspeed json to wandb (#2593) [skip ci]
|
2025-04-30 03:32:44 -04:00 |
|
NanoCode012
|
5bb1f3da56
|
feat: add qwen3 moe block for ds3 (#2596) [skip ci]
|
2025-04-30 03:32:23 -04:00 |
|
Wing Lian
|
a21b9cc472
|
patch to convert LR from tensor to float when using DS (#2595) [skip ci]
|
2025-04-30 03:31:57 -04:00 |
|
Aleksandr Dremov
|
41a1ec0c95
|
Plugins create_lr_scheduler support (#2584)
* lr_scheduler support
* fix
* Update scheduler.py
* Update scheduler.py
* cfg handling
* black
* remove debug
* remove adding the axolotl cfg to the scheduler mixin
---------
Co-authored-by: Wing Lian <wing@axolotl.ai>
|
2025-04-29 17:08:30 -04:00 |
|
Dan Saunders
|
ecac731922
|
auto-enable lora kernels where possible (#2589)
* auto-enable lora kernels where possible
* test
* revert change to example yaml
* naming
* remove print
* slight logic change
|
2025-04-29 16:18:49 -04:00 |
|
NanoCode012
|
742fef4200
|
fix(doc): key used to point to url in multimodal doc (#2575) [skip ci]
|
2025-04-29 15:10:59 -04:00 |
|
Wing Lian
|
a39caf8824
|
bump vllm==0.8.5 for qwen3 support (#2583) [skip ci]
|
2025-04-29 15:10:40 -04:00 |
|
Wing Lian
|
07e4f2e25b
|
support for qwen3 with lora kernels (#2588)
* support for qwen3 with lora kernels
* fix patch
* typo
|
2025-04-29 15:02:49 -04:00 |
|
Dan Saunders
|
c7d07de6b4
|
Fix eval + add smoke test (#2586)
* fix evaluate CLI
* add smoke test
* fix naming
* lint
|
2025-04-29 12:58:54 -04:00 |
|
Wing Lian
|
6565ae85d8
|
set config on the PluginManager for callback access (#2587)
|
2025-04-29 12:05:44 -04:00 |
|
Wing Lian
|
80b4edb4a7
|
Post release fixes (#2581)
* fix missing kwarg on child
* make the runpod test shorter
* update docs
* rename runpod test json file
* typing fixes and ordering of doc
|
2025-04-29 10:01:38 -04:00 |
|
Wing Lian
|
fedbcc0254
|
remove torch 2.4.1 CI as part of support deprecation (#2582)
|
2025-04-29 08:28:32 -04:00 |
|
Wing Lian
|
8175896ada
|
add dev tag for v0.10.0.dev0 (#2580)
|
2025-04-28 20:30:14 -04:00 |
|
Wing Lian
|
14d670dbf0
|
v0.9.0 release (#2578)
ci-cd / build-axolotl (<nil>, 124, 12.4.1, 3.11, 2.4.1) (push) Has been cancelled
ci-cd / build-axolotl (<nil>, 124, 12.4.1, 3.11, 2.5.1) (push) Has been cancelled
ci-cd / build-axolotl (vllm, 124, 12.4.1, true, 3.11, 2.6.0) (push) Has been cancelled
ci-cd / build-axolotl (vllm, 126, 12.6.3, 3.11, 2.7.0) (push) Has been cancelled
publish pypi / Create Release (push) Has been cancelled
ci-cd / build-axolotl-cloud (<nil>, 124, 12.4.1, 3.11, 2.4.1) (push) Has been cancelled
ci-cd / build-axolotl-cloud (<nil>, 124, 12.4.1, 3.11, 2.5.1) (push) Has been cancelled
ci-cd / build-axolotl-cloud (<nil>, 124, 12.4.1, true, 3.11, 2.6.0) (push) Has been cancelled
ci-cd / build-axolotl-cloud (<nil>, 126, 12.6.3, 3.11, 2.7.0) (push) Has been cancelled
ci-cd / build-axolotl-cloud-no-tmux (<nil>, 124, 12.4.1, 3.11, 2.6.0) (push) Has been cancelled
publish pypi / Upload release to PyPI (push) Has been cancelled
v0.9.0
|
2025-04-28 18:23:17 -04:00 |
|
Wing Lian
|
2d77165dc0
|
automatically split out reasoning trace from dataset (#2579)
* automatically split out reasoning trace from dataset
* chore: lint
* fix import
|
2025-04-28 18:23:03 -04:00 |
|
Wing Lian
|
63b17e3109
|
chat template and example for qwen3 (#2577)
|
2025-04-28 15:09:41 -04:00 |
|
NanoCode012
|
1178a15ede
|
Feat: Add qwen3 and CCE for qwen family (#2518)
|
2025-04-28 12:18:46 -04:00 |
|