Commit Graph

  • 3035676ef0 Built site for gh-pages Quarto GHA Workflow Runner 2025-05-01 17:26:55 +00:00
  • bcb59c70e2 automatically set pad_to_sequence_len when use packing (#2607) Wing Lian 2025-05-01 13:24:38 -04:00
  • 25e07f68f3 Built site for gh-pages Quarto GHA Workflow Runner 2025-05-01 17:23:41 +00:00
  • 6a3e6f8c53 fix: run preview-docs only when md/qmd changes (#2606) NanoCode012 2025-05-02 00:21:28 +07:00
  • 1a22d16842 handle empty offset for quant state lora-quant-state-offset Wing Lian 2025-05-01 13:01:00 -04:00
  • 967bfff5fc Built site for gh-pages Quarto GHA Workflow Runner 2025-05-01 17:00:16 +00:00
  • fee3c13bb5 Logging config for colab (#2611) Wing Lian 2025-05-01 12:58:00 -04:00
  • c6274d0582 Built site for gh-pages Quarto GHA Workflow Runner 2025-05-01 16:27:24 +00:00
  • 996fc124e5 Add: Sparse Finetuning Integration with llmcompressor (#2479) Rahul Tuli 2025-05-01 11:25:16 -05:00
  • 7cec02149d Built site for gh-pages Quarto GHA Workflow Runner 2025-05-01 13:43:40 +00:00
  • e963990ad7 add missing __init__ for lr monkeypatch fix (#2609) Wing Lian 2025-05-01 09:41:32 -04:00
  • 96db0f1554 Built site for gh-pages Quarto GHA Workflow Runner 2025-05-01 01:02:42 +00:00
  • c3f2b1c5c2 Add num_completions_to_print for trl and grpo (#2604) Dhruv Mullick 2025-04-30 19:00:30 -06:00
  • 667cc1a482 Built site for gh-pages Quarto GHA Workflow Runner 2025-04-30 22:29:48 +00:00
  • 6ba5c0ed2c use latest hf-xet and don't install vllm for torch 2.7.0 (#2603) Wing Lian 2025-04-30 18:27:39 -04:00
  • 6affbb1f85 move import of llmcompressor to reset session inside test llmcompressor-sft-v2 Wing Lian 2025-04-30 18:10:44 -04:00
  • 0ed4b4c310 make sure to reset the session after each test Wing Lian 2025-04-30 17:21:25 -04:00
  • f4a0f496a0 move decorator to test method instead of class Wing Lian 2025-04-30 14:38:36 -04:00
  • 82b16bd040 split llmcompressor from vllm checks Wing Lian 2025-04-29 08:35:06 -04:00
  • fd5c985038 additional fixes for docker and saving compressed Wing Lian 2025-04-28 13:06:59 -04:00
  • 5246aebc04 Fix: Test Rahul Tuli 2025-04-28 09:18:26 -04:00
  • f4bcc71c86 Apply patch from @winglian Rahul Tuli 2025-04-26 15:30:27 -04:00
  • 3a9e172272 Add: line about further optimizations using llmcompressor Rahul Tuli 2025-04-24 14:06:25 -04:00
  • 372f0e137b Address Review Comments: * deleted redundant docs/llm_compressor.qmd * incorporated feedback in integration README.md * added llmcompressor integration to docs/custom_integrations.qmd Rahul Tuli 2025-04-23 18:00:00 -04:00
  • 17dffec71d Add: .qmd file Rahul Tuli 2025-04-21 20:40:49 -04:00
  • 3a8b637598 Tests, Style, Updates Rahul Tuli 2025-04-21 20:33:59 -04:00
  • 12cd09e6f5 Rebase and updates! Rahul Tuli 2025-04-17 17:19:59 -04:00
  • fe82f62248 Add: llm_compressor integration documentation Rahul Tuli 2025-04-09 01:03:45 +00:00
  • db31d7ad22 Move: LLMCompressorPlugin into it's own submodule Rahul Tuli 2025-04-09 00:27:48 +00:00
  • eb7f2aa4b9 Update model config Rahul Tuli 2025-04-08 23:53:29 +00:00
  • f80e36ddd2 Use: absolute import Rahul Tuli 2025-04-08 23:51:49 +00:00
  • 412d2ec6d0 Rename: sft.yaml to sparse-finetuning.yaml Rahul Tuli 2025-04-08 23:46:32 +00:00
  • 50fc5e6984 Add: llcompressor installable Rahul Tuli 2025-04-08 23:35:12 +00:00
  • 83a88b745f Address review comments from @markurtz Rahul Tuli 2025-04-04 17:59:41 +00:00
  • 8855bb115f Apply suggestions from @markurtz Rahul Tuli 2025-04-04 10:36:35 -04:00
  • ef9543b371 Update llmcompressor version to latest Rahul Tuli 2025-04-03 09:37:43 -04:00
  • 25e701e885 Revert: TODO's Rahul Tuli 2025-04-02 22:54:22 +00:00
  • 891a21e599 Use: warning over warn Rahul Tuli 2025-04-02 22:38:31 +00:00
  • 8beb2f27ad pre commit hooks Rahul Tuli 2025-04-02 22:35:36 +00:00
  • 56ba66b60f Add:llmcompressor instalable Rahul Tuli 2025-04-02 22:22:04 +00:00
  • 13d4b865d6 Update: review comments! Rahul Tuli 2025-03-26 21:57:00 +00:00
  • 3da866b2b9 Add: SFTPlugin with llmcompressor Rahul Tuli 2025-03-12 07:09:06 +00:00
  • 3b8271800d Built site for gh-pages Quarto GHA Workflow Runner 2025-04-30 17:13:32 +00:00
  • 24ff5f53f8 additional args for grpo config/trainer (#2598) Wing Lian 2025-04-30 13:11:12 -04:00
  • 5e949eaa07 replace zero_only with simpler if statement (#2592) Wing Lian 2025-04-30 13:11:03 -04:00
  • edf0128b6b Built site for gh-pages Quarto GHA Workflow Runner 2025-04-30 15:38:06 +00:00
  • 89ca14d9a0 ensure we pass axolotl extras to the Dockerfile so vllm is included in shipped images (#2599) Wing Lian 2025-04-30 11:35:45 -04:00
  • 5a35b513c4 Built site for gh-pages Quarto GHA Workflow Runner 2025-04-30 15:09:05 +00:00
  • 8446b4ad28 don't automatically enable lora kernels for RL training (#2600) Wing Lian 2025-04-30 11:06:50 -04:00
  • fc79606b6d only import vllm serve cli if its being called (#2597) [skip ci] Wing Lian 2025-04-30 09:11:25 -04:00
  • b350316d61 Built site for gh-pages Quarto GHA Workflow Runner 2025-04-30 07:35:03 +00:00
  • baeb00231b Handle other reasoning trace dataset formats (#2591) Wing Lian 2025-04-30 03:32:55 -04:00
  • 2413688b08 upload the deepspeed json to wandb (#2593) [skip ci] Wing Lian 2025-04-30 03:32:44 -04:00
  • 5bb1f3da56 feat: add qwen3 moe block for ds3 (#2596) [skip ci] NanoCode012 2025-04-30 14:32:23 +07:00
  • a21b9cc472 patch to convert LR from tensor to float when using DS (#2595) [skip ci] Wing Lian 2025-04-30 03:31:57 -04:00
  • b886330bb5 Built site for gh-pages Quarto GHA Workflow Runner 2025-04-29 21:11:02 +00:00
  • 41a1ec0c95 Plugins create_lr_scheduler support (#2584) Aleksandr Dremov 2025-04-29 23:08:30 +02:00
  • 26a07feb0d Built site for gh-pages Quarto GHA Workflow Runner 2025-04-29 20:21:16 +00:00
  • ecac731922 auto-enable lora kernels where possible (#2589) Dan Saunders 2025-04-29 16:18:49 -04:00
  • 742fef4200 fix(doc): key used to point to url in multimodal doc (#2575) [skip ci] NanoCode012 2025-04-30 02:10:59 +07:00
  • a39caf8824 bump vllm==0.8.5 for qwen3 support (#2583) [skip ci] Wing Lian 2025-04-29 15:10:40 -04:00
  • a014e4a3a6 Built site for gh-pages Quarto GHA Workflow Runner 2025-04-29 19:04:59 +00:00
  • 07e4f2e25b support for qwen3 with lora kernels (#2588) Wing Lian 2025-04-29 15:02:49 -04:00
  • c532c2ad4f Built site for gh-pages Quarto GHA Workflow Runner 2025-04-29 17:01:24 +00:00
  • c7d07de6b4 Fix eval + add smoke test (#2586) Dan Saunders 2025-04-29 12:58:54 -04:00
  • 158ce4eed1 Built site for gh-pages Quarto GHA Workflow Runner 2025-04-29 16:08:03 +00:00
  • 6565ae85d8 set config on the PluginManager for callback access (#2587) Wing Lian 2025-04-29 12:05:44 -04:00
  • 761c75d8d2 Built site for gh-pages Quarto GHA Workflow Runner 2025-04-29 14:04:02 +00:00
  • 80b4edb4a7 Post release fixes (#2581) Wing Lian 2025-04-29 10:01:38 -04:00
  • c9880977be split llmcompressor from vllm checks llmcompressor-sft-wing Wing Lian 2025-04-29 08:35:06 -04:00
  • d8849e546e Built site for gh-pages Quarto GHA Workflow Runner 2025-04-29 12:30:47 +00:00
  • fedbcc0254 remove torch 2.4.1 CI as part of support deprecation (#2582) Wing Lian 2025-04-29 08:28:32 -04:00
  • aeff1fcfac Built site for gh-pages Quarto GHA Workflow Runner 2025-04-29 00:32:29 +00:00
  • 8175896ada add dev tag for v0.10.0.dev0 (#2580) Wing Lian 2025-04-28 20:30:14 -04:00
  • bd7242040a Built site for gh-pages Quarto GHA Workflow Runner 2025-04-28 22:25:25 +00:00
  • 14d670dbf0 v0.9.0 release (#2578) v0.9.0 Wing Lian 2025-04-28 18:23:17 -04:00
  • 2d77165dc0 automatically split out reasoning trace from dataset (#2579) Wing Lian 2025-04-28 18:23:03 -04:00
  • ecea4fc0a4 Built site for gh-pages Quarto GHA Workflow Runner 2025-04-28 19:11:53 +00:00
  • 63b17e3109 chat template and example for qwen3 (#2577) Wing Lian 2025-04-28 15:09:41 -04:00
  • f196941315 additional fixes for docker and saving compressed Wing Lian 2025-04-28 13:06:59 -04:00
  • 5be047ac46 Fix: Test Rahul Tuli 2025-04-28 09:18:26 -04:00
  • 758115b8c6 Apply patch from @winglian Rahul Tuli 2025-04-26 15:30:27 -04:00
  • 0dc1da5876 Add: line about further optimizations using llmcompressor Rahul Tuli 2025-04-24 14:06:25 -04:00
  • f3e876dbfc Address Review Comments: * deleted redundant docs/llm_compressor.qmd * incorporated feedback in integration README.md * added llmcompressor integration to docs/custom_integrations.qmd Rahul Tuli 2025-04-23 18:00:00 -04:00
  • 99c13ef60c Add: .qmd file Rahul Tuli 2025-04-21 20:40:49 -04:00
  • 2c24434ee0 Tests, Style, Updates Rahul Tuli 2025-04-21 20:33:59 -04:00
  • 586268a0d7 Rebase and updates! Rahul Tuli 2025-04-17 17:19:59 -04:00
  • b600e119b6 Add: llm_compressor integration documentation Rahul Tuli 2025-04-09 01:03:45 +00:00
  • a8e5ba000e Move: LLMCompressorPlugin into it's own submodule Rahul Tuli 2025-04-09 00:27:48 +00:00
  • bc3dfa666d Update model config Rahul Tuli 2025-04-08 23:53:29 +00:00
  • 4371f3459e Use: absolute import Rahul Tuli 2025-04-08 23:51:49 +00:00
  • cc58d5e072 Rename: sft.yaml to sparse-finetuning.yaml Rahul Tuli 2025-04-08 23:46:32 +00:00
  • d197b054e3 Add: llcompressor installable Rahul Tuli 2025-04-08 23:35:12 +00:00
  • 7e1e153831 Address review comments from @markurtz Rahul Tuli 2025-04-04 17:59:41 +00:00
  • 42de3096cf Apply suggestions from @markurtz Rahul Tuli 2025-04-04 10:36:35 -04:00
  • 27758840a1 Update llmcompressor version to latest Rahul Tuli 2025-04-03 09:37:43 -04:00
  • 8dbf5c215a Revert: TODO's Rahul Tuli 2025-04-02 22:54:22 +00:00
  • 6411ca3fe1 Use: warning over warn Rahul Tuli 2025-04-02 22:38:31 +00:00
  • 813809c54d pre commit hooks Rahul Tuli 2025-04-02 22:35:36 +00:00
  • af7cfdc30b Add:llmcompressor instalable Rahul Tuli 2025-04-02 22:22:04 +00:00