Commit Graph

  • 4cdfdfebb5 upgrade transformers==4.57.1 and peft==0.23.1 (#3214) Wing Lian 2025-10-14 15:54:05 -04:00
  • 6e2f5ccf9f chore: update pre-commit hooks (#3211) [skip ci] github-actions[bot] 2025-10-14 10:21:49 -04:00
  • 8c7f63cf97 fix: unpack cce imported incorrectly (#3212) [skip ci] NanoCode012 2025-10-13 17:19:15 +07:00
  • cd856b45b1 feat:add support dataset_num_processes (#3129) [skip ci] VED 2025-10-13 15:48:12 +05:30
  • 8aa2d9ddea Built site for gh-pages Quarto GHA Workflow Runner 2025-10-10 13:49:59 +00:00
  • 143dea4753 FSDPConfig (#3170) salman 2025-10-10 14:44:25 +01:00
  • 6fdb47fd4d Built site for gh-pages Quarto GHA Workflow Runner 2025-10-10 13:02:42 +00:00
  • bc2ffb8204 fix: Enable KD plugin support for PEFT/LoRA adapters (#3207) Hitesh Sagtani 2025-10-10 18:27:00 +05:30
  • 153edcfe79 fix(doc): add act checkpointing migration to fsdp2 docs (#3193) [skip ci] NanoCode012 2025-10-10 10:57:50 +07:00
  • 9436e959d9 Built site for gh-pages Quarto GHA Workflow Runner 2025-10-09 18:24:17 +00:00
  • 08b8fa62cc only calculate packed ds length once if using a large world size (#3210) Wing Lian 2025-10-09 14:18:46 -04:00
  • 3a5c97e6e5 use can_device_access_peer for P2P checks (#3209) [skip ci] Wing Lian 2025-10-09 14:17:31 -04:00
  • 37f78c8592 add chat_template_jinja to wandb (#3192) [skip ci] VED 2025-10-09 21:35:54 +05:30
  • 3c93a11045 Built site for gh-pages Quarto GHA Workflow Runner 2025-10-09 15:56:25 +00:00
  • ab63b92c38 feat: add lfm2 family and latest moe model (#3208) NanoCode012 2025-10-09 21:47:41 +07:00
  • 6f8ce024d1 Remove check_torch_compile_deepspeed (#3195) [skip ci] Manh Nguyen 2025-10-08 22:27:01 +07:00
  • a29281d4dd Built site for gh-pages Quarto GHA Workflow Runner 2025-10-08 14:49:24 +00:00
  • d0e9c3c1c5 When using Ray use prepare for dataloader fixes (#3198) Wing Lian 2025-10-08 10:43:41 -04:00
  • 4c3488cc9f chore: update pre-commit hooks (#3160) [skip ci] github-actions[bot] 2025-10-08 08:58:02 -04:00
  • 6b0d97a22c Built site for gh-pages Quarto GHA Workflow Runner 2025-10-08 12:49:27 +00:00
  • 130637a3fa upgrade transformers to 4.57.0 (#3201) Wing Lian 2025-10-08 08:43:46 -04:00
  • eb0a088ae7 Built site for gh-pages Quarto GHA Workflow Runner 2025-10-08 11:44:52 +00:00
  • 377c510e95 sleep model support (#3135) VED 2025-10-08 17:09:21 +05:30
  • 409cfb8a87 deprecate torch 2.6.0 support (#3197) [skip ci] Wing Lian 2025-10-07 11:23:41 -04:00
  • ffb307a8a7 update tags uv-first Dan Saunders 2025-10-04 12:10:43 -04:00
  • 915c258c6e contrib fix Dan Saunders 2025-10-04 11:53:48 -04:00
  • 1e58235c38 contrib Dan Saunders 2025-10-04 11:47:56 -04:00
  • 5753c5b89c mypy 3.11 Dan Saunders 2025-10-04 11:26:10 -04:00
  • 18d78f02cf fix sdist Dan Saunders 2025-10-04 09:48:19 -04:00
  • 923181aaed Merge branch 'main' into uv-first Dan Saunders 2025-10-04 09:07:22 -04:00
  • 786f1a3ff9 add missing dep Dan Saunders 2025-10-03 12:46:15 -04:00
  • 26418e6f9a Fix Dan Saunders 2025-10-02 12:53:51 -04:00
  • 19fe84ef46 Fix Dan Saunders 2025-10-02 12:33:13 -04:00
  • 98730868e7 fix Dan Saunders 2025-10-02 12:07:58 -04:00
  • 5771a65b88 fix Dan Saunders 2025-10-02 11:20:23 -04:00
  • f912d1bb97 fix Dan Saunders 2025-10-02 10:57:09 -04:00
  • 0250e5f87c fix Dan Saunders 2025-10-01 17:02:31 -04:00
  • 274c579d81 handle race cond Dan Saunders 2025-10-01 16:31:39 -04:00
  • ccd2f12335 fix? Dan Saunders 2025-10-01 16:18:40 -04:00
  • 00e0238501 fix? Dan Saunders 2025-10-01 16:15:06 -04:00
  • f782957002 fix Dan Saunders 2025-10-01 14:44:14 -04:00
  • f2f66f2bb9 fix Dan Saunders 2025-10-01 13:16:35 -04:00
  • 013474eb70 mirror dev deps Dan Saunders 2025-10-01 12:58:20 -04:00
  • f0ea98129e Built site for gh-pages Quarto GHA Workflow Runner 2025-10-01 15:17:34 +00:00
  • ce74c20109 don't cache pip install (#3194) Wing Lian 2025-10-01 11:11:39 -04:00
  • 6dc9816722 fix Dan Saunders 2025-10-01 10:18:50 -04:00
  • 8f2bcb05d0 Built site for gh-pages Quarto GHA Workflow Runner 2025-10-01 08:08:30 +00:00
  • a6bfbe3400 torch_dtype -> dtype (#3177) VED 2025-10-01 13:32:51 +05:30
  • 74715125b6 fix Dan Saunders 2025-09-30 17:28:15 -04:00
  • f0f3bfbdf0 fix Dan Saunders 2025-09-30 17:25:07 -04:00
  • 022ef7ab4e fix Dan Saunders 2025-09-30 17:12:23 -04:00
  • 04533b79d4 fix Dan Saunders 2025-09-30 17:07:57 -04:00
  • 19de29be19 fix Dan Saunders 2025-09-30 17:00:25 -04:00
  • ec75aa5889 fix Dan Saunders 2025-09-30 16:52:37 -04:00
  • cf4e3fac64 version fix Dan Saunders 2025-09-30 16:48:55 -04:00
  • 69df309cbb separate out flash-attn install (sadly) Dan Saunders 2025-09-30 14:58:56 -04:00
  • b436ecf61f fix Dan Saunders 2025-09-29 12:08:23 -04:00
  • f137ce50ec grpclib Dan Saunders 2025-09-28 21:28:53 -04:00
  • 4131bcf769 fix? Dan Saunders 2025-09-28 20:04:44 -04:00
  • 64fea39978 add back protobuf Dan Saunders 2025-09-28 19:18:06 -04:00
  • 4966496b98 revert Dan Saunders 2025-09-27 15:16:17 -04:00
  • 66a9e4fced fix? Dan Saunders 2025-09-26 23:08:29 -04:00
  • 15d35b76bb fixes Dan Saunders 2025-09-26 21:50:35 -04:00
  • 0d53e0fe8f fix -E -> --extra Dan Saunders 2025-09-26 21:21:10 -04:00
  • 9344fa5e8c fix install scripts (?) Dan Saunders 2025-09-26 20:35:08 -04:00
  • c702edae5f use container venv Dan Saunders 2025-09-26 20:19:14 -04:00
  • dfaf76659f pip install --system flag Dan Saunders 2025-09-26 19:53:51 -04:00
  • 26a58bb8af git SHA Dan Saunders 2025-09-26 19:39:08 -04:00
  • cec2490903 prune 2.7.0, docker cache invalidation Dan Saunders 2025-09-26 19:11:28 -04:00
  • dfa5224908 uv.lock Dan Saunders 2025-09-26 20:46:50 +00:00
  • ddafc6ef80 referring to temp docker images Dan Saunders 2025-09-26 16:04:39 -04:00
  • 8583e9a849 Built site for gh-pages Quarto GHA Workflow Runner 2025-09-26 19:14:14 +00:00
  • f4376748f3 debug log: multiprocess race condition fix (#3188) Dan Saunders 2025-09-26 15:07:39 -04:00
  • ad56e600e3 remove 2.7.0 images Dan Saunders 2025-09-26 14:40:41 -04:00
  • 18d9456297 loosen xformers range Dan Saunders 2025-09-26 13:32:11 -04:00
  • da5ede6372 lockfile Dan Saunders 2025-09-26 17:27:31 +00:00
  • 6cbca1ffb2 loosen xformers range Dan Saunders 2025-09-26 13:26:13 -04:00
  • 2e082d47cc constrain torch version Dan Saunders 2025-09-26 13:20:45 -04:00
  • b4c6675cd2 fix Dan Saunders 2025-09-26 13:13:13 -04:00
  • 828131332a no -y flag for uv pip install Dan Saunders 2025-09-26 13:04:33 -04:00
  • 273a03f85c simplify install script Dan Saunders 2025-09-26 12:55:55 -04:00
  • 1d0562dedd adding fp32 support fsdp2_fp32 Salman Mohammadi 2025-09-26 16:32:09 +00:00
  • 9bbe2cfe0f handle vllm pinned conflict Dan Saunders 2025-09-26 12:27:11 -04:00
  • 64da8f0044 depr warning Dan Saunders 2025-09-26 11:59:58 -04:00
  • 1fa0a98e38 update lock Dan Saunders 2025-09-26 15:44:46 +00:00
  • 8d542d9d63 deps up to date Dan Saunders 2025-09-26 10:39:34 -04:00
  • a4565476e0 find-links for wheels, auto-gptq -> gptqmodel Dan Saunders 2025-09-16 15:43:57 -04:00
  • 02dc263338 updates Dan Saunders 2025-09-16 15:23:40 -04:00
  • 2acd3e1242 dep Dan Saunders 2025-09-15 17:13:45 -04:00
  • 0437c1a4ba auto-gptq -> gptqmodel Dan Saunders 2025-09-15 17:06:52 -04:00
  • ef150fd973 updates Dan Saunders 2025-09-15 15:59:10 -04:00
  • 47ad92c6b9 fix Dan Saunders 2025-09-11 13:12:08 -04:00
  • f0fee9c56c req Dan Saunders 2025-09-11 12:31:15 -04:00
  • 37d07bd7f7 coderabbito, improvements Dan Saunders 2025-09-11 12:11:00 -04:00
  • 4c81172917 coderabbito Dan Saunders 2025-09-10 16:21:17 -04:00
  • cd8c769e84 Update cicd/Dockerfile.jinja Dan Saunders 2025-09-10 16:15:43 -04:00
  • 0d60046d08 Update .github/workflows/pypi.yml Dan Saunders 2025-09-10 16:07:23 -04:00
  • c110e3eb48 remove setup.py, requirements.txt and refs Dan Saunders 2025-08-30 01:08:53 -04:00
  • 95c259b3fb depr warning Dan Saunders 2025-08-30 00:53:31 -04:00
  • d1fd505813 update Dan Saunders 2025-08-30 00:42:38 -04:00