Commit Graph

  • 521e62daf1 remove the bos token from dpo outputs (#1733) [skip ci] Wing Lian 2024-11-15 19:09:20 -05:00
  • c16ec398d7 update to be deprecated evaluation_strategy (#1682) [skip ci] Wing Lian 2024-11-15 19:09:00 -05:00
  • 2f20cb7ebf upgrade datasets==3.1.0 and add upstream check (#2067) [skip ci] Wing Lian 2024-11-15 19:08:38 -05:00
  • 8f777abed5 Built site for gh-pages Quarto GHA Workflow Runner 2024-11-14 18:00:08 +00:00
  • 71d4030b79 gradient accumulation tests, embeddings w pad_token fix, smaller models (#2059) Wing Lian 2024-11-14 12:59:00 -05:00
  • f3a5d119af fix env var extraction (#2043) [skip ci] Wing Lian 2024-11-14 12:58:06 -05:00
  • ba219b51a5 fix duplicate base build (#2061) [skip ci] Wing Lian 2024-11-14 10:31:19 -05:00
  • 9a9e9787e9 Built site for gh-pages Quarto GHA Workflow Runner 2024-11-14 15:25:51 +00:00
  • 5be8e13d35 make sure to add tags for versioned tag on cloud docker images (#2060) Wing Lian 2024-11-14 10:24:49 -05:00
  • 60763b2e61 fix missing return transformers-fsdp-check Wing Lian 2024-11-14 10:14:13 -05:00
  • 082a41af9d add check for broken fsdp+grad_accum Wing Lian 2024-11-14 10:12:57 -05:00
  • d2ec2a03ed Built site for gh-pages Quarto GHA Workflow Runner 2024-11-14 12:00:21 +00:00
  • 2d7830fda6 upgrade to flash-attn 2.7.0 (#2048) Wing Lian 2024-11-14 06:59:25 -05:00
  • 5d505187d2 Built site for gh-pages Quarto GHA Workflow Runner 2024-11-13 22:11:40 +00:00
  • 5e98cdddac Grokfast support (#1917) Wing Lian 2024-11-13 17:10:36 -05:00
  • 1d7aee0ad2 ADOPT optimizer integration (#2032) [skip ci] Sunny Liu 2024-11-13 17:10:17 -05:00
  • 659ee5d723 don't cancel the tests on main automatically for concurrency (#2055) [skip ci] Wing Lian 2024-11-13 17:07:41 -05:00
  • 66be0ee833 Built site for gh-pages Quarto GHA Workflow Runner 2024-11-13 20:19:04 +00:00
  • 342935cff3 Update unsloth for torch.cuda.amp deprecation (#2042) Sunny Liu 2024-11-13 15:17:34 -05:00
  • 6b04b0e90e Built site for gh-pages Quarto GHA Workflow Runner 2024-11-13 19:05:24 +00:00
  • c5eb9ea2c2 fix push to main and tag semver build for docker ci (#2054) Wing Lian 2024-11-13 14:04:28 -05:00
  • b69388b83b Built site for gh-pages Quarto GHA Workflow Runner 2024-11-13 18:17:37 +00:00
  • f2145a3ccb add default torch version if not installed, and support for xformers new wheels (#2049) Wing Lian 2024-11-13 13:16:47 -05:00
  • 010d0e7ff3 retry flaky test_packing_stream_dataset test that timesout on read (#2052) [skip ci] Wing Lian 2024-11-13 13:16:16 -05:00
  • 01881c3113 make sure to tag images in docker for tagged releases (#2051) [skip ci] Wing Lian 2024-11-13 13:15:49 -05:00
  • 30230ed8a2 Built site for gh-pages Quarto GHA Workflow Runner 2024-11-13 15:22:50 +00:00
  • 0e8eb96e07 run pypi release action on tag create w version (#2047) Wing Lian 2024-11-13 10:21:48 -05:00
  • c062983c2f Built site for gh-pages Quarto GHA Workflow Runner 2024-11-13 15:08:17 +00:00
  • 4e1891b12b feat: upgrade to liger 0.4.1 (#2045) NanoCode012 2024-11-13 22:07:24 +07:00
  • 28924fc791 feat: cancel ongoing tests if new CI is triggered (#2046) [skip ci] NanoCode012 2024-11-13 22:06:59 +07:00
  • 8c480b2804 fix: inference not using chat_template (#2019) [skip ci] NanoCode012 2024-11-13 22:06:41 +07:00
  • a4b1cc6df0 Add example YAML file for training Mistral using DPO (#2029) [skip ci] Oliver Molenschot 2024-11-13 07:06:25 -08:00
  • 7b78a31593 feat: print out dataset length even if not preprocess (#2034) [skip ci] NanoCode012 2024-11-13 22:06:00 +07:00
  • 1aeeb8925d Built site for gh-pages Quarto GHA Workflow Runner 2024-11-13 04:21:44 +00:00
  • 810ebc2c0e invert the string in string check for p2p device check (#2044) Wing Lian 2024-11-12 23:20:47 -05:00
  • d4cf975896 Built site for gh-pages Quarto GHA Workflow Runner 2024-11-12 22:59:20 +00:00
  • ad435a3b09 add P2P env when multi-gpu but not the full node (#2041) Wing Lian 2024-11-12 17:58:26 -05:00
  • b75ecff86e Built site for gh-pages Quarto GHA Workflow Runner 2024-11-12 05:52:27 +00:00
  • 9f1cf9b17c fix: handle sharegpt dataset missing (#2035) NanoCode012 2024-11-12 12:51:37 +07:00
  • f4d457f93e Built site for gh-pages Quarto GHA Workflow Runner 2024-11-11 20:11:31 +00:00
  • 3931a42763 change deprecated modal Stub to App (#2038) Wing Lian 2024-11-11 15:10:34 -05:00
  • dc8f9059f7 feat: add metharme chat_template (#2033) [skip ci] NanoCode012 2024-11-12 03:09:58 +07:00
  • 234e94e9dd replace references to personal docker hub to org docker hub (#2036) [skip ci] Wing Lian 2024-11-11 15:09:29 -05:00
  • f68fb71005 update actions version for node16 deprecation (#2037) [skip ci] Wing Lian 2024-11-11 15:09:11 -05:00
  • 4091d4665c Built site for gh-pages Quarto GHA Workflow Runner 2024-11-11 14:49:17 +00:00
  • 9bc3ee6c75 add axolotlai docker hub org to publish list (#2031) Wing Lian 2024-11-11 09:48:19 -05:00
  • 241ccaa79c Built site for gh-pages Quarto GHA Workflow Runner 2024-11-10 17:46:50 +00:00
  • d356740ffa move deprecated kwargs from trainer to trainingargs (#2028) Wing Lian 2024-11-10 12:45:47 -05:00
  • 6dc0f4dac6 moved some DPOTrainer args to DPOConfig for future trl release (ref: upgrade-trl-v0.12.0_2) sunny 2024-11-08 16:38:51 -05:00
  • 47d9249cdc Built site for gh-pages Quarto GHA Workflow Runner 2024-11-08 19:48:57 +00:00
  • e4af51eb66 remove direct dependency on fused dense lib (#2027) (ref: v0.5.0) Wing Lian 2024-11-08 14:48:04 -05:00
  • e20b15bee3 make publish to pypi manually dispatchable as a workflow (#2026) [skip ci] Wing Lian 2024-11-08 14:18:16 -05:00
  • d4796cb645 increment version to 0.5.0 for next release (#2025) [skip ci] Wing Lian 2024-11-08 14:02:25 -05:00
  • b4e3feb6ef Built site for gh-pages Quarto GHA Workflow Runner 2024-11-08 18:46:41 +00:00
  • fd3b80716a remove fastchat and sharegpt (#2021) Wing Lian 2024-11-08 13:45:49 -05:00
  • fc97455393 Built site for gh-pages Quarto GHA Workflow Runner 2024-11-08 16:31:26 +00:00
  • 3265b7095e Add weighted optimisation support for trl DPO trainer integration (#2016) Sunny Liu 2024-11-08 11:29:11 -05:00
  • a9296d9124 Built site for gh-pages Quarto GHA Workflow Runner 2024-11-08 15:47:18 +00:00
  • 3cb2d75de1 upgrade pytorch to 2.5.1 (#2024) Wing Lian 2024-11-08 10:46:24 -05:00
  • f1b4030cdd WIP shampoo low bit optimizers (ref: shampoo-low_bit) Wing Lian 2024-11-08 10:02:10 -05:00
  • 1fceaa20e3 , sunny 2024-11-08 09:37:28 -05:00
  • 04501a9861 Built site for gh-pages Quarto GHA Workflow Runner 2024-11-07 22:55:21 +00:00
  • 035e9f9dd7 janky workaround to install FA2 on torch 2.5.1 base image since it takes forever to build (#2022) Wing Lian 2024-11-07 17:54:29 -05:00
  • 7137c39e74 Built site for gh-pages Quarto GHA Workflow Runner 2024-11-07 17:54:29 +00:00
  • 02ce520b7e upgrade liger to 0.4.0 (#1973) Wing Lian 2024-11-07 12:53:34 -05:00
  • 7ee7b4c493 test sunny 2024-11-07 11:57:37 -05:00
  • d2e51406a1 test sunny 2024-11-07 11:47:06 -05:00
  • 5d55c08086 test sunny 2024-11-07 11:42:52 -05:00
  • cc2815a3cc test sunny 2024-11-07 11:41:46 -05:00
  • 3b648f6bbe test sunny 2024-11-07 11:40:32 -05:00
  • 5294fe5a99 test sunny 2024-11-07 11:39:46 -05:00
  • 4b1273ae1e test sunny 2024-11-07 11:28:42 -05:00
  • 394806ab30 test sunny 2024-11-07 11:23:56 -05:00
  • 432b17eee1 test sunny 2024-11-07 11:20:32 -05:00
  • bbf5158e9c test (ref: upgrade-liger-test) sunny 2024-11-07 11:06:28 -05:00
  • ec70046a2b test sunny 2024-11-07 11:04:33 -05:00
  • 7fed41550e test sunny 2024-11-07 11:02:54 -05:00
  • da3a941bc3 test sunny 2024-11-07 11:00:51 -05:00
  • ad3c179a5a test sunny 2024-11-07 10:59:29 -05:00
  • 15e26b14eb test sunny 2024-11-07 10:54:48 -05:00
  • 33bbe9b222 test sunny 2024-11-07 10:52:52 -05:00
  • 1fddf45958 test sunny 2024-11-07 10:46:47 -05:00
  • e42e319446 make sure prepared path is empty for test Wing Lian 2024-11-06 10:20:51 -05:00
  • 58cca816f8 trl version requirement sunny 2024-11-06 10:01:05 -05:00
  • 613f238e56 use kwargs to support patch release Wing Lian 2024-11-06 09:43:35 -05:00
  • 6b617a4fd5 also upgrade accelerate Wing Lian 2024-11-06 08:59:52 -05:00
  • 6ac10de9ef upgrade liger and transformers Wing Lian 2024-11-06 08:53:03 -05:00
  • 28e134e6a8 commenting out sunny 2024-11-05 14:57:35 -05:00
  • 39af2a41a5 linting sunny 2024-11-05 12:46:05 -05:00
  • 41d10278bf test sunny 2024-11-05 12:38:33 -05:00
  • d9b65f69fb test sunny 2024-11-05 12:35:36 -05:00
  • bcb1205e39 test sunny 2024-11-05 12:30:45 -05:00
  • 04b532bd37 test sunny 2024-11-05 12:20:00 -05:00
  • 8ac149e317 test sunny 2024-11-05 12:03:06 -05:00
  • 98d819d3f7 trl sunny 2024-11-05 11:59:10 -05:00
  • 9da9916ff2 trl sunny 2024-11-05 11:57:26 -05:00
  • 027ccdab4d update trl version requirements sunny 2024-11-05 11:53:49 -05:00
  • 7a00dbc367 trlv0.12.0 integration sunny 2024-11-05 11:44:46 -05:00
  • 1b8d439441 add test case Wing Lian 2024-10-17 10:00:15 -04:00
  • 1ed351781a chore: lint Wing Lian 2024-10-17 08:17:11 -04:00