Commit Graph

  • 1d339e4007 fixes Dan Saunders 2025-03-11 02:52:44 +00:00
  • 4190ad0647 updates Dan Saunders 2025-03-10 21:18:04 +00:00
  • b44a207248 update Dan Saunders 2025-03-06 17:44:32 +00:00
  • 51c326150b pytest Dan Saunders 2025-03-06 16:25:53 +00:00
  • 14baaf6e0a updates Dan Saunders 2025-03-05 15:39:45 +00:00
  • f487910444 removing unused code Dan Saunders 2025-03-05 15:18:53 +00:00
  • c5071dfd8a fix req Dan Saunders 2025-03-05 14:55:13 +00:00
  • e323145ba9 remove errant file Dan Saunders 2025-03-05 14:54:35 +00:00
  • 7efc787ac8 cleanup Dan Saunders 2025-03-05 14:53:18 +00:00
  • dce61cdab1 progress on ring attn impl Dan Saunders 2025-03-04 21:42:34 +00:00
  • bd952de9d2 progress on ring attn impl Dan Saunders 2025-03-04 21:31:11 +00:00
  • 3f8a43cab6 adding easy_context as integration for now Dan Saunders 2025-03-03 19:59:10 +00:00
  • 113e9cd193 Autodoc generation with quartodoc (#2419) Dan Saunders 2025-03-21 12:26:47 -04:00
  • 61825a464a chore(doc): add explanation on fsdp_transformer_layer_cls_to_wrap (#2429) [skip ci] NanoCode012 2025-03-21 22:59:22 +07:00
  • 94c00c1d04 pre-commit Dan Saunders 2025-03-21 11:23:39 -04:00
  • ddd84d7c65 update pylint Dan Saunders 2025-03-20 14:44:17 -04:00
  • 42bdf0bd74 update pre-commit version Dan Saunders 2025-03-20 09:59:01 -04:00
  • b03d96a228 include quartodoc build step Dan Saunders 2025-03-20 09:24:38 -04:00
  • 2653f170fc fix accidental change Dan Saunders 2025-03-20 02:55:42 +00:00
  • 3bfcce9f0a shrinking header sizes Dan Saunders 2025-03-18 15:55:06 -04:00
  • 8feb746953 fix Dan Saunders 2025-03-18 18:55:23 +00:00
  • a563815fe7 pydantic models refactor + add to autodoc + fixes Dan Saunders 2025-03-18 12:56:12 -04:00
  • 81f2203151 update to reflect recent changes Dan Saunders 2025-03-17 13:45:29 -04:00
  • 5b7e688fc5 fix broken link Dan Saunders 2025-03-17 13:18:52 -04:00
  • 5134aa66cd moving reference up near the top of the sidebar Dan Saunders 2025-03-17 12:43:53 -04:00
  • ba9a867adb more autodoc progress Dan Saunders 2025-03-17 12:26:58 -04:00
  • c618f42c39 Fix Dan Saunders 2025-03-14 14:12:13 -04:00
  • fc1f985296 Update docs/.gitignore to exclude auto-generated API documentation files Dan Saunders 2025-03-14 14:11:23 -04:00
  • a5e37f183c deletions Dan Saunders 2025-03-14 14:07:54 -04:00
  • e6a7bbe9ff quartodoc progress Dan Saunders 2025-03-14 14:00:29 -04:00
  • e4fd7aad0b quartodoc integration Dan Saunders 2025-03-14 16:16:07 +00:00
  • 486fc53c93 Built site for gh-pages Quarto GHA Workflow Runner 2025-03-21 15:03:46 +00:00
  • c907ac173e adding pre-commit auto-update GH action and bumping plugin versions (#2428) Dan Saunders 2025-03-21 11:02:43 -04:00
  • 156fede4f7 Update .pre-commit-config.yaml pre-commit-update Dan Saunders 2025-03-20 11:30:22 -04:00
  • dcbbd7af79 sorry to revert, but pylint complained Dan Saunders 2025-03-20 10:38:01 -04:00
  • 21bac7ce1a running updated pre-commit plugins Dan Saunders 2025-03-20 10:30:18 -04:00
  • aaa4571826 adding pre-commit auto-update GH action and bumping plugin versions Dan Saunders 2025-03-20 10:15:19 -04:00
  • 7aa656c980 Built site for gh-pages Quarto GHA Workflow Runner 2025-03-21 14:19:28 +00:00
  • 187227d837 Fixing KTO+QLoRA+multi-GPU (#2420) salman 2025-03-21 14:18:28 +00:00
  • e36ea0f5f9 Built site for gh-pages Quarto GHA Workflow Runner 2025-03-21 14:18:24 +00:00
  • f8de8bb4f2 chore(doc): add instructions on adding custom integrations (#2422) [skip ci] NanoCode012 2025-03-21 21:18:01 +07:00
  • 8e604848a4 add run on novita ai (#2421) [skip ci] hugo 2025-03-21 22:17:47 +08:00
  • aae4337f40 add 12.8.1 cuda to the base matrix (#2426) Wing Lian 2025-03-21 10:17:25 -04:00
  • 31799bdcc0 more parity across tests and docker images for packaging/setuptools cuda-12.8.1 Wing Lian 2025-03-21 08:56:01 -04:00
  • 25455ac25f make sure packaging version is consistent Wing Lian 2025-03-21 08:27:17 -04:00
  • edea25bd58 comment out license for validation for now Wing Lian 2025-03-21 08:20:28 -04:00
  • 42e32223c9 try rolling back packaging and setuptools versions Wing Lian 2025-03-21 08:12:07 -04:00
  • 6e0fed0ce7 use license instead of license-file Wing Lian 2025-03-21 07:25:09 -04:00
  • 5ece44b4a8 try with reversion of packaging/setuptools/wheel install Wing Lian 2025-03-21 07:19:12 -04:00
  • e7532c9b0c make sure ninja is installed Wing Lian 2025-03-21 06:57:06 -04:00
  • 2518a9b2a2 multiline fix Wing Lian 2025-03-20 20:51:16 -04:00
  • faeae323cb install deepspeed by itself Wing Lian 2025-03-20 20:04:39 -04:00
  • bb683644c3 deepspeed binary fixes hopefully Wing Lian 2025-03-20 19:52:07 -04:00
  • 7009a48398 bump deepspeed and set no binary Wing Lian 2025-03-20 14:01:01 -04:00
  • ee529e2354 use nightly Wing Lian 2025-03-20 00:15:55 -04:00
  • b2976e64ec add 12.8.1 cuda to the base matrix Wing Lian 2025-03-19 23:43:59 -04:00
  • 1eb1405d7b Built site for gh-pages Quarto GHA Workflow Runner 2025-03-20 14:23:09 +00:00
  • 38df5a36ea bump HF versions except for trl (#2427) Wing Lian 2025-03-20 10:22:05 -04:00
  • 955412b58b Built site for gh-pages Quarto GHA Workflow Runner 2025-03-20 03:59:35 +00:00
  • 4d92a68a96 use default torch fused adamw optimizer as default as adamw_hf is deprecated (#2425) Wing Lian 2025-03-19 23:58:33 -04:00
  • 8fc4c420a4 Add kd coefficient scheduler kd-logprob-data Wing Lian 2025-03-18 09:01:58 -04:00
  • 0c36a6fea6 config fix -___- fix_kto Salman Mohammadi 2025-03-18 11:35:20 +00:00
  • 64aca3c23c linting v2 Salman Mohammadi 2025-03-18 11:33:54 +00:00
  • 22abfd6170 simplifying check Salman Mohammadi 2025-03-18 11:26:53 +00:00
  • 0658c458b7 Merge branch 'fix_kto' of github.com:axolotl-ai-cloud/axolotl into fix_kto Salman Mohammadi 2025-03-18 11:23:48 +00:00
  • 690908cf2f linting Salman Mohammadi 2025-03-18 11:23:23 +00:00
  • b9378e9b39 Merge branch 'main' into fix_kto salman 2025-03-18 11:22:00 +00:00
  • 57b0ad1467 adding adapter check Salman Mohammadi 2025-03-18 11:21:42 +00:00
  • ec4ead6e3e adding error Salman Mohammadi 2025-03-18 11:20:34 +00:00
  • a319ac7d3e removing artifacts Salman Mohammadi 2025-03-17 20:00:09 +00:00
  • 09d3f2cffa WIP Salman Mohammadi 2025-03-17 19:59:19 +00:00
  • e04ff3569a Built site for gh-pages Quarto GHA Workflow Runner 2025-03-17 12:40:44 +00:00
  • 85147ec430 Update README.md (#2360) SicariusSicariiStuff 2025-03-17 14:39:17 +02:00
  • 51cd409488 Feat: minor docs improvements for RLHF and faq on embeddings (#2401) [skip ci] NanoCode012 2025-03-17 19:39:04 +07:00
  • 7235123d44 chore(docs): add cookbook/blog link to docs (#2410) [skip ci] NanoCode012 2025-03-17 19:38:19 +07:00
  • a87ee51e2e Built site for gh-pages Quarto GHA Workflow Runner 2025-03-15 12:50:38 +00:00
  • 4f5eb42a73 remove reference to deprecated import (#2407) Wing Lian 2025-03-15 08:49:41 -04:00
  • 92c217677c wip fix kto_fix Salman Mohammadi 2025-03-14 18:54:39 +00:00
  • a6ed036a26 Built site for gh-pages Quarto GHA Workflow Runner 2025-03-14 03:30:12 +00:00
  • fbe54be6b8 only validate hf user token on rank 0 (#2408) Wing Lian 2025-03-13 23:29:06 -04:00
  • 04f6324833 build cloud images with torch 2.6.0 (#2413) Wing Lian 2025-03-13 23:28:51 -04:00
  • e6f63901c2 Built site for gh-pages Quarto GHA Workflow Runner 2025-03-11 16:03:58 +00:00
  • f0072f3b9d use max of 32 dataset processes if not explicit (#2403) Wing Lian 2025-03-11 12:02:58 -04:00
  • 59899b9817 pass additional info for fix untrained tokens when using distributed + offloading (#2388) Wing Lian 2025-03-11 12:02:43 -04:00
  • 9cb05283b2 use v2 branch update-lgpl Wing Lian 2025-03-10 19:46:19 -04:00
  • aafa6245f4 try with deepspeed import Wing Lian 2025-03-10 19:39:55 -04:00
  • 3001e6d93c use commit sha for previous release dev Wing Lian 2025-03-10 18:41:15 -04:00
  • ed0456557d use revised branch Wing Lian 2025-03-10 16:53:57 -04:00
  • 09e4393a6a use branch again Wing Lian 2025-03-10 16:48:33 -04:00
  • 31a81106dd revert to previous known good commit Wing Lian 2025-03-10 16:36:33 -04:00
  • 93c20cc0d5 test branch Wing Lian 2025-03-07 14:30:59 -05:00
  • 3f5e2d6cc9 bump axolotl-contribs-lgpl Wing Lian 2025-03-07 13:35:34 -05:00
  • 6e1ad1137d Built site for gh-pages Quarto GHA Workflow Runner 2025-03-10 19:15:41 +00:00
  • 4a736986fa fix(modal): add git pull when getting branch files (#2399) NanoCode012 2025-03-11 02:14:41 +07:00
  • ab3c24373b Built site for gh-pages Quarto GHA Workflow Runner 2025-03-10 19:14:39 +00:00
  • 5d0f110a3b include iproute2 and nvtop in cloud image (#2393) Wing Lian 2025-03-10 15:13:38 -04:00
  • a38d2aeef1 Built site for gh-pages Quarto GHA Workflow Runner 2025-03-10 09:28:43 +00:00
  • 5d2f8fffb9 Built site for gh-pages Quarto GHA Workflow Runner 2025-03-10 09:27:43 +00:00
  • 83f8698b8a fix: create mount folder on modal if not exist (#2390) NanoCode012 2025-03-10 16:27:42 +07:00
  • 089c1c2c18 Built site for gh-pages Quarto GHA Workflow Runner 2025-03-10 09:26:51 +00:00