tocmo0nlord
  • Joined on 2026-02-11
tocmo0nlord created branch liger-065 in tocmo0nlord/axolotl 2026-04-26 15:07:26 +00:00
tocmo0nlord pushed to liger-065 at tocmo0nlord/axolotl 2026-04-26 15:07:26 +00:00
2d13a06722 slow fsdp1 test
ba27e830e8 triton versions for older pytorch
8f7219e139 upgrade liger to 0.6.5 and triton to 3.5.1
Compare 3 commits »
tocmo0nlord created branch liger-dpo in tocmo0nlord/axolotl 2026-04-26 15:07:26 +00:00
tocmo0nlord created branch lhl-moe-aux-loss-free in tocmo0nlord/axolotl 2026-04-26 15:07:25 +00:00
tocmo0nlord pushed to lhl-moe-aux-loss-free at tocmo0nlord/axolotl 2026-04-26 15:07:25 +00:00
6636e5de7e address PR code review
0a566d7a15 chore: lint
5acb1b0ade update for transformers v5 for experts parameters and compose with moe kernels
4009a2ba5f reordered our tests to mirror llm_compressor for prepare_plugins/validate order
66b2ab8414 move configs from global config to plugin specific args
Compare 10 commits »
tocmo0nlord created branch liger-063 in tocmo0nlord/axolotl 2026-04-26 15:07:25 +00:00
tocmo0nlord pushed to liger-063 at tocmo0nlord/axolotl 2026-04-26 15:07:25 +00:00
9ee7ce5c85 set TORCH_CUDA_ARCH_LIST correctly
a41ca4d06f upgrade liger dep to 0.6.3
Compare 2 commits »
tocmo0nlord created branch kernelize-scattermoe-lora in tocmo0nlord/axolotl 2026-04-26 15:07:24 +00:00
tocmo0nlord pushed to kernelize-scattermoe-lora at tocmo0nlord/axolotl 2026-04-26 15:07:24 +00:00
8495c79fb1 properly handles kernels repo type
9a0d3016df first pass at build and deploy scattermoe-lora kernel
Compare 2 commits »
tocmo0nlord created branch kto_fix in tocmo0nlord/axolotl 2026-04-26 15:07:24 +00:00
tocmo0nlord pushed to kto_fix at tocmo0nlord/axolotl 2026-04-26 15:07:24 +00:00
92c217677c wip fix
tocmo0nlord created branch kwargs-refactor in tocmo0nlord/axolotl 2026-04-26 15:07:24 +00:00
tocmo0nlord pushed to kwargs-refactor at tocmo0nlord/axolotl 2026-04-26 15:07:24 +00:00
4dc75cc713 Merge branch 'main' into kwargs-refactor
tocmo0nlord created branch latent-space in tocmo0nlord/axolotl 2026-04-26 15:07:24 +00:00
tocmo0nlord pushed to latent-space at tocmo0nlord/axolotl 2026-04-26 15:07:24 +00:00
cf00e20270 experiment w latent space
tocmo0nlord pushed to kd-trainer-v2 at tocmo0nlord/axolotl 2026-04-26 15:07:23 +00:00
tocmo0nlord created branch kd-trainer-zscore in tocmo0nlord/axolotl 2026-04-26 15:07:23 +00:00
tocmo0nlord pushed to kd-trainer-zscore at tocmo0nlord/axolotl 2026-04-26 15:07:23 +00:00
tocmo0nlord created branch keep_in_memory in tocmo0nlord/axolotl 2026-04-26 15:07:23 +00:00
tocmo0nlord pushed to keep_in_memory at tocmo0nlord/axolotl 2026-04-26 15:07:23 +00:00
eea6e8303a Disable datasets caching when preparing dataset for packing