tocmo0nlord
  • Joined on 2026-02-11
tocmo0nlord created branch llama4 in tocmo0nlord/axolotl 2026-04-26 15:07:28 +00:00
tocmo0nlord pushed to llama4 at tocmo0nlord/axolotl 2026-04-26 15:07:28 +00:00
tocmo0nlord created branch llama4-patches in tocmo0nlord/axolotl 2026-04-26 15:07:28 +00:00
tocmo0nlord pushed to llama4-patches at tocmo0nlord/axolotl 2026-04-26 15:07:28 +00:00
tocmo0nlord created branch llava in tocmo0nlord/axolotl 2026-04-26 15:07:28 +00:00
tocmo0nlord pushed to llava at tocmo0nlord/axolotl 2026-04-26 15:07:28 +00:00
b52e61a574 pretrain fixes for mm
53f93f67bb fix to set training args so projector properly saves
ef95ea2977 additional args for parity, fix to properly save projector during pretrain
1321608dc4 add docs and tweak yml
Compare 7 commits »
tocmo0nlord created branch llava-train in tocmo0nlord/axolotl 2026-04-26 15:07:28 +00:00
tocmo0nlord created branch llama-4-examples in tocmo0nlord/axolotl 2026-04-26 15:07:27 +00:00
tocmo0nlord pushed to llama-4-examples at tocmo0nlord/axolotl 2026-04-26 15:07:27 +00:00
46afcf070f rename to specify fsdp
3036ca349f add README for llama4
dc4809f7dd [llama4] fix the mm yaml, add scout single gpu yaml
Compare 3 commits »
tocmo0nlord created branch llama-4-z3 in tocmo0nlord/axolotl 2026-04-26 15:07:27 +00:00
tocmo0nlord pushed to llama-4-z3 at tocmo0nlord/axolotl 2026-04-26 15:07:27 +00:00
9509abccdd use yet-another-deepspeed branch from transformers#37324
3acefba9ba point to branch for potential zero3 fix
100e5ea6ea llama4 support
Compare 3 commits »
tocmo0nlord created branch llama-dropout in tocmo0nlord/axolotl 2026-04-26 15:07:27 +00:00
tocmo0nlord pushed to llama-dropout at tocmo0nlord/axolotl 2026-04-26 15:07:27 +00:00
7771498eae add guassian dropout support
tocmo0nlord created branch llama-flash-attn-fix in tocmo0nlord/axolotl 2026-04-26 15:07:27 +00:00
tocmo0nlord pushed to llama-flash-attn-fix at tocmo0nlord/axolotl 2026-04-26 15:07:27 +00:00
8c171aadb4 drop unused padding_mask in llama patch
tocmo0nlord created branch llama-multipack in tocmo0nlord/axolotl 2026-04-26 15:07:27 +00:00
tocmo0nlord pushed to llama-multipack at tocmo0nlord/axolotl 2026-04-26 15:07:27 +00:00
469e15607d basic llama multipack
tocmo0nlord pushed to liger-dpo at tocmo0nlord/axolotl 2026-04-26 15:07:26 +00:00
96af760e08 add option for liger_pref_rl
cfa80dace0 import typo
0a661980ca wip for liger dpo integration
Compare 3 commits »
tocmo0nlord created branch lisa in tocmo0nlord/axolotl 2026-04-26 15:07:26 +00:00
tocmo0nlord pushed to lisa at tocmo0nlord/axolotl 2026-04-26 15:07:26 +00:00
dfe591435f make lisa training example work on one 24gb gpu
5dd9364c00 example config for lisa
6185cd5227 fix LISA by ensuring params are not frozen during __init__
b357c93f23 improve lisa callback logging
21a5094226 fix default and fix attribute traversal for layers
Compare 6 commits »