tocmo0nlord
  • Joined on 2026-02-11
tocmo0nlord created branch fix/granite-speech in tocmo0nlord/axolotl 2026-04-26 15:07:11 +00:00
tocmo0nlord pushed to fix/granite-speech at tocmo0nlord/axolotl 2026-04-26 15:07:11 +00:00
380921ee56 Update ModelLoader to set default vocab_size if not defined in model config, enhancing compatibility with tokenizer defaults.
6e71819560 Update ModelLoader to set vocab_size for GraniteSpeechConfig if not defined, ensuring compatibility with tokenizer defaults.
ea234afa8a Enhance model loading logic to include support for GraniteSpeechConfig, allowing for the use of the specific model class for Granite Speech.
738adb2258 fixes
f40e8caa28 checks
Compare 8 commits »
tocmo0nlord created branch fix/hpc-root in tocmo0nlord/axolotl 2026-04-26 15:07:11 +00:00
tocmo0nlord pushed to fix/hpc-root at tocmo0nlord/axolotl 2026-04-26 15:07:11 +00:00
83ff8bfa1a fix: change docker miniconda install to workspace
tocmo0nlord created branch fix/kd-trainer-num-items in tocmo0nlord/axolotl 2026-04-26 15:07:11 +00:00
tocmo0nlord pushed to fix/dpo-labels at tocmo0nlord/axolotl 2026-04-26 15:07:10 +00:00
fc1900761b fix(trl): remove access to invalid property
tocmo0nlord created branch fix/eval-accu in tocmo0nlord/axolotl 2026-04-26 15:07:10 +00:00
tocmo0nlord pushed to fix/eval-accu at tocmo0nlord/axolotl 2026-04-26 15:07:10 +00:00
a65dbe779f fix: suspected eval vram increased usage
tocmo0nlord created branch fix/gemma3-text-only in tocmo0nlord/axolotl 2026-04-26 15:07:10 +00:00
tocmo0nlord pushed to fix/gemma3-text-only at tocmo0nlord/axolotl 2026-04-26 15:07:10 +00:00
53a12282bc fix: log merge command once done
7271754902 fix: handle plugin logging
6d5257d92e fix: ignore ds_store
0e357b5df6 fix: load gemma3 as text only model with dynamic weights
Compare 4 commits »
tocmo0nlord created branch fix/gemma3n-text-attention in tocmo0nlord/axolotl 2026-04-26 15:07:10 +00:00
tocmo0nlord pushed to fix/gemma3n-text-attention at tocmo0nlord/axolotl 2026-04-26 15:07:10 +00:00
8eba033dc4 fix: correct attention class retrieval for gemma3n model in lora_kernels.py
a9c0f43202 fix: update attention class import logic for gemma3n model
Compare 2 commits »
tocmo0nlord pushed to fix/cce-linear at tocmo0nlord/axolotl 2026-04-26 15:07:09 +00:00
4581d6a8de fix: accidentally reassigning tensor to weight
1a85fab2ca fix: lm_head is a view or related view modified
Compare 2 commits »
tocmo0nlord created branch fix/cp-waste in tocmo0nlord/axolotl 2026-04-26 15:07:09 +00:00
tocmo0nlord pushed to fix/cp-waste at tocmo0nlord/axolotl 2026-04-26 15:07:09 +00:00
255c5b90ca fix: make prepare_context_parallel_inputs no-op
tocmo0nlord created branch fix/diffusion in tocmo0nlord/axolotl 2026-04-26 15:07:09 +00:00
tocmo0nlord pushed to fix/diffusion at tocmo0nlord/axolotl 2026-04-26 15:07:09 +00:00
08c8f3f22f fix: total tokens and defaults in config
76f0fe2621 fix: steps not allowed fractional
Compare 2 commits »
tocmo0nlord created branch fix/doc-key in tocmo0nlord/axolotl 2026-04-26 15:07:09 +00:00
tocmo0nlord pushed to fix/doc-key at tocmo0nlord/axolotl 2026-04-26 15:07:09 +00:00
f5f5a3ee9b feat(doc): add llama4 to liger support
cc512a57a5 fix: wrong key used in example doc
Compare 2 commits »
tocmo0nlord created branch fix/dpo-labels in tocmo0nlord/axolotl 2026-04-26 15:07:09 +00:00