axolotl/examples at telemetry-opt-in - axolotl - Gitea

tocmo0nlord/axolotl

Files

History

Wing Lian c67910fa6f bump hf deps (#2735 ) [skip ci]

* bump hf deps

* upgrade liger-kernel too

* install cce from fork for transformers fix

* fix reference to vocab size in gemma3 patch

* use padding_idx instead of pad_token_id

* remove fixed gemma3 patch

* use updated cce fork

* fix local mllama cce patches w docstring

* add test for multipack with trainer setup and fix trainer for trainer refactor upstream

* bump modal version

* guard for iterable datasetS

* mllama model arch layout changed in latest transformers

* fix batch sampler with drop_last

* fix: address upstream vlm changes for lora

* fix: update references to old lora target path

* fix: remove mllama fa2 patch due to upstream fix

* fix: lora kernel patch path for multimodal models

* fix: removed mllama from quarto

* run test for came optim on 2.6.0+

* fix fsdp2 patch and remove deprecated patch

* make sure to set sequence_parallel_degree for grpo

* Add SP test for GRPO

* add sp to grpo config for trainer

* use reward_funcs as kwarg to grpo trainer

* fix the comprehension for reward funcs

* reward funcs already passed in as args

* init sp_group right before training

* fix check for adding models to SP context

* make sure to pass args to super

* upgrade deepspeed

* use updated trl and add reasoning flags for vllm

* patch the worker

---------

Co-authored-by: NanoCode012 <nano@axolotl.ai>

2025-06-05 07:20:33 -07:00

..

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

native support for modal cloud from CLI (#2237 )

2025-01-30 11:34:02 -05:00

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

colab-notebooks

fix build w pyproject to respect insalled torch version (#2168 )

2024-12-10 16:25:25 -05:00

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

feat: add examples for deepcoder (#2517 )

2025-04-12 07:25:23 -07:00

Feat(examples): add deepcogito (#2516 ) [skip ci]

2025-04-11 09:52:23 -04:00

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

bump hf deps (#2735 ) [skip ci]

2025-06-05 07:20:33 -07:00

feat: add glm and glm4 multipack and cce (#2546 )

2025-04-23 10:27:51 -04:00

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

simplify the example configs to be more minimal and less daunting (#2486 ) [skip ci]

2025-04-04 13:47:26 -04:00

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

Rank 0-only logging (#2608 )

2025-05-28 14:57:30 +01:00

bump hf deps (#2735 ) [skip ci]

2025-06-05 07:20:33 -07:00

fix(doc): clarify instruction to delinearize llama4 similar to cli doc (#2644 ) [skip ci]

2025-05-07 10:29:47 -04:00

bump hf deps (#2735 ) [skip ci]

2025-06-05 07:20:33 -07:00

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

bump hf deps (#2735 ) [skip ci]

2025-06-05 07:20:33 -07:00

simplify the example configs to be more minimal and less daunting (#2486 ) [skip ci]

2025-04-04 13:47:26 -04:00

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

Adds example for training a TTS model on top of a LLM. (#2614 )

2025-05-06 10:11:06 +02:00

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

bump hf deps (#2735 ) [skip ci]

2025-06-05 07:20:33 -07:00

simplify the example configs to be more minimal and less daunting (#2486 ) [skip ci]

2025-04-04 13:47:26 -04:00

simplify the example configs to be more minimal and less daunting (#2486 ) [skip ci]

2025-04-04 13:47:26 -04:00

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

SP dataloader patching + removing custom sampler / dataloader logic (#2686 )

2025-05-21 11:20:20 -04:00

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

QAT (#2590 )

2025-05-28 12:35:47 +01:00

simplify the example configs to be more minimal and less daunting (#2486 ) [skip ci]

2025-04-04 13:47:26 -04:00

simplify the example configs to be more minimal and less daunting (#2486 ) [skip ci]

2025-04-04 13:47:26 -04:00

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00

remove strict=false from example yamls [skip ci] (#2523 ) [skip ci]

2025-04-12 07:25:11 -07:00