axolotl

Files

Wing Lian ac77da96da use smaller pretrained models for ci (#3620 ) [skip ci]

* use smaller pretrained models for ci

* more steps for loss check

* fix tests

* more train steps

* fix losses

2026-04-27 13:22:56 -04:00

test_cut_cross_entropy.py

2026-04-27 13:22:56 -04:00

test_fp8.py

2026-01-27 17:08:24 -05:00

test_hooks.py

2026-01-27 17:08:24 -05:00

test_kd.py

2026-01-27 17:08:24 -05:00

test_liger.py

2026-01-27 17:08:24 -05:00

test_llm_compressor.py

2026-01-27 17:08:24 -05:00

test_scattermoe_lora_kernels.py

2026-03-21 22:46:10 -04:00

test_scattermoe_lora_olmoe.py

2026-04-21 10:16:03 -04:00

test_sonicmoe_lora.py

2026-04-02 08:53:48 -04:00

test_sonicmoe.py

2026-04-02 08:53:48 -04:00