axolotl

Files

Seung Hyun Cho 3e51a680c2 fix: Fix evaluation loss in KD trainer (#3271 )

* fix: Fix evaluation loss in KD trainer

* Fix v2 strategy super() call

* fix: Add safety check for total_tokens in log method

* fix: simplified num items and outputs return handling

* fix: add missing model forward pass in compute_loss

* refactor: Use Template Method pattern for chat template strategies

* refactor: use pop(None) and remove v2 override

* chore: lint

---------

Co-authored-by: NanoCode012 <nano@axolotl.ai>
Co-authored-by: Wing Lian <wing@axolotl.ai>

2025-12-17 13:40:36 -05:00

__init__.py

upgrade liger to 0.4.0 (#1973 )

2024-11-07 12:53:34 -05:00

test_diffusion_callback.py

text diffusion training plugin (#3067 )

2025-09-10 20:27:00 -04:00

test_diffusion.py

text diffusion training plugin (#3067 )

2025-09-10 20:27:00 -04:00

test_kd_chat_template.py

fix: Fix evaluation loss in KD trainer (#3271 )

2025-12-17 13:40:36 -05:00

test_liger.py

feat: add arg to enable dft in liger (#3125 )

2025-11-10 21:37:47 +07:00