Seung Hyun Cho
3e51a680c2
fix: Fix evaluation loss in KD trainer (#3271)
* fix: Fix evaluation loss in KD trainer
* Fix v2 strategy super() call
* fix: Add safety check for total_tokens in log method
* fix: simplified num items and outputs return handling
* fix: add missing model forward pass in compute_loss
* refactor: Use Template Method pattern for chat template strategies
* refactor: use pop(None) and remove v2 override
* chore: lint
---------
Co-authored-by: NanoCode012 <nano@axolotl.ai>
Co-authored-by: Wing Lian <wing@axolotl.ai>
2025-12-17 13:40:36 -05:00