Nemo gym integration (#3516) [skip ci]

* nemo gym integration with grpo wip
* mostly working
* cleanup
* simplify
* update docs
* nemo gym support wip
* cleanup
* chore: lint
* address PR review and add more tests
* chore: lint
* post merge lora fixes for CI (#3536) [skip ci]
* post merge lora fixes for CI
* handle lora kernel auto-enable for moe without grouped_mm
* prefer not to import torch in schema validation
* address pr comments, add timeout, add tests
* roundup_power2_divisions not needed with newer pytorch versions (#3540)
* roundup_power2_divisions not needed with newer pytorch versions
* remove typo
* update qwen3.5 moe 35b-a3b yaml for 5090
* more bug fixes
* fix tests to match updated trainer
* don't use fa2 for hooks test
* reset plugins on the instance
* retry download
* fix references to renamed axolotl_cfg property on trainer
* Fix ref to trainer cfg
* fix: robust handling of race condition on patching check (#3543) [skip ci]
* EBFT: Matching Features, Not Tokens: Energy-Based Fine-Tuning of Language Models (#3527) [skip ci]
* EBFT wip
* fixes
* more fixes
* add missing strided module
* ebft fixes for multi-turn
* make ebft work with async
* add example for ebft w qwen3.5
* fix for split thinking and update yaml for lora over linear attention only
* enforce_eager for vllm arg in schema
* fix sync weights
* fix multi-gpu
* handle updated sig for mm
* ddp fixes
* improve multi-gpu handling, don't calculate logits, adaptive completion length
* chore: lint
* chore: lint
* support completion_mean
* Address corereview feedback
* clamp min IS ratio
* Address PR code review
* more fixes identified
* address code review
* Fix property from rebase conflict
* fix for ebft sync and update docs
* make trainer loss patch check a solo test

---------

Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-03-25 07:38:06 -04:00
parent 2fb72798e0
commit c2bd75aff6
20 changed files with 3592 additions and 19 deletions
--- a/tests/monkeypatch/test_trainer_loss_calc.py
+++ b/tests/monkeypatch/test_trainer_loss_calc.py
@@ -1,26 +0,0 @@
-"""Unit tests for trainer loss calc monkeypatch."""
-
-import unittest
-
-from axolotl.monkeypatch.transformers.trainer_loss_calc import (
-    check_evaluation_loop_is_patchable,
-    check_maybe_log_save_evaluate_is_patchable,
-)
-
-
-class TestTrainerLossCalc(unittest.TestCase):
-    """
-    Unit test class for trainer loss calc monkeypatch
-    """
-
-    def test_trainer_loss_calc_is_patchable(self):
-        """
-        Test that the upstream transformers code is still patchable. This will fail if
-        the patched code changes upstream.
-        """
-        assert check_evaluation_loop_is_patchable()
-        assert check_maybe_log_save_evaluate_is_patchable()
-
-
-if __name__ == "__main__":
-    unittest.main()