axolotl/examples
NanoCode012 80d5b066ec Fix: adding magistral fsdp config, fixing not eval with test_datasets, handle mllama attention (#2789) [skip ci]
* feat: add fsdp config for magistral

* fix: add mllama self attention handling for lora kernels

* fix: no eval if val_set_size 0 despite having test_datasets

* fix: add note for cce for vlm in newer model
2025-06-14 11:53:43 -07:00