Wing Lian
e412370877
roundup_power2_divisions not needed with newer pytorch versions ( #3540 )
...
* roundup_power2_divisions not needed with newer pytorch versions
* remove typo
* update qwen3.5 moe 35b-a3b yaml for 5090
* more bug fixes
* fix tests to match updated trainer
* don't use fa2 for hooks test
* reset plugins on the instance
* retry download
* fix references to renamed axolotl_cfg property on trainer
* Fix ref to trainer cfg
2026-03-24 15:40:05 -04:00
..
2026-03-21 22:46:10 -04:00
2026-03-23 02:26:10 -04:00
2026-03-22 13:53:19 -04:00
2026-03-22 13:53:19 -04:00
2026-01-27 17:08:24 -05:00
2023-11-06 18:33:01 -05:00
2023-09-15 15:46:54 -04:00
2026-01-27 17:08:24 -05:00
2026-03-21 22:46:10 -04:00
2026-03-24 15:40:05 -04:00
2026-03-21 22:46:10 -04:00
2026-01-27 17:08:24 -05:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2026-01-27 17:08:24 -05:00
2026-01-27 17:08:24 -05:00
2025-08-23 23:37:33 -04:00
2026-01-27 17:08:24 -05:00
2026-01-27 17:08:24 -05:00
2026-03-16 23:47:00 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2026-01-27 17:08:24 -05:00
2025-08-23 23:37:33 -04:00
2026-03-21 22:46:10 -04:00
2026-03-21 22:46:10 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-11-07 08:21:20 -05:00
2025-08-23 23:37:33 -04:00
2025-07-14 20:11:11 -04:00
2026-03-05 13:40:45 -05:00
2026-03-21 22:46:10 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2026-01-27 17:08:24 -05:00
2025-08-23 23:37:33 -04:00
2026-02-10 17:44:17 +07:00
2025-08-26 09:30:04 -04:00
2026-03-22 13:54:03 -04:00