Wing Lian
e412370877
roundup_power2_divisions not needed with newer pytorch versions ( #3540 )
...
* roundup_power2_divisions not needed with newer pytorch versions
* remove typo
* update qwen3.5 moe 35b-a3b yaml for 5090
* more bug fixes
* fix tests to match updated trainer
* don't use fa2 for hooks test
* reset plugins on the instance
* retry download
* fix references to renamed axolotl_cfg property on trainer
* Fix ref to trainer cfg
2026-03-24 15:40:05 -04:00
..
2026-02-23 10:10:06 -05:00
2026-03-22 13:53:19 -04:00
2026-03-24 15:40:05 -04:00
2023-12-12 09:39:22 -08:00
2026-03-24 15:40:05 -04:00
2026-03-22 13:19:21 -04:00
2026-03-17 11:42:47 -04:00
2026-03-21 22:47:26 -04:00
2026-03-21 22:47:26 -04:00
2026-03-06 09:11:20 -05:00
2026-03-22 13:53:19 -04:00
2025-03-31 13:40:12 +07:00
2026-03-24 15:40:05 -04:00
2025-08-23 23:37:33 -04:00
2026-01-27 17:08:24 -05:00
2025-08-23 23:37:33 -04:00
2026-03-05 12:33:28 -05:00
2026-03-16 00:12:40 -04:00
2026-02-25 11:31:11 +07:00
2025-12-22 13:59:49 -05:00
2025-08-23 23:37:33 -04:00
2025-10-13 17:18:12 +07:00
2024-03-14 11:05:42 -04:00
2025-10-16 16:07:27 +07:00
2025-09-17 13:27:03 -04:00
2025-08-23 23:37:33 -04:00
2026-01-27 17:08:24 -05:00
2025-10-22 19:16:55 -07:00
2025-08-23 23:37:33 -04:00
2025-10-13 17:18:12 +07:00
2025-09-02 12:08:44 -04:00
2026-01-27 17:08:24 -05:00
2025-08-23 23:37:33 -04:00
2024-08-22 11:46:57 -04:00
2026-02-25 11:11:20 +07:00
2026-03-02 12:55:59 -05:00
2025-08-23 23:37:33 -04:00
2025-09-10 20:27:00 -04:00
2026-03-06 11:40:32 -05:00
2026-03-06 09:11:20 -05:00
2025-07-14 10:05:26 -04:00
2026-03-19 02:02:43 -04:00
2025-09-17 13:27:03 -04:00
2026-03-16 23:47:00 -04:00