VED
bb622b83de
super nemo support (#3508)
* nemo support
* config
* rename , config
* nemotron packing
* config fix
* read me + configs
* gc compat bug
* config chnages for qwen and pad token nemo
* patch nemotron_h weight renaming so it doesn't get reversed to embedding (singular noun) on checkpoint save
* lint
* revert qwen3.5 config changes, not needed in this pr
* lint
* Update examples/nemotron-h/120b-a12b-qlora.yaml
Co-authored-by: NanoCode012 <kevinvong@rocketmail.com>
* Update examples/nemotron-h/nano-30b-a3b-qlora.yaml
Co-authored-by: NanoCode012 <kevinvong@rocketmail.com>
* readme + validation
* lazy load comment
* Update examples/nemotron-h/120b-a12b-qlora.yaml
Co-authored-by: NanoCode012 <kevinvong@rocketmail.com>
* val fix
* add nemo to multi packing
---------
Co-authored-by: Wing Lian <wing@axolotl.ai>
Co-authored-by: NanoCode012 <kevinvong@rocketmail.com>
2026-03-30 18:12:50 -04:00
..
2025-09-24 16:13:49 -04:00
2026-01-21 20:00:18 -05:00
2026-01-21 20:00:18 -05:00
2025-09-23 21:22:15 +07:00
2025-08-26 09:29:50 -04:00
2025-07-21 11:40:56 -04:00
2026-03-19 23:07:42 -04:00
2025-09-23 21:22:15 +07:00
2025-07-30 06:44:06 -04:00
2026-01-21 20:00:18 -05:00
2025-08-08 12:45:36 +01:00
2026-01-28 06:44:15 -05:00
2026-03-25 07:38:06 -04:00
2025-07-21 11:40:56 -04:00
2025-07-21 11:40:56 -04:00
2026-03-20 16:14:06 +07:00
2026-01-21 20:00:18 -05:00
2025-07-30 06:44:06 -04:00
2026-03-05 09:58:09 -05:00
2026-02-10 17:43:53 +07:00
2026-03-05 09:58:09 -05:00
2026-01-21 20:00:18 -05:00
2026-01-21 20:00:18 -05:00
2026-01-21 20:00:18 -05:00
2025-12-25 18:07:59 +07:00
2026-01-27 17:08:24 -05:00
2025-12-25 17:53:52 +07:00
2025-10-09 10:47:41 -04:00
2025-09-26 10:23:59 +01:00
2026-03-05 13:40:45 -05:00
2025-07-22 10:00:30 -04:00
2025-09-23 21:22:15 +07:00
2025-07-22 10:00:30 -04:00
2026-01-21 20:00:18 -05:00
2026-01-27 17:08:24 -05:00
2025-12-25 18:09:03 +07:00
2025-12-04 21:44:44 +07:00
2026-01-13 14:33:11 +07:00
2025-12-25 19:17:25 +07:00
2026-03-20 17:11:46 +07:00
2025-12-25 19:17:25 +07:00
2026-03-20 16:23:42 +07:00
2026-03-30 18:12:50 -04:00
2025-12-25 17:56:20 +07:00
2025-07-30 06:44:06 -04:00
2025-09-23 21:22:15 +07:00
2025-09-18 15:42:20 +07:00
2025-12-25 18:09:03 +07:00
2025-12-17 09:35:22 -05:00
2025-12-19 10:43:47 -05:00
2025-09-23 21:22:15 +07:00
2025-09-23 21:22:15 +07:00
2025-12-09 14:31:03 +07:00
2026-03-03 09:26:46 -05:00
2026-03-24 15:40:05 -04:00
2025-11-24 10:21:31 +07:00
2025-08-08 08:02:03 -04:00
2025-11-24 10:21:31 +07:00
2025-09-02 12:08:44 -04:00
2026-01-06 09:19:18 -05:00
2026-03-03 10:06:23 -05:00
2026-01-21 20:00:18 -05:00