Files
axolotl/examples
Wing Lian ecbe8b2b61 [GPT-OSS] improve FSDP shard merging and documentation for GPT-OSS (#3073)
* improve fsdp shard merging

* improve logging

* update information on merging and inferencing GPT-OSS

* cleanup readme

* automate cleanup of FSDP prefix

* import GRPO only if necessary

* only modify config.json on rank0

* merge final checkpoint at end of training

* prevent circular import

* Fix saving for sharded state dict

* devx, move merged to output dir

* move import back to top

* Fix stuck merge

* fix conditionals from pr feedback and add test
2025-08-15 21:25:01 -04:00
..
2025-07-31 15:25:02 -04:00
2025-08-08 08:09:11 -04:00
2025-08-15 10:52:57 -04:00
2025-08-08 08:09:11 -04:00
2025-08-15 10:52:57 -04:00