Dan Saunders
1b53c49e1a
text diffusion training plugin (#3067)
* diffusion training plugin
* cleanup
* nits
* fixes + improvements
* add back in reinit_weights (clobbered?); masking / pretrain fixes
* nits
* cleanup; tests draft
* sample generation, tests fixes
* fixes
* nits
* add inference support; add auto-mask token support
* nits
* nits
* progress
* simplify logging
* lint
* prefix args with diffusion_
* coderabbito
* tests fix
* nit
* nits
* cleanup + nits
* nits
* fix SFT sample gen
* fixes
* fix
* comments
* comments
* lint
* reward model lora fix
* cleanup; fix pretraining_dataset case
* gradio inference
* update cfgs
* update cfgs
* train, generation parity, cleanup
* fix
* simplify
* test
* test fix
2025-09-10 20:27:00 -04:00