Dan Saunders
1b53c49e1a
text diffusion training plugin (#3067)
* diffusion training plugin
* cleanup
* nits
* fixes + improvements
* add back in reinit_weights (clobbered?); masking / pretrain fixes
* nits
* cleanup; tests draft
* sample generation, tests fixes
* fixes
* nits
* add inference support; add auto-mask token support
* nits
* nits
* progress
* simplify logging
* lint
* prefix args with diffusion_
* coderabbito
* tests fix
* nit
* nits
* cleanup + nits
* nits
* fix SFT sample gen
* fixes
* fix
* comments
* comments
* lint
* reward model lora fix
* cleanup; fix pretraining_dataset case
* gradio inference
* update cfgs
* update cfgs
* train, generation parity, cleanup
* fix
* simplify
* test
* test fix
2025-09-10 20:27:00 -04:00
..
2025-09-02 12:08:44 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2023-11-06 18:33:01 -05:00
2023-09-15 15:46:54 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-09-10 20:27:00 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-07-14 20:11:11 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-09-02 12:08:44 -04:00
2025-08-26 09:30:04 -04:00
2025-08-23 23:37:33 -04:00