Dan Saunders
1b53c49e1a
text diffusion training plugin (#3067)
* diffusion training plugin
* cleanup
* nits
* fixes + improvements
* add back in reinit_weights (clobbered?); masking / pretrain fixes
* nits
* cleanup; tests draft
* sample generation, tests fixes
* fixes
* nits
* add inference support; add auto-mask token support
* nits
* nits
* progress
* simplify logging
* lint
* prefix args with diffusion_
* coderabbito
* tests fix
* nit
* nits
* cleanup + nits
* nits
* fix SFT sample gen
* fixes
* fix
* comments
* comments
* lint
* reward model lora fix
* cleanup; fix pretraining_dataset case
* gradio inference
* update cfgs
* update cfgs
* train, generation parity, cleanup
* fix
* simplify
* test
* test fix
2025-09-10 20:27:00 -04:00