Dan Saunders
|
1b53c49e1a
|
text diffusion training plugin (#3067)
* diffusion training plugin
* cleanup
* nits
* fixes + improvements
* add back in reinit_weights (clobbered?); masking / pretrain fixes
* nits
* cleanup; tests draft
* sample generation, tests fixes
* fixes
* nits
* add inference support; add auto-mask token support
* nits
* nits
* progress
* simplify logging
* lint
* prefix args with diffusion_
* coderabbito
* tests fix
* nit
* nits
* cleanup + nits
* nits
* fix SFT sample gen
* fixes
* fix
* comments
* comments
* lint
* reward model lora fix
* cleanup; fix pretraining_dataset case
* gradio inference
* update cfgs
* update cfgs
* train, generation parity, cleanup
* fix
* simplify
* test
* test fix
|
2025-09-10 20:27:00 -04:00 |
|
Dan Saunders
|
231a67e70b
|
Streaming SFT support (#3101)
* working
* fixes
* deprecate --iterable; cleanup
* pretrain_multipack_buffer_size -> streaming_multipack_buffer_size
* improvements
* tests
* remove unused
* docs, examples
* nit
* nit
* add val_set_size validation
* val
* nit
* min
* coderabbito
* cleanup
* nit
* add depr warning, cleanup
* nit
* fix test, fix quarto
* fix
* review comments
* review comments
* fix
|
2025-09-02 12:08:44 -04:00 |
|