Wing Lian
|
ac77da96da
|
use smaller pretrained models for ci (#3620) [skip ci]
* use smaller pretrained models for ci
* more steps for loss check
* fix tests
* more train steps
* fix losses
|
2026-04-27 13:22:56 -04:00 |
|
Lê Nam Khánh
|
80270a92fa
|
Fix typos in some files (#3250) [skip ci]
|
2025-11-07 08:21:20 -05:00 |
|
Dan Saunders
|
79ddaebe9a
|
Add ruff, remove black, isort, flake8, pylint (#3092)
* black, isort, flake8 -> ruff
* remove unused
* add back needed import
* fix
|
2025-08-23 23:37:33 -04:00 |
|
Wing Lian
|
ca4d4ef793
|
don't init distributed for deepspeed if preprocessing (#2920)
* don't init distributed for deepspeed if preprocessing
* add e2e test to validate preprocess cli with deepspeed
* ignore duplicate code for cfg
|
2025-07-14 14:19:19 -04:00 |
|