salman
294c7fe7a6
Distributed/ND-Parallel ( #2977 )
2025-07-31 15:25:02 -04:00
Dan Saunders
b5f1e53a0f
models.py -> loaders/ module refactor ( #2680 )
* models.py -> loaders/ module refactor
* refactor ModelLoader class
* plugin manager changes
* circular import fix
* pytest
* pytest
* minor improvements
* fix
* minor changes
* fix test
* remove dead code
* coderabbit comments
* lint
* fix
* coderabbit suggestion I liked
* more coderabbit
* review comments, yak shaving
* lint
* updating in light of SP ctx manager changes
* review comment
* review comment 2
2025-05-23 15:51:11 -04:00
Wing Lian
40f4ea23ab
replace references to random 68m model w 135m smollm2 ( #2570 ) [skip ci]
* replace references to random 68m model w 135m smollm2
* use AutoTokenizer for smollm2
2025-04-28 10:08:07 -04:00
Wing Lian
de8a625dd7
make e2e tests a bit faster by reducing test split size ( #2522 ) [skip ci]
* [ci] make e2e tests a bit faster by reducing test split size
* use 10% split of alpaca dataset to speed up dataset loading/tokenization
* reduce gas 4->2 for most e2e tests
* increase val set size for packing
2025-04-12 07:24:43 -07:00
salman
ac471a697a
updating to fused ( #2293 )
2025-01-30 11:45:56 -05:00
Mengqing Cao
1d6a5e2bd6
Refactor func load_model to class ModelLoader ( #1909 )
2024-10-25 09:06:56 -04:00