Wing Lian | e50a64e85e | prepared dataset caching, other misc fixes (#665): prepared dataset caching, other misc fixes; also don't load from disk cache unless explicit | 2023-10-02 21:07:24 -04:00
Doan Minh Phuong | 1aa400721e | Fix Codellama examples (#582): Fix seq_len; Update lora.yml; Update qlora.yml; Update lora.yml; Update lora.yml; Update qlora.yml | 2023-09-15 04:19:13 -04:00
Wing Lian | 343714972b | recommend padding when using sample packing (#531) | 2023-09-06 17:00:21 -04:00
mhenrichsen | 35130711d6 | Feat(cfg): Add code-llama configs for all sizes (#479): configs for all sizes; update tokenizer type. Co-authored-by: mhenrichsen <some_email@hey.com> | 2023-08-27 10:20:17 +09:00