| Author | Commit | Message | Date |
|---|---|---|---|
| Wing Lian | 8b79ff0e94 | fix eval_steps to be a sane default (#797)<br>* fix eval_steps to be a sane default<br>* update docs for fractional eval_steps | 2023-10-27 22:36:30 -04:00 |
| Wing Lian | 9b43e7ea15 | disable eval table w sample packing in examples (#778) | 2023-10-23 09:18:44 -04:00 |
| Wing Lian | 2d8def68dc | simplify by removing duplicate base_model_config (#772) | 2023-10-23 01:42:38 -04:00 |
| mhenrichsen | f91db198f3 | fix unneeded space (#699) | 2023-10-07 14:19:25 -04:00 |
| mhenrichsen | 83a950bb87 | lint | 2023-10-07 11:04:35 +02:00 |
| mhenrichsen | 4c8ddf2c6f | new lr, sample pack | 2023-10-06 22:58:13 +02:00 |
| Wing Lian | e50a64e85e | prepared dataset caching, other misc fixes (#665)<br>* prepared dataset caching, other misc fixes<br>* also don't load from disk cache unless explicit | 2023-10-02 21:07:24 -04:00 |
| NanoCode012 | eb41f76f92 | Feat: Add example for Mistral (#644)<br>* Feat: Add example for Mistral<br>* chore: turn off flash<br>* chore: add is_mistral_derived_model<br>* chore: update following PR | 2023-09-28 20:15:00 +09:00 |