Wing Lian | 8b79ff0e94 | fix eval_steps to be a sane default (#797) * fix eval_steps to be a sane default * update docs for fractional eval_steps | 2023-10-27 22:36:30 -04:00 |
Wing Lian | 2d8def68dc | simplify by removing duplicate base_model_config (#772) | 2023-10-23 01:42:38 -04:00 |
Wing Lian | e50a64e85e | prepared dataset caching, other misc fixes (#665) * prepared dataset caching, other misc fixes * also don't load from disk cache unless explicit | 2023-10-02 21:07:24 -04:00 |
mhenrichsen | 4fecbfe5e1 | default model changed | 2023-09-24 18:52:53 +02:00 |
Wing Lian | 343714972b | recommend padding when using sample packing (#531) | 2023-09-06 17:00:21 -04:00 |
Charles O. Goddard | fe4d6baf92 | Add example Llama 2 ReLoRA config (#471) * Add example Llama 2 ReLoRA config * Use adamw_bnb_8bit in example relora config | 2023-08-27 10:08:34 +09:00 |