Wing Lian
|
e50a64e85e
|
prepared dataset caching, other misc fixes (#665)
* prepared dataset caching, other misc fixes
* also don't load from disk cache unless explicit
|
2023-10-02 21:07:24 -04:00 |
|
Birch-san
|
8e197f6fb4
|
pad_to_worst_case_seq_len boolean, for testing memory limits (#498)
* pad_to_worst_case_seq_len boolean, for testing memory limits
* remove collator_pad_to_longest option since it does nothing
see docs: https://huggingface.co/docs/transformers/main_classes/data_collator#transformers.DataCollatorWithPadding.padding
True and "longest" mean the same thing
* rename to `pad_to_sequence_len`, and ensure 64 alignment
---------
Co-authored-by: Aman Karmani <aman@tmm1.net>
|
2023-08-28 18:47:16 -04:00 |
|
Morgan McGuire
|
7019509daa
|
Add wandb_entity to wandb options, update example configs, update README (#361)
* Update wandb_entity and add wandb descriptions
* add wandb to config section
* remove trailing whitespace for pre-commit hook
* remove trailing whitespace for pre-commit hook
---------
Co-authored-by: Morgan McGuire <morganmcguire@Morgans-MacBook-Pro.local>
Co-authored-by: Wing Lian <wing.lian@gmail.com>
|
2023-08-12 12:17:11 -04:00 |
|
Wing Lian
|
0c6f928601
|
address PR feedback
|
2023-06-10 14:23:56 -04:00 |
|
Wing Lian
|
1db46a9c72
|
linting fix
|
2023-06-10 14:23:56 -04:00 |
|
Wing Lian
|
39619028a3
|
use pythia-12b, neox-20b is flaky
|
2023-06-10 14:22:30 -04:00 |
|