Wing Lian | 19cf0bda99 | params are adam_*, not adamw_* | 2023-07-08 12:13:39 -04:00
Wing Lian | ad5ca4f734 | Additional test case per pr | 2023-06-15 10:12:47 -04:00
Wing Lian | cb9d3af5c0 | add validation and tests for adamw hyperparam | 2023-06-15 09:39:42 -04:00
Wing Lian | fd2c9814c9 | Merge branch 'main' into flash-optimum | 2023-06-12 13:12:15 -04:00
Wing Lian | 14668fa54e | new validation for mpt w grad checkpoints | 2023-06-11 09:26:10 -04:00
Wing Lian | eea2731a5e | add streaming dataset support for pretraining datasets | 2023-06-10 14:23:56 -04:00
NanoCode012 | babf0fdb71 | Validate falcon with fsdp | 2023-06-09 00:29:04 +09:00
NanoCode012 | 3c71c8debe | Update doc for grad_accu and add validation tests for batch size | 2023-06-01 06:13:47 +09:00
Wing Lian | 6fa40bf8ad | black formatting | 2023-05-30 23:33:37 -04:00
Wing Lian | 3aad5f3b3e | add support for gradient accumulation steps | 2023-05-30 23:24:37 -04:00
NanoCode012 | 37293dce07 | Apply isort then black | 2023-05-31 02:53:53 +09:00
NanoCode012 | 0dd35c74af | Ignore unsupported-binary-operation | 2023-05-31 02:53:53 +09:00
NanoCode012 | b832a0ac62 | Black formatting | 2023-05-31 02:53:53 +09:00
NanoCode012 | 1f3c3f5ea0 | Lint validation | 2023-05-31 02:53:53 +09:00
Wing Lian | fd5f9656a2 | update for pr feedback | 2023-05-28 14:23:27 -04:00
Wing Lian | 1c33eb88a7 | new hf_use_auth_token setting so login to hf isn't required | 2023-05-28 13:08:49 -04:00
NanoCode012 | 52dd92a0cd | Feat: Update validate_config and add tests | 2023-05-29 00:25:54 +09:00