Wing Lian
|
8d20e0a3d3
|
initial wip to get sys prompt from dataset
|
2023-06-25 22:28:07 -04:00 |
|
Wing Lian
|
47d601fa23
|
optionally define whether to use_fast tokenizer
|
2023-06-25 10:19:49 -04:00 |
|
Wing Lian
|
ad5ca4f734
|
Additional test case per pr
|
2023-06-15 10:12:47 -04:00 |
|
Wing Lian
|
cb9d3af5c0
|
add validation and tests for adamw hyperparam
|
2023-06-15 09:39:42 -04:00 |
|
Wing Lian
|
1925eaf1e6
|
Merge pull request #214 from OpenAccess-AI-Collective/fix-tokenizing-labels
Fix tokenizing labels
|
2023-06-15 08:13:43 -04:00 |
|
Wing Lian
|
1ab3bf3e67
|
fix test name
|
2023-06-15 02:09:33 -04:00 |
|
Wing Lian
|
baed440fa1
|
ingore duplicate code in tests
|
2023-06-15 02:03:53 -04:00 |
|
Wing Lian
|
7925ddce86
|
bugfix for potential off by one
|
2023-06-15 01:59:33 -04:00 |
|
Wing Lian
|
fd2c9814c9
|
Merge branch 'main' into flash-optimum
|
2023-06-12 13:12:15 -04:00 |
|
Wing Lian
|
14668fa54e
|
new validation for mpt w grad checkpoints
|
2023-06-11 09:26:10 -04:00 |
|
Wing Lian
|
eea2731a5e
|
add streaming dataset support for pretraining datasets
|
2023-06-10 14:23:56 -04:00 |
|
NanoCode012
|
babf0fdb71
|
Validate falcon with fsdp
|
2023-06-09 00:29:04 +09:00 |
|
NanoCode012
|
3c71c8debe
|
Update doc for grad_accu and add validation tests for batch size
|
2023-06-01 06:13:47 +09:00 |
|
Wing Lian
|
0136f510f2
|
don't worry about duplicate code here
|
2023-05-31 12:05:43 -04:00 |
|
Wing Lian
|
9b8585dc70
|
fix packing so that concatenated sequences reset the attention
|
2023-05-31 11:38:52 -04:00 |
|
Wing Lian
|
6fa40bf8ad
|
black formatting
|
2023-05-30 23:33:37 -04:00 |
|
Wing Lian
|
3aad5f3b3e
|
add support for gradient accumulation steps
|
2023-05-30 23:24:37 -04:00 |
|
NanoCode012
|
b81c97ff76
|
Fix pre-commit for rebased files
|
2023-05-31 03:01:38 +09:00 |
|
Wing Lian
|
cfcc549f6b
|
fix relative path for fixtures
|
2023-05-31 02:55:21 +09:00 |
|
NanoCode012
|
37293dce07
|
Apply isort then black
|
2023-05-31 02:53:53 +09:00 |
|
NanoCode012
|
0dd35c74af
|
Ignore unsupported-binary-operation
|
2023-05-31 02:53:53 +09:00 |
|
NanoCode012
|
b832a0ac62
|
Black formatting
|
2023-05-31 02:53:53 +09:00 |
|
NanoCode012
|
1f3c3f5ea0
|
Lint validation
|
2023-05-31 02:53:53 +09:00 |
|
NanoCode012
|
0e952889dc
|
Lint test_dict
|
2023-05-31 02:53:53 +09:00 |
|
NanoCode012
|
7eb33a77dd
|
Lint test_prompters
|
2023-05-31 02:53:53 +09:00 |
|
NanoCode012
|
392dfd9b07
|
Lint and format
|
2023-05-31 02:53:22 +09:00 |
|
Wing Lian
|
e65aeedce7
|
fix relative path for fixtures
|
2023-05-30 10:38:20 -04:00 |
|
Wing Lian
|
e6fdeb087f
|
add unit test for sharegpt tokenization
|
2023-05-30 10:28:17 -04:00 |
|
Wing Lian
|
fd5f9656a2
|
update for pr feedback
|
2023-05-28 14:23:27 -04:00 |
|
Wing Lian
|
1c33eb88a7
|
new hf_use_auth_token setting so login to hf isn't required
|
2023-05-28 13:08:49 -04:00 |
|
NanoCode012
|
52dd92a0cd
|
Feat: Update validate_config and add tests
|
2023-05-29 00:25:54 +09:00 |
|
NanoCode012
|
f87bd20555
|
Fix incorrect syntax in test
|
2023-05-28 23:35:29 +09:00 |
|
NanoCode012
|
923151ffab
|
Add test for DictDefault
|
2023-05-28 23:06:10 +09:00 |
|
Wing Lian
|
d199d6c261
|
automated testing in github actions
|
2023-05-27 11:51:01 -04:00 |
|