Wing Lian
|
f6721baf10
|
tweak to make it work when we have no explicit test split
pre-commit / pre-commit (push) Has been cancelled
PyTest / test (3.10) (push) Has been cancelled
PyTest / test (3.9) (push) Has been cancelled
|
2023-07-11 22:40:21 -04:00 |
|
Wing Lian
|
33814cc94e
|
make sure we eval for openorca
|
2023-07-02 17:59:10 -04:00 |
|
Wing Lian
|
50254a7ccc
|
handle orca splits
|
2023-07-01 07:20:23 -04:00 |
|
Wing Lian
|
3a783c04e4
|
Merge pull request #247 from OpenAccess-AI-Collective/fix-apex-base
update pip install command for apex
|
2023-07-01 06:18:25 -04:00 |
|
Wing Lian
|
1e5014acec
|
Merge pull request #255 from OpenAccess-AI-Collective/open-orca-prompts
open orca support
|
2023-07-01 01:11:23 -04:00 |
|
Wing Lian
|
a10da1caff
|
11.7.0 nvidia/cuda docker images are deprecated, move to 11.7.1
ci-cd-base / build-base (<nil>, 117, 11.7.1, 3.9, 1.13.1) (push) Has been cancelled
ci-cd-base / build-base (<nil>, 118, 11.8.0, 3.10, 2.0.0) (push) Has been cancelled
ci-cd-base / build-base (<nil>, 118, 11.8.0, 3.9, 2.0.0) (push) Has been cancelled
ci-cd-base / build-base (gptq, 118, 11.8.0, 3.9, 2.0.0) (push) Has been cancelled
pre-commit / pre-commit (push) Has been cancelled
PyTest / test (3.10) (push) Has been cancelled
PyTest / test (3.9) (push) Has been cancelled
|
2023-07-01 00:29:07 -04:00 |
|
Wing Lian
|
4066c78631
|
Merge pull request #246 from OpenAccess-AI-Collective/sys-prompts-instruct
add option for instruct w sys prompts
|
2023-07-01 00:27:29 -04:00 |
|
Wing Lian
|
78a1e1fa12
|
open orca support
|
2023-07-01 00:19:41 -04:00 |
|
NanoCode012
|
bc8a2e5547
|
Merge pull request #249 from OpenAccess-AI-Collective/NanoCode012-patch-1
Fix typing list in prompt tokenizer
|
2023-06-30 15:01:41 +09:00 |
|
NanoCode012
|
910ebe47f5
|
Merge pull request #252 from OpenAccess-AI-Collective/NanoCode012-readme-fix
Add cfg.push_to_hub_model_id to readme
|
2023-06-30 14:56:55 +09:00 |
|
NanoCode012
|
c146880a75
|
Update README.md
|
2023-06-30 11:33:53 +09:00 |
|
NanoCode012
|
77bdb7d144
|
Fix typing list
|
2023-06-29 14:29:55 +09:00 |
|
Wing Lian
|
530809fd74
|
update pip install command for apex
|
2023-06-28 22:36:28 -04:00 |
|
Wing Lian
|
924bbfddec
|
add option for instruct w sys prompts
|
2023-06-28 22:27:17 -04:00 |
|
Wing Lian
|
f150c027e3
|
Merge pull request #224 from OpenAccess-AI-Collective/system-prompt-data
System prompt data
|
2023-06-27 17:57:43 -04:00 |
|
Wing Lian
|
5c39c006c9
|
Merge pull request #244 from OpenAccess-AI-Collective/push-to-hub
push intermediate model checkpoints to hub
|
2023-06-27 17:57:30 -04:00 |
|
Wing Lian
|
612aabd8c4
|
push intermediate model checkpoints to hub
|
2023-06-27 15:40:25 -04:00 |
|
Wing Lian
|
af05883f75
|
Merge pull request #243 from OpenAccess-AI-Collective/unprompted-instruct
skip the system prompt
|
2023-06-25 22:50:35 -04:00 |
|
Wing Lian
|
05ab9092e3
|
skip the system prompt
|
2023-06-25 22:40:50 -04:00 |
|
Wing Lian
|
7b57ed7618
|
pylint for duplicated code for system prompts
|
2023-06-25 22:28:07 -04:00 |
|
Wing Lian
|
3a38271276
|
add tests and supoort for loader for sys prompt data
|
2023-06-25 22:28:07 -04:00 |
|
Wing Lian
|
8d20e0a3d3
|
initial wip to get sys prompt from dataset
|
2023-06-25 22:28:07 -04:00 |
|
Wing Lian
|
de8ed229c3
|
Merge pull request #240 from OpenAccess-AI-Collective/tokenizer-fast
optionally define whether to use_fast tokenizer
|
2023-06-25 12:47:55 -04:00 |
|
Wing Lian
|
478d8c7b8e
|
Merge pull request #241 from OpenAccess-AI-Collective/py3-pre-commit
better py3 support w pre-commit
|
2023-06-25 12:47:02 -04:00 |
|
Wing Lian
|
645c13592c
|
better py3 support w pre-commit
|
2023-06-25 10:26:02 -04:00 |
|
Wing Lian
|
47d601fa23
|
optionally define whether to use_fast tokenizer
|
2023-06-25 10:19:49 -04:00 |
|
Wing Lian
|
756dfba97b
|
Merge pull request #218 from OpenAccess-AI-Collective/no-fail-fast
don't fail fast
|
2023-06-23 15:42:54 -04:00 |
|
Wing Lian
|
91ab0592af
|
Merge pull request #235 from msinha251/Fixing-data-readme
|
2023-06-23 13:52:01 -04:00 |
|
Mahesh Sinha
|
0aeb7c7802
|
Fixing Data Readme
|
2023-06-21 15:34:48 +02:00 |
|
Wing Lian
|
d35278aaf1
|
don't fail fast
|
2023-06-15 16:01:27 -04:00 |
|
Wing Lian
|
9492d4ebb7
|
Merge pull request #215 from OpenAccess-AI-Collective/adamw-hyperparams-cfg
support adamw and grad norm hyperparams
|
2023-06-15 12:20:55 -04:00 |
|
Wing Lian
|
ad5ca4f734
|
Additional test case per pr
|
2023-06-15 10:12:47 -04:00 |
|
Wing Lian
|
cb9d3af5c0
|
add validation and tests for adamw hyperparam
|
2023-06-15 09:39:42 -04:00 |
|
Wing Lian
|
c969f0a9dc
|
add docs
|
2023-06-15 08:43:20 -04:00 |
|
Wing Lian
|
6d0ee4ba34
|
support adamw and grad norm hyperparams
|
2023-06-15 08:40:41 -04:00 |
|
Wing Lian
|
a81f52d575
|
Merge pull request #212 from OpenAccess-AI-Collective/doc-20230615-v1
add float16 docs and tweak typehints
|
2023-06-15 08:28:57 -04:00 |
|
Wing Lian
|
1925eaf1e6
|
Merge pull request #214 from OpenAccess-AI-Collective/fix-tokenizing-labels
Fix tokenizing labels
|
2023-06-15 08:13:43 -04:00 |
|
Wing Lian
|
1ab3bf3e67
|
fix test name
|
2023-06-15 02:09:33 -04:00 |
|
Wing Lian
|
d7635b7148
|
hint to what AMP means
|
2023-06-15 02:06:27 -04:00 |
|
Wing Lian
|
88e17ffc50
|
add float16 docs and tweak typehints
|
2023-06-15 02:05:31 -04:00 |
|
Wing Lian
|
baed440fa1
|
ingore duplicate code in tests
|
2023-06-15 02:03:53 -04:00 |
|
Wing Lian
|
7925ddce86
|
bugfix for potential off by one
|
2023-06-15 01:59:33 -04:00 |
|
Wing Lian
|
6f849809c5
|
Merge pull request #206 from MaciejKarasek/issue205
issue #205 bugfix
|
2023-06-14 14:23:38 -04:00 |
|
Wing Lian
|
c16644d05e
|
Merge pull request #209 from sroecker/fix_redpajama_example_tokenizer
Use AutoTokenizer for redpajama example
|
2023-06-14 14:23:21 -04:00 |
|
Steffen Röcker
|
945c4191a3
|
Use AutoTokenizer for redpajama example
|
2023-06-14 20:09:26 +02:00 |
|
maciej.karasek
|
136522f9c9
|
style correction
|
2023-06-14 20:02:09 +02:00 |
|
maciej.karasek
|
556fe408b3
|
issue #205 bugfix
|
2023-06-14 16:59:57 +02:00 |
|
Wing Lian
|
16bb6276a5
|
Merge pull request #92 from OpenAccess-AI-Collective/flash-optimum
add support for opimum bettertransformers
|
2023-06-14 07:50:15 -04:00 |
|
NanoCode012
|
06674a11f2
|
Merge pull request #202 from OpenAccess-AI-Collective/NanoCode012-patch-1
Fix sharegpt type in doc
|
2023-06-14 09:48:35 +09:00 |
|
NanoCode012
|
3513885f43
|
Fix sharegpt type
|
2023-06-14 01:10:58 +09:00 |
|