Charles Goddard
|
8bba64258e
|
Add example of dataset with configuration name to README
|
2023-07-14 20:46:21 -07:00 |
|
Charles Goddard
|
88089e8b32
|
Add ability to pass 'name' argument to load_dataset
|
2023-07-14 16:46:39 -07:00 |
|
NanoCode012
|
168a7a09cc
|
Merge pull request #274 from OpenAccess-AI-Collective/NanoCode012-patch-2
Feat: Set push to hub as private by default
|
2023-07-14 23:15:47 +09:00 |
|
NanoCode012
|
231031a0e1
|
Merge pull request #275 from NanoCode012/feat/safetensors
Feat: Add save_safetensors
|
2023-07-14 23:07:26 +09:00 |
|
NanoCode012
|
5daf7d5299
|
Merge pull request #273 from OpenAccess-AI-Collective/NanoCode012-patch-1
Feat(docs): Add model_revision arg
|
2023-07-14 21:09:50 +09:00 |
|
NanoCode012
|
5491278a79
|
Feat: Add save_safetensors
|
2023-07-14 13:21:47 +09:00 |
|
NanoCode012
|
1514739f0f
|
Set push to hub as private by default
|
2023-07-14 13:17:49 +09:00 |
|
NanoCode012
|
896c1aebcf
|
Feat(docs): Add model_revision arg
|
2023-07-14 12:56:07 +09:00 |
|
Wing Lian
|
ef17e15483
|
Merge pull request #272 from OpenAccess-AI-Collective/model-revision
support for loading a model by git revision
|
2023-07-13 23:12:00 -04:00 |
|
Wing Lian
|
69a235061b
|
support for loading a model by git revision
|
2023-07-13 22:58:25 -04:00 |
|
Wing Lian
|
687d889928
|
Merge pull request #271 from OpenAccess-AI-Collective/quadratic-warmup
Quadratic warmup
|
2023-07-10 12:48:02 -04:00 |
|
Wing Lian
|
c4cf567b55
|
Merge branch 'main' into quadratic-warmup
|
2023-07-10 12:42:12 -04:00 |
|
Wing Lian
|
c49729d2bc
|
better configuration for quadratic warmup
|
2023-07-10 11:52:59 -04:00 |
|
Wing Lian
|
13ac4d8de2
|
Merge pull request #268 from OpenAccess-AI-Collective/fix-adam-args
params are adam_*, not adamw_*
|
2023-07-08 12:33:34 -04:00 |
|
Wing Lian
|
19cf0bda99
|
params are adam_*, not adamw_*
|
2023-07-08 12:13:39 -04:00 |
|
Wing Lian
|
f74edd5b56
|
Merge pull request #266 from OpenAccess-AI-Collective/trust-remote-no-llama
|
2023-07-07 21:38:11 -04:00 |
|
Wing Lian
|
d69da99c2c
|
skip explicit model type too if using trust_remote_code
|
2023-07-07 21:33:11 -04:00 |
|
Wing Lian
|
66afb76a15
|
don't use llama if trust_remote_code is set since that needs to use AutoModel path
|
2023-07-07 21:31:02 -04:00 |
|
NanoCode012
|
a692ad3f4c
|
Merge pull request #264 from OpenAccess-AI-Collective/NanoCode012-patch-1
Fix(readme): local path loading and custom strategy type
|
2023-07-06 23:34:57 +09:00 |
|
NanoCode012
|
41da98b982
|
Fix for linter
|
2023-07-06 23:20:11 +09:00 |
|
NanoCode012
|
9e64f42e0f
|
Fix local path loading and custom strategy type
|
2023-07-06 23:08:09 +09:00 |
|
Wing Lian
|
b9b7d4ce92
|
Merge pull request #221 from utensil/local_dataset
[WIP] Support loading data files from a local directory
|
2023-07-03 09:10:13 -04:00 |
|
Wing Lian
|
9bed281867
|
Merge pull request #258 from NanoCode012/fix/deprecate-push
Fix future deprecation push_to_hub_model_id
|
2023-07-03 09:08:26 -04:00 |
|
NanoCode012
|
e79c8e617e
|
Fix future deprecation push_to_hub_model_id
|
2023-07-03 12:44:29 +09:00 |
|
Wing Lian
|
71456955f5
|
pin pydantic so deepspeed isn't broken
|
2023-07-02 22:26:51 -04:00 |
|
Wing Lian
|
3a783c04e4
|
Merge pull request #247 from OpenAccess-AI-Collective/fix-apex-base
update pip install command for apex
|
2023-07-01 06:18:25 -04:00 |
|
Wing Lian
|
1e5014acec
|
Merge pull request #255 from OpenAccess-AI-Collective/open-orca-prompts
open orca support
|
2023-07-01 01:11:23 -04:00 |
|
Wing Lian
|
a10da1caff
|
11.7.0 nvidia/cuda docker images are deprecated, move to 11.7.1
ci-cd-base / build-base (<nil>, 117, 11.7.1, 3.9, 1.13.1) (push) Has been cancelled
ci-cd-base / build-base (<nil>, 118, 11.8.0, 3.10, 2.0.0) (push) Has been cancelled
ci-cd-base / build-base (<nil>, 118, 11.8.0, 3.9, 2.0.0) (push) Has been cancelled
ci-cd-base / build-base (gptq, 118, 11.8.0, 3.9, 2.0.0) (push) Has been cancelled
pre-commit / pre-commit (push) Has been cancelled
PyTest / test (3.10) (push) Has been cancelled
PyTest / test (3.9) (push) Has been cancelled
|
2023-07-01 00:29:07 -04:00 |
|
Wing Lian
|
4066c78631
|
Merge pull request #246 from OpenAccess-AI-Collective/sys-prompts-instruct
add option for instruct w sys prompts
|
2023-07-01 00:27:29 -04:00 |
|
Wing Lian
|
78a1e1fa12
|
open orca support
|
2023-07-01 00:19:41 -04:00 |
|
NanoCode012
|
bc8a2e5547
|
Merge pull request #249 from OpenAccess-AI-Collective/NanoCode012-patch-1
Fix typing list in prompt tokenizer
|
2023-06-30 15:01:41 +09:00 |
|
NanoCode012
|
910ebe47f5
|
Merge pull request #252 from OpenAccess-AI-Collective/NanoCode012-readme-fix
Add cfg.push_to_hub_model_id to readme
|
2023-06-30 14:56:55 +09:00 |
|
NanoCode012
|
c146880a75
|
Update README.md
|
2023-06-30 11:33:53 +09:00 |
|
NanoCode012
|
77bdb7d144
|
Fix typing list
|
2023-06-29 14:29:55 +09:00 |
|
Wing Lian
|
530809fd74
|
update pip install command for apex
|
2023-06-28 22:36:28 -04:00 |
|
Wing Lian
|
924bbfddec
|
add option for instruct w sys prompts
|
2023-06-28 22:27:17 -04:00 |
|
Wing Lian
|
f150c027e3
|
Merge pull request #224 from OpenAccess-AI-Collective/system-prompt-data
System prompt data
|
2023-06-27 17:57:43 -04:00 |
|
Wing Lian
|
5c39c006c9
|
Merge pull request #244 from OpenAccess-AI-Collective/push-to-hub
push intermediate model checkpoints to hub
|
2023-06-27 17:57:30 -04:00 |
|
Wing Lian
|
612aabd8c4
|
push intermediate model checkpoints to hub
|
2023-06-27 15:40:25 -04:00 |
|
Wing Lian
|
af05883f75
|
Merge pull request #243 from OpenAccess-AI-Collective/unprompted-instruct
skip the system prompt
|
2023-06-25 22:50:35 -04:00 |
|
Wing Lian
|
05ab9092e3
|
skip the system prompt
|
2023-06-25 22:40:50 -04:00 |
|
Wing Lian
|
7b57ed7618
|
pylint for duplicated code for system prompts
|
2023-06-25 22:28:07 -04:00 |
|
Wing Lian
|
3a38271276
|
add tests and supoort for loader for sys prompt data
|
2023-06-25 22:28:07 -04:00 |
|
Wing Lian
|
8d20e0a3d3
|
initial wip to get sys prompt from dataset
|
2023-06-25 22:28:07 -04:00 |
|
Wing Lian
|
de8ed229c3
|
Merge pull request #240 from OpenAccess-AI-Collective/tokenizer-fast
optionally define whether to use_fast tokenizer
|
2023-06-25 12:47:55 -04:00 |
|
Wing Lian
|
478d8c7b8e
|
Merge pull request #241 from OpenAccess-AI-Collective/py3-pre-commit
better py3 support w pre-commit
|
2023-06-25 12:47:02 -04:00 |
|
Wing Lian
|
645c13592c
|
better py3 support w pre-commit
|
2023-06-25 10:26:02 -04:00 |
|
Wing Lian
|
47d601fa23
|
optionally define whether to use_fast tokenizer
|
2023-06-25 10:19:49 -04:00 |
|
Wing Lian
|
756dfba97b
|
Merge pull request #218 from OpenAccess-AI-Collective/no-fail-fast
don't fail fast
|
2023-06-23 15:42:54 -04:00 |
|
Wing Lian
|
91ab0592af
|
Merge pull request #235 from msinha251/Fixing-data-readme
|
2023-06-23 13:52:01 -04:00 |
|