Wing Lian
|
756dfba97b
|
Merge pull request #218 from OpenAccess-AI-Collective/no-fail-fast
don't fail fast
|
2023-06-23 15:42:54 -04:00 |
|
Wing Lian
|
91ab0592af
|
Merge pull request #235 from msinha251/Fixing-data-readme
|
2023-06-23 13:52:01 -04:00 |
|
Mahesh Sinha
|
0aeb7c7802
|
Fixing Data Readme
|
2023-06-21 15:34:48 +02:00 |
|
Wing Lian
|
d35278aaf1
|
don't fail fast
|
2023-06-15 16:01:27 -04:00 |
|
Wing Lian
|
9492d4ebb7
|
Merge pull request #215 from OpenAccess-AI-Collective/adamw-hyperparams-cfg
support adamw and grad norm hyperparams
|
2023-06-15 12:20:55 -04:00 |
|
Wing Lian
|
ad5ca4f734
|
Additional test case per pr
|
2023-06-15 10:12:47 -04:00 |
|
Wing Lian
|
cb9d3af5c0
|
add validation and tests for adamw hyperparam
|
2023-06-15 09:39:42 -04:00 |
|
Wing Lian
|
c969f0a9dc
|
add docs
|
2023-06-15 08:43:20 -04:00 |
|
Wing Lian
|
6d0ee4ba34
|
support adamw and grad norm hyperparams
|
2023-06-15 08:40:41 -04:00 |
|
Wing Lian
|
a81f52d575
|
Merge pull request #212 from OpenAccess-AI-Collective/doc-20230615-v1
add float16 docs and tweak typehints
|
2023-06-15 08:28:57 -04:00 |
|
Wing Lian
|
1925eaf1e6
|
Merge pull request #214 from OpenAccess-AI-Collective/fix-tokenizing-labels
Fix tokenizing labels
|
2023-06-15 08:13:43 -04:00 |
|
Wing Lian
|
1ab3bf3e67
|
fix test name
|
2023-06-15 02:09:33 -04:00 |
|
Wing Lian
|
d7635b7148
|
hint to what AMP means
|
2023-06-15 02:06:27 -04:00 |
|
Wing Lian
|
88e17ffc50
|
add float16 docs and tweak typehints
|
2023-06-15 02:05:31 -04:00 |
|
Wing Lian
|
baed440fa1
|
ingore duplicate code in tests
|
2023-06-15 02:03:53 -04:00 |
|
Wing Lian
|
7925ddce86
|
bugfix for potential off by one
|
2023-06-15 01:59:33 -04:00 |
|
Wing Lian
|
6f849809c5
|
Merge pull request #206 from MaciejKarasek/issue205
issue #205 bugfix
|
2023-06-14 14:23:38 -04:00 |
|
Wing Lian
|
c16644d05e
|
Merge pull request #209 from sroecker/fix_redpajama_example_tokenizer
Use AutoTokenizer for redpajama example
|
2023-06-14 14:23:21 -04:00 |
|
Steffen Röcker
|
945c4191a3
|
Use AutoTokenizer for redpajama example
|
2023-06-14 20:09:26 +02:00 |
|
maciej.karasek
|
136522f9c9
|
style correction
|
2023-06-14 20:02:09 +02:00 |
|
maciej.karasek
|
556fe408b3
|
issue #205 bugfix
|
2023-06-14 16:59:57 +02:00 |
|
Wing Lian
|
16bb6276a5
|
Merge pull request #92 from OpenAccess-AI-Collective/flash-optimum
add support for opimum bettertransformers
|
2023-06-14 07:50:15 -04:00 |
|
NanoCode012
|
06674a11f2
|
Merge pull request #202 from OpenAccess-AI-Collective/NanoCode012-patch-1
Fix sharegpt type in doc
|
2023-06-14 09:48:35 +09:00 |
|
NanoCode012
|
3513885f43
|
Fix sharegpt type
|
2023-06-14 01:10:58 +09:00 |
|
Wing Lian
|
06652c1c39
|
Merge pull request #196 from OpenAccess-AI-Collective/openllama-ft-config
pre-commit / pre-commit (push) Has been cancelled
PyTest / test (3.10) (push) Has been cancelled
PyTest / test (3.9) (push) Has been cancelled
tweak config to work
v0.2.1
|
2023-06-13 11:51:04 -04:00 |
|
NanoCode012
|
068fc48978
|
Merge pull request #199 from NanoCode012/chore/prompter-arg
chore: Refactor inf_kwargs out
|
2023-06-13 17:56:22 +09:00 |
|
Wing Lian
|
aaadacf6b3
|
Merge pull request #200 from PocketDocLabs/main
Update README.md to include a community showcase
|
2023-06-13 04:44:34 -04:00 |
|
PocketDoc Labs
|
5ff547dc70
|
Update README.md to include a community showcase
|
2023-06-12 22:38:10 -07:00 |
|
NanoCode012
|
dc77c8ebce
|
chore: Refactor inf_kwargs out
|
2023-06-13 12:01:46 +09:00 |
|
NanoCode012
|
51a4c12242
|
Merge pull request #197 from mhenrichsen/chore/update-readme
chore: Fix inference README.
|
2023-06-13 11:53:26 +09:00 |
|
Wing Lian
|
4b43a66a0b
|
update alpaca_chat prompts for instructions to explainn the conversation
|
2023-06-12 18:38:38 -04:00 |
|
mhenrichsen
|
34ae69989f
|
fix inference
|
2023-06-12 21:39:19 +02:00 |
|
Wing Lian
|
fd2c9814c9
|
Merge branch 'main' into flash-optimum
|
2023-06-12 13:12:15 -04:00 |
|
Wing Lian
|
2ba4ae8f46
|
tweak config to work
|
2023-06-12 10:07:18 -04:00 |
|
Wing Lian
|
93dacba228
|
Merge pull request #187 from OpenAccess-AI-Collective/strip-peft-device-map
peft no longer needs device_map
|
2023-06-12 09:10:49 -04:00 |
|
Wing Lian
|
8002ffb41f
|
Merge pull request #177 from NanoCode012/fix/landmark-patch
Fix landmark attention patch
|
2023-06-12 08:27:12 -04:00 |
|
Wing Lian
|
74ef5cc083
|
Merge pull request #192 from OpenAccess-AI-Collective/sharegpt-custom-prompt
misc fixes
|
2023-06-12 08:26:38 -04:00 |
|
Wing Lian
|
5e616d91c0
|
Merge branch 'main' into strip-peft-device-map
|
2023-06-12 08:25:54 -04:00 |
|
Wing Lian
|
94f310c7a6
|
Merge pull request #193 from OpenAccess-AI-Collective/config-fixes-20230612
config fixes
|
2023-06-12 08:24:52 -04:00 |
|
NanoCode012
|
8e568bbdae
|
Merge pull request #159 from AngainorDev/patch-1
Fix training over existing lora
|
2023-06-12 20:27:11 +09:00 |
|
NanoCode012
|
e21dab49fd
|
Merge pull request #194 from NanoCode012/fix/config-path
Fix config path after config moved
|
2023-06-12 19:28:12 +09:00 |
|
NanoCode012
|
52cde69288
|
Fix config path after config moved
|
2023-06-12 17:06:15 +09:00 |
|
Wing Lian
|
9a58e99e81
|
config fixes
|
2023-06-12 01:52:58 -04:00 |
|
Wing Lian
|
c7dee56b87
|
add typehints
|
2023-06-11 19:52:34 -04:00 |
|
Wing Lian
|
aac4b7691e
|
add new sharegpt, refactor prompt so it can be customized later, add exception if no data is processed
|
2023-06-11 19:42:25 -04:00 |
|
NanoCode012
|
f31a338cbb
|
Merge pull request #191 from OpenAccess-AI-Collective/NanoCode012-patch-1
Add save_steps and eval_steps to Readme
|
2023-06-12 02:55:37 +09:00 |
|
NanoCode012
|
4cd1deeef2
|
Add save_steps and eval_steps to Readme
|
2023-06-12 02:44:46 +09:00 |
|
Wing Lian
|
9ac16ed8d1
|
Merge pull request #190 from OpenAccess-AI-Collective/fixes-20230711-v2
more config pruning and migrating
|
2023-06-11 13:27:08 -04:00 |
|
Wing Lian
|
6b3f509d9e
|
forgot to add this file
|
2023-06-11 11:50:12 -04:00 |
|
Wing Lian
|
336aa3fd48
|
gptq lora llama is obviously good
|
2023-06-11 11:05:29 -04:00 |
|