Wing Lian
|
5b33e295bd
|
update docs
|
2023-06-05 22:48:16 -04:00 |
|
Wing Lian
|
618816d4df
|
Update README.md for correct image tags
|
2023-06-02 14:10:23 -04:00 |
|
FarisHijazi
|
84169d15b3
|
added docker-compose file
|
2023-06-02 18:17:43 +03:00 |
|
Wing Lian
|
ecfe8d0a1a
|
Merge pull request #142 from NanoCode012/feat/custom-prompt-readme
Feat: Add custom prompt readme and add missing prompt strategies to Readme
|
2023-06-02 07:21:04 -04:00 |
|
NanoCode012
|
078a43eef8
|
Remove redundant instruction
|
2023-06-02 12:30:11 +09:00 |
|
NanoCode012
|
33e1890086
|
Add pygmalion
|
2023-06-02 12:27:51 +09:00 |
|
NanoCode012
|
1c38253692
|
Add other prompt_strategies
|
2023-06-02 12:24:44 +09:00 |
|
NanoCode012
|
496b83f778
|
Add short instruction for custom prompts
|
2023-06-02 12:16:20 +09:00 |
|
NanoCode012
|
ff68a95781
|
Add lambdalabs instruction
|
2023-06-02 12:09:40 +09:00 |
|
NanoCode012
|
3c71c8debe
|
Update doc for grad_accu and add validation tests for batch size
|
2023-06-01 06:13:47 +09:00 |
|
Wing Lian
|
f94dd626f0
|
Merge pull request #130 from OpenAccess-AI-Collective/gas
swap batch size for gradient accumulation steps to decouple from num gpu
|
2023-05-31 13:03:51 -04:00 |
|
Wing Lian
|
c2a0792680
|
swap batch size for gradient accumulation steps to decouple from num gpu
|
2023-05-31 09:38:12 -04:00 |
|
Wing Lian
|
b267d24a2b
|
add badge info to readme
|
2023-05-31 09:28:44 -04:00 |
|
NanoCode012
|
0e4be625ae
|
Merge pull request #118 from NanoCode012/feat/torch-readme
Fix(readme): Fix torch missing from readme
|
2023-05-31 13:29:41 +09:00 |
|
NanoCode012
|
bdc4bd7d4e
|
Update README.md
|
2023-05-31 13:24:28 +09:00 |
|
Wing Lian
|
2d0ba3b818
|
Merge pull request #124 from OpenAccess-AI-Collective/xformers-fix
copy xformers attn from ooba since we removed dep on alpaca_lora_4bit
|
2023-05-31 00:11:40 -04:00 |
|
Wing Lian
|
2675fb756e
|
update readme for SDP
|
2023-05-31 00:04:54 -04:00 |
|
Wing Lian
|
e3c494ca7b
|
remove unused import and update readme
|
2023-05-30 23:55:45 -04:00 |
|
NanoCode012
|
cf61f14bff
|
FIx(readme): Fix torch missing from readme
|
2023-05-31 10:28:49 +09:00 |
|
Wing Lian
|
bbc5bc5791
|
Merge pull request #108 from OpenAccess-AI-Collective/docker-gptq
default to qlora support, make gptq specific image
|
2023-05-30 15:07:04 -04:00 |
|
NanoCode012
|
b1cc54b14a
|
Update pip install to also setup tests
|
2023-05-31 02:53:53 +09:00 |
|
NanoCode012
|
afb31e13a3
|
Add badge and update contribution section
|
2023-05-31 02:53:53 +09:00 |
|
Wing Lian
|
48612f8376
|
cleanup from pr feedback
|
2023-05-30 09:56:30 -04:00 |
|
Wing Lian
|
d91a769b88
|
update docs
|
2023-05-29 20:37:32 -04:00 |
|
Wing Lian
|
6ef96f569b
|
default to qlora support, make gptq specific image
|
2023-05-29 20:34:41 -04:00 |
|
Wing Lian
|
1c33eb88a7
|
new hf_use_auth_token setting so login to hf isn't required
|
2023-05-28 13:08:49 -04:00 |
|
NanoCode012
|
8626b54aab
|
Add dataset_shard_num and dataset_shard_idx
|
2023-05-27 23:51:17 +09:00 |
|
Wing Lian
|
147241ca66
|
Merge branch 'main' into refactor/rename-4b-to-gptq
|
2023-05-27 09:37:52 -04:00 |
|
Wing Lian
|
d5f944ce2a
|
add example for falcoln support
|
2023-05-27 09:16:43 -04:00 |
|
Thytu
|
dd0065773a
|
refactor(param): rename load_4bit config param by gptq
Signed-off-by: Thytu <vdmatos@gladia.io>
|
2023-05-27 12:36:03 +00:00 |
|
NanoCode012
|
919623793a
|
Add cfg.lora_target_linear
|
2023-05-26 14:32:30 +09:00 |
|
Wing Lian
|
bbfc333a01
|
Merge pull request #62 from OpenAccess-AI-Collective/qlora-fixes
Qlora fixes
|
2023-05-26 00:28:16 -04:00 |
|
Wing Lian
|
a4f12415a0
|
update readme and add typehints
|
2023-05-25 23:10:11 -04:00 |
|
Wing Lian
|
e7e1a777bd
|
fix bool args according to python fire docs
|
2023-05-25 22:45:41 -04:00 |
|
Wing Lian
|
3f6017db9e
|
qlora merge and load requires that base model isn't loaded in 4 or 8 bit
|
2023-05-25 22:39:13 -04:00 |
|
NanoCode012
|
85326bfbf3
|
Update quickstart config
|
2023-05-26 11:15:57 +09:00 |
|
NanoCode012
|
e689069afd
|
Add xformers error
|
2023-05-26 11:12:03 +09:00 |
|
NanoCode012
|
d7d8bc739e
|
Add strict yml
|
2023-05-26 11:10:59 +09:00 |
|
NanoCode012
|
60e32ff457
|
Fix shard config
|
2023-05-26 11:09:28 +09:00 |
|
Wing Lian
|
71d600fc43
|
Merge branch 'main' into winglian-patch-1
|
2023-05-25 12:45:13 -04:00 |
|
NanoCode012
|
a9e502ef45
|
Update 4bit notes
|
2023-05-25 23:48:18 +09:00 |
|
NanoCode012
|
f92245dbd6
|
Fix missing closing code block
|
2023-05-25 23:06:33 +09:00 |
|
NanoCode012
|
e65c203e9e
|
Add more detail on minimum GPU
|
2023-05-25 23:06:33 +09:00 |
|
NanoCode012
|
1377400c33
|
Add info on Runtime Error
|
2023-05-25 23:06:33 +09:00 |
|
NanoCode012
|
2c34f8d0c7
|
Update dataset type
|
2023-05-25 23:06:33 +09:00 |
|
NanoCode012
|
7bc28eb8a8
|
Add more data formats
|
2023-05-25 23:06:33 +09:00 |
|
NanoCode012
|
29273b5a5b
|
Add other minor configs
|
2023-05-25 23:06:33 +09:00 |
|
NanoCode012
|
05c18340d6
|
Update scheduler configs
|
2023-05-25 23:06:33 +09:00 |
|
NanoCode012
|
5b712afbe4
|
Update bf16 options
|
2023-05-25 23:06:33 +09:00 |
|
NanoCode012
|
9083910036
|
Update lora config
|
2023-05-25 23:06:33 +09:00 |
|