NanoCode012 | 224d186ec9 | Simplify docker instruction | 2023-05-25 09:51:22 +09:00
NanoCode012 | 5417824b31 | Add seq length | 2023-05-25 09:50:43 +09:00
NanoCode012 | e1a91b0918 | Remove redundant formats | 2023-05-25 09:48:18 +09:00
NanoCode012 | 2a1b5728e6 | Add line break | 2023-05-25 09:37:18 +09:00
NanoCode012 | 702f2eee4b | Fix inference command | 2023-05-25 09:36:33 +09:00
NanoCode012 | 88bba24d9e | Clean up data readme | 2023-05-25 09:34:35 +09:00
NanoCode012 | ba9ac723f1 | Update quickstart. Add common error and contribution section. | 2023-05-25 09:32:04 +09:00
NanoCode012 | db73b94a58 | Add image. Add quickstart. Simplify dataset. | 2023-05-25 09:32:04 +09:00
NanoCode012 | 00dfe43b1d | Add image | 2023-05-25 09:32:04 +09:00
NanoCode012 | 9aab0b8cfe | Update Docker instructions | 2023-05-25 09:32:04 +09:00
NanoCode012 | 857a80b70e | Format dataset types | 2023-05-25 09:32:04 +09:00
NanoCode012 | cba0048067 | Update typo | 2023-05-25 09:32:04 +09:00
NanoCode012 | c22df8db9b | Add all dataset types | 2023-05-25 09:32:04 +09:00
NanoCode012 | 68237ea90a | Add extra note to Readme | 2023-05-25 09:32:04 +09:00
NanoCode012 | 4ee79f2641 | Fix typo | 2023-05-25 09:32:04 +09:00
NanoCode012 | 2b436680a0 | Add new config options to Readme | 2023-05-25 09:32:04 +09:00
NanoCode012 | 04d281312c | Feat: Rewrite Readme | 2023-05-25 09:32:04 +09:00
Wing Lian | 3960936bf7 | Merge pull request #37 from Thytu/main (fix: handles AutoTokenizer from untrusted source) | 2023-05-24 15:42:41 -04:00
Valentin De Matos | 88ad05df54 | fix: handles AutoTokenizer from untrusted source (Set trust_remote_code param depending of cfg.trust_remote_code when calling AutoTokenizer.from_pretrained) | 2023-05-24 20:57:10 +02:00
Wing Lian | ed37b2268d | Merge pull request #32 from NanoCode012/patch-2 (Feat: Set `half` using `cfg.fp16` for 4bit) | 2023-05-20 18:21:02 -04:00
NanoCode012 | 641f8012f9 | Set half using cfg.fp16 for 4bit | 2023-05-20 02:29:31 +09:00
Wing Lian | e553c9080b | Merge pull request #29 from NanoCode012/patch-1 | 2023-05-16 07:12:06 -04:00
NanoCode012 | 2c73c81348 | Add lora_modules_to_save | 2023-05-16 19:22:00 +09:00
Wing Lian | 42410c783c | more fixes | 2023-05-14 09:16:41 -04:00
Wing Lian | aef00b6c13 | fix torch_dtype for model load | 2023-05-14 08:44:22 -04:00
Wing Lian | 0d28df0fd2 | move filter to before saving so it doesn't happen everytime, update runpod manual script | 2023-05-13 21:51:41 -04:00
Wing Lian | 84c7bc4b68 | whoops, gt vs lt | 2023-05-12 14:03:25 -04:00
Wing Lian | aa3c3f97ae | optimize dataloading to use cache, fix model token embedding sizes | 2023-05-12 13:53:27 -04:00
Wing Lian | f6d1fa4a85 | Merge pull request #25 from NanoCode012/patch-2 (Fix Trainer() got multiple values for keyword argument 'callbacks') | 2023-05-11 09:20:15 -04:00
NanoCode012 | 89b7f26b9d | Merge branch 'main' into patch-2 | 2023-05-11 21:18:38 +09:00
Wing Lian | 165da584b3 | fix config for parity with previous change (5159d00a86#diff-65b4693504c4e8ffac76c7f2c90913faee381f802cf64e7f49c995a2134ed3b3R164) | 2023-05-11 08:13:09 -04:00
Wing Lian | 4cc7ed8898 | Merge pull request #27 from NanoCode012/patch-1 (Fix save typo) | 2023-05-11 07:27:31 -04:00
NanoCode012 | 52aada7174 | Fix typo | 2023-05-11 20:22:30 +09:00
Wing Lian | 688c73a81e | Merge pull request #26 from OpenAccess-AI-Collective/mpt-triton (Mpt triton) | 2023-05-10 16:02:05 -04:00
Wing Lian | 2bc1a5bde1 | black formatting | 2023-05-10 16:01:08 -04:00
Wing Lian | 7a490a4646 | various fixes | 2023-05-10 16:00:09 -04:00
NanoCode012 | 813aab378f | Fix Trainer() got multiple values for keyword argument 'callbacks' | 2023-05-10 18:28:28 +09:00
Wing Lian | e2e68c3965 | testing mpt triton | 2023-05-09 20:57:40 -04:00
Wing Lian | a27d594788 | fix conditional so alpaca doesn't choke | 2023-05-09 20:57:07 -04:00
Wing Lian | 1fb0376150 | Merge pull request #23 from NanoCode012/patch-1 (Fix: Save adapter for lora) | 2023-05-09 15:05:58 -04:00
Wing Lian | 915c56cd97 | Update finetune.py | 2023-05-09 15:05:39 -04:00
Wing Lian | df9c5085b5 | not everyone has bf16 available | 2023-05-09 14:47:48 -04:00
Wing Lian | 7967cd1039 | add 4bit lora 7b | 2023-05-09 14:38:32 -04:00
NanoCode012 | cd2395987e | Don't save full model for lora | 2023-05-10 03:18:38 +09:00
NanoCode012 | 71a1f7f38c | Save adapter for lora | 2023-05-10 01:08:22 +09:00
Wing Lian | 02c59832a3 | push up redpajama 3b example | 2023-05-08 19:19:18 -04:00
Wing Lian | 3f9c9530ea | Merge pull request #15 from NanoCode012/feat/completion (Feat: Add Completion dataset type) | 2023-05-08 19:04:54 -04:00
NanoCode012 | 174b74ddc9 | Rename variable to use same convention | 2023-05-09 02:49:44 +09:00
NanoCode012 | cf681537ec | Add CompletionPrompt type | 2023-05-09 02:49:44 +09:00
Wing Lian | bd3c5a5cb3 | Merge pull request #21 from NanoCode012/patch-1 (Fix: Scheduler and optimizer condition) | 2023-05-08 13:34:44 -04:00