NanoCode012
|
68237ea90a
|
Add extra note to Readme
|
2023-05-25 09:32:04 +09:00 |
|
NanoCode012
|
4ee79f2641
|
Fix typo
|
2023-05-25 09:32:04 +09:00 |
|
NanoCode012
|
2b436680a0
|
Add new config options to Readme
|
2023-05-25 09:32:04 +09:00 |
|
NanoCode012
|
04d281312c
|
Feat: Rewrite Readme
|
2023-05-25 09:32:04 +09:00 |
|
Wing Lian
|
3960936bf7
|
Merge pull request #37 from Thytu/main
fix: handles AutoTokenizer from untrusted source
|
2023-05-24 15:42:41 -04:00 |
|
Valentin De Matos
|
88ad05df54
|
fix: handles AutoTokenizer from untrusted source
Set trust_remote_code param depending of cfg.trust_remote_code when calling AutoTokenizer.from_pretrained
|
2023-05-24 20:57:10 +02:00 |
|
Wing Lian
|
ed37b2268d
|
Merge pull request #32 from NanoCode012/patch-2
Feat: Set `half` using `cfg.fp16` for 4bit
|
2023-05-20 18:21:02 -04:00 |
|
NanoCode012
|
641f8012f9
|
Set half using cfg.fp16 for 4bit
|
2023-05-20 02:29:31 +09:00 |
|
Wing Lian
|
e553c9080b
|
Merge pull request #29 from NanoCode012/patch-1
|
2023-05-16 07:12:06 -04:00 |
|
NanoCode012
|
2c73c81348
|
Add lora_modules_to_save
|
2023-05-16 19:22:00 +09:00 |
|
Wing Lian
|
42410c783c
|
more fixes
|
2023-05-14 09:16:41 -04:00 |
|
Wing Lian
|
aef00b6c13
|
fix torch_dtype for model load
|
2023-05-14 08:44:22 -04:00 |
|
Wing Lian
|
0d28df0fd2
|
move filter to before saving so it doesn't happen everytime, update runpod manual script
|
2023-05-13 21:51:41 -04:00 |
|
Wing Lian
|
84c7bc4b68
|
whoops, gt vs lt
|
2023-05-12 14:03:25 -04:00 |
|
Wing Lian
|
aa3c3f97ae
|
optimize dataloading to use cache, fix model token embedding sizes
|
2023-05-12 13:53:27 -04:00 |
|
Wing Lian
|
f6d1fa4a85
|
Merge pull request #25 from NanoCode012/patch-2
Fix Trainer() got multiple values for keyword argument 'callbacks'
|
2023-05-11 09:20:15 -04:00 |
|
NanoCode012
|
89b7f26b9d
|
Merge branch 'main' into patch-2
|
2023-05-11 21:18:38 +09:00 |
|
Wing Lian
|
165da584b3
|
fix config for parity with previous change
5159d00a86\#diff-65b4693504c4e8ffac76c7f2c90913faee381f802cf64e7f49c995a2134ed3b3R164
|
2023-05-11 08:13:09 -04:00 |
|
Wing Lian
|
4cc7ed8898
|
Merge pull request #27 from NanoCode012/patch-1
Fix save typo
|
2023-05-11 07:27:31 -04:00 |
|
NanoCode012
|
52aada7174
|
Fix typo
|
2023-05-11 20:22:30 +09:00 |
|
Wing Lian
|
688c73a81e
|
Merge pull request #26 from OpenAccess-AI-Collective/mpt-triton
Mpt triton
|
2023-05-10 16:02:05 -04:00 |
|
Wing Lian
|
2bc1a5bde1
|
black formatting
|
2023-05-10 16:01:08 -04:00 |
|
Wing Lian
|
7a490a4646
|
various fixes
|
2023-05-10 16:00:09 -04:00 |
|
NanoCode012
|
813aab378f
|
Fix Trainer() got multiple values for keyword argument 'callbacks'
|
2023-05-10 18:28:28 +09:00 |
|
Wing Lian
|
e2e68c3965
|
testing mpt triton
|
2023-05-09 20:57:40 -04:00 |
|
Wing Lian
|
a27d594788
|
fix conditional so alpaca doesn't choke
|
2023-05-09 20:57:07 -04:00 |
|
Wing Lian
|
1fb0376150
|
Merge pull request #23 from NanoCode012/patch-1
Fix: Save adapter for lora
|
2023-05-09 15:05:58 -04:00 |
|
Wing Lian
|
915c56cd97
|
Update finetune.py
|
2023-05-09 15:05:39 -04:00 |
|
Wing Lian
|
df9c5085b5
|
not everyone has bf16 available
|
2023-05-09 14:47:48 -04:00 |
|
Wing Lian
|
7967cd1039
|
add 4bit lora 7b
|
2023-05-09 14:38:32 -04:00 |
|
NanoCode012
|
cd2395987e
|
Don't save full model for lora
|
2023-05-10 03:18:38 +09:00 |
|
NanoCode012
|
71a1f7f38c
|
Save adapter for lora
|
2023-05-10 01:08:22 +09:00 |
|
Wing Lian
|
02c59832a3
|
push up redpajama 3b example
|
2023-05-08 19:19:18 -04:00 |
|
Wing Lian
|
3f9c9530ea
|
Merge pull request #15 from NanoCode012/feat/completion
Feat: Add Completion dataset type
|
2023-05-08 19:04:54 -04:00 |
|
NanoCode012
|
174b74ddc9
|
Rename variable to use same convention
|
2023-05-09 02:49:44 +09:00 |
|
NanoCode012
|
cf681537ec
|
Add CompletionPrompt type
|
2023-05-09 02:49:44 +09:00 |
|
Wing Lian
|
bd3c5a5cb3
|
Merge pull request #21 from NanoCode012/patch-1
Fix: Scheduler and optimizer condition
|
2023-05-08 13:34:44 -04:00 |
|
Wing Lian
|
bcbc99e655
|
Merge pull request #19 from NanoCode012/feat/callback-save-lora
Feat: Add callback save peft_model on_save
|
2023-05-08 13:34:07 -04:00 |
|
Wing Lian
|
b0d2594de9
|
Merge pull request #22 from NanoCode012/patch-2
Fix BNB OOM by pinning version
|
2023-05-08 13:33:52 -04:00 |
|
NanoCode012
|
fe582df7d3
|
Fix BNB OOM by pinning version
|
2023-05-09 02:10:31 +09:00 |
|
NanoCode012
|
36aaea02b9
|
Update trainer.py
|
2023-05-09 02:01:08 +09:00 |
|
NanoCode012
|
5b6690ac25
|
Fix condition scheduler
|
2023-05-09 01:44:12 +09:00 |
|
Wing Lian
|
a125693122
|
add support for trust_remote_code for mpt models
|
2023-05-08 12:07:27 -04:00 |
|
Wing Lian
|
709be5af81
|
use printf instead of echo in dockerfile for portability
|
2023-05-08 11:45:38 -04:00 |
|
NanoCode012
|
cc77bab526
|
Add callbacks to Trainer
|
2023-05-09 00:41:19 +09:00 |
|
NanoCode012
|
0d6708bfe4
|
Add callback save peft_model on_save
|
2023-05-09 00:38:27 +09:00 |
|
Wing Lian
|
807cca81c0
|
fix path name to sorkspace
|
2023-05-08 11:20:03 -04:00 |
|
Wing Lian
|
79deb35c68
|
setup runpod images
use github.ref_name
|
2023-05-08 10:48:32 -04:00 |
|
Wing Lian
|
7576d85c73
|
fix to cd to path in docker
|
2023-05-08 03:43:46 -04:00 |
|
Wing Lian
|
3b4b476828
|
use existing state of repo to build, not the checkout
|
2023-05-08 03:29:48 -04:00 |
|