Wing Lian
|
fa8bd14be4
|
update entrypoint and force min accelerate
|
2023-05-18 06:25:34 -04:00 |
|
Wing Lian
|
13650732f8
|
concise multiple choice and tldr summarize
|
2023-05-17 11:29:17 -04:00 |
|
Wing Lian
|
8c2f3cb0f8
|
support for replit lm
|
2023-05-17 08:49:03 -04:00 |
|
Wing Lian
|
b46bc02f0a
|
add alpaca multiple choice instruct dataset support
|
2023-05-16 21:45:34 -04:00 |
|
Wing Lian
|
f98e173b59
|
reorder options so debug can happen in the same prepare step
|
2023-05-15 22:26:30 -04:00 |
|
Wing Lian
|
5e37144754
|
fix prompters, especially the sharegpt prompter
|
2023-05-15 22:15:36 -04:00 |
|
Wing Lian
|
bdbca8fa6c
|
more fixes
|
2023-05-15 14:07:17 -04:00 |
|
Wing Lian
|
42410c783c
|
more fixes
|
2023-05-14 09:16:41 -04:00 |
|
Wing Lian
|
aef00b6c13
|
fix torch_dtype for model load
|
2023-05-14 08:44:22 -04:00 |
|
Wing Lian
|
0d28df0fd2
|
move filter to before saving so it doesn't happen everytime, update runpod manual script
|
2023-05-13 21:51:41 -04:00 |
|
Wing Lian
|
84c7bc4b68
|
whoops, gt vs lt
|
2023-05-12 14:03:25 -04:00 |
|
Wing Lian
|
aa3c3f97ae
|
optimize dataloading to use cache, fix model token embedding sizes
|
2023-05-12 13:53:27 -04:00 |
|
Wing Lian
|
f6d1fa4a85
|
Merge pull request #25 from NanoCode012/patch-2
Fix Trainer() got multiple values for keyword argument 'callbacks'
|
2023-05-11 09:20:15 -04:00 |
|
NanoCode012
|
89b7f26b9d
|
Merge branch 'main' into patch-2
|
2023-05-11 21:18:38 +09:00 |
|
Wing Lian
|
165da584b3
|
fix config for parity with previous change
5159d00a86#diff-65b4693504c4e8ffac76c7f2c90913faee381f802cf64e7f49c995a2134ed3b3R164
|
2023-05-11 08:13:09 -04:00 |
|
Wing Lian
|
4cc7ed8898
|
Merge pull request #27 from NanoCode012/patch-1
Fix save typo
|
2023-05-11 07:27:31 -04:00 |
|
NanoCode012
|
52aada7174
|
Fix typo
|
2023-05-11 20:22:30 +09:00 |
|
Wing Lian
|
688c73a81e
|
Merge pull request #26 from OpenAccess-AI-Collective/mpt-triton
Mpt triton
|
2023-05-10 16:02:05 -04:00 |
|
Wing Lian
|
2bc1a5bde1
|
black formatting
|
2023-05-10 16:01:08 -04:00 |
|
Wing Lian
|
7a490a4646
|
various fixes
|
2023-05-10 16:00:09 -04:00 |
|
NanoCode012
|
813aab378f
|
Fix Trainer() got multiple values for keyword argument 'callbacks'
|
2023-05-10 18:28:28 +09:00 |
|
Wing Lian
|
e2e68c3965
|
testing mpt triton
|
2023-05-09 20:57:40 -04:00 |
|
Wing Lian
|
a27d594788
|
fix conditional so alpaca doesn't choke
|
2023-05-09 20:57:07 -04:00 |
|
Wing Lian
|
1fb0376150
|
Merge pull request #23 from NanoCode012/patch-1
Fix: Save adapter for lora
|
2023-05-09 15:05:58 -04:00 |
|
Wing Lian
|
915c56cd97
|
Update finetune.py
|
2023-05-09 15:05:39 -04:00 |
|
Wing Lian
|
df9c5085b5
|
not everyone has bf16 available
|
2023-05-09 14:47:48 -04:00 |
|
Wing Lian
|
7967cd1039
|
add 4bit lora 7b
|
2023-05-09 14:38:32 -04:00 |
|
NanoCode012
|
cd2395987e
|
Don't save full model for lora
|
2023-05-10 03:18:38 +09:00 |
|
NanoCode012
|
71a1f7f38c
|
Save adapter for lora
|
2023-05-10 01:08:22 +09:00 |
|
Wing Lian
|
02c59832a3
|
push up redpajama 3b example
|
2023-05-08 19:19:18 -04:00 |
|
Wing Lian
|
3f9c9530ea
|
Merge pull request #15 from NanoCode012/feat/completion
Feat: Add Completion dataset type
|
2023-05-08 19:04:54 -04:00 |
|
NanoCode012
|
174b74ddc9
|
Rename variable to use same convention
|
2023-05-09 02:49:44 +09:00 |
|
NanoCode012
|
cf681537ec
|
Add CompletionPrompt type
|
2023-05-09 02:49:44 +09:00 |
|
Wing Lian
|
bd3c5a5cb3
|
Merge pull request #21 from NanoCode012/patch-1
Fix: Scheduler and optimizer condition
|
2023-05-08 13:34:44 -04:00 |
|
Wing Lian
|
bcbc99e655
|
Merge pull request #19 from NanoCode012/feat/callback-save-lora
Feat: Add callback save peft_model on_save
|
2023-05-08 13:34:07 -04:00 |
|
Wing Lian
|
b0d2594de9
|
Merge pull request #22 from NanoCode012/patch-2
Fix BNB OOM by pinning version
|
2023-05-08 13:33:52 -04:00 |
|
NanoCode012
|
fe582df7d3
|
Fix BNB OOM by pinning version
|
2023-05-09 02:10:31 +09:00 |
|
NanoCode012
|
36aaea02b9
|
Update trainer.py
|
2023-05-09 02:01:08 +09:00 |
|
NanoCode012
|
5b6690ac25
|
Fix condition scheduler
|
2023-05-09 01:44:12 +09:00 |
|
Wing Lian
|
a125693122
|
add support for trust_remote_code for mpt models
|
2023-05-08 12:07:27 -04:00 |
|
Wing Lian
|
709be5af81
|
use printf instead of echo in dockerfile for portability
|
2023-05-08 11:45:38 -04:00 |
|
NanoCode012
|
cc77bab526
|
Add callbacks to Trainer
|
2023-05-09 00:41:19 +09:00 |
|
NanoCode012
|
0d6708bfe4
|
Add callback save peft_model on_save
|
2023-05-09 00:38:27 +09:00 |
|
Wing Lian
|
807cca81c0
|
fix path name to workspace
|
2023-05-08 11:20:03 -04:00 |
|
Wing Lian
|
79deb35c68
|
setup runpod images
use github.ref_name
|
2023-05-08 10:48:32 -04:00 |
|
Wing Lian
|
7576d85c73
|
fix to cd to path in docker
|
2023-05-08 03:43:46 -04:00 |
|
Wing Lian
|
3b4b476828
|
use existing state of repo to build, not the checkout
|
2023-05-08 03:29:48 -04:00 |
|
Wing Lian
|
b5fe063687
|
fix base for dockerfile
|
2023-05-08 03:27:10 -04:00 |
|
Wing Lian
|
a12fb0a8da
|
Jeopardy bot! (#17)
* support for jeopardy dataset
* commit the final config for jeopardy bot
|
2023-05-08 03:21:40 -04:00 |
|
Wing Lian
|
a4329b1068
|
fix #16 load best model setting when using 8bit
|
2023-05-07 18:30:48 -04:00 |
|