NanoCode012
|
813aab378f
|
Fix Trainer() got multiple values for keyword argument 'callbacks'
|
2023-05-10 18:28:28 +09:00 |
|
Wing Lian
|
a27d594788
|
fix conditional so alpaca doesn't choke
|
2023-05-09 20:57:07 -04:00 |
|
Wing Lian
|
1fb0376150
|
Merge pull request #23 from NanoCode012/patch-1
Fix: Save adapter for lora
|
2023-05-09 15:05:58 -04:00 |
|
Wing Lian
|
915c56cd97
|
Update finetune.py
|
2023-05-09 15:05:39 -04:00 |
|
Wing Lian
|
df9c5085b5
|
not everyone has bf16 available
|
2023-05-09 14:47:48 -04:00 |
|
Wing Lian
|
7967cd1039
|
add 4bit lora 7b
|
2023-05-09 14:38:32 -04:00 |
|
NanoCode012
|
cd2395987e
|
Don't save full model for lora
|
2023-05-10 03:18:38 +09:00 |
|
NanoCode012
|
71a1f7f38c
|
Save adapter for lora
|
2023-05-10 01:08:22 +09:00 |
|
Wing Lian
|
02c59832a3
|
push up redpajama 3b example
|
2023-05-08 19:19:18 -04:00 |
|
Wing Lian
|
3f9c9530ea
|
Merge pull request #15 from NanoCode012/feat/completion
Feat: Add Completion dataset type
|
2023-05-08 19:04:54 -04:00 |
|
NanoCode012
|
174b74ddc9
|
Rename variable to use same convention
|
2023-05-09 02:49:44 +09:00 |
|
NanoCode012
|
cf681537ec
|
Add CompletionPrompt type
|
2023-05-09 02:49:44 +09:00 |
|
Wing Lian
|
bd3c5a5cb3
|
Merge pull request #21 from NanoCode012/patch-1
Fix: Scheduler and optimizer condition
|
2023-05-08 13:34:44 -04:00 |
|
Wing Lian
|
bcbc99e655
|
Merge pull request #19 from NanoCode012/feat/callback-save-lora
Feat: Add callback save peft_model on_save
|
2023-05-08 13:34:07 -04:00 |
|
Wing Lian
|
b0d2594de9
|
Merge pull request #22 from NanoCode012/patch-2
Fix BNB OOM by pinning version
|
2023-05-08 13:33:52 -04:00 |
|
NanoCode012
|
fe582df7d3
|
Fix BNB OOM by pinning version
|
2023-05-09 02:10:31 +09:00 |
|
NanoCode012
|
36aaea02b9
|
Update trainer.py
|
2023-05-09 02:01:08 +09:00 |
|
NanoCode012
|
5b6690ac25
|
Fix condition scheduler
|
2023-05-09 01:44:12 +09:00 |
|
Wing Lian
|
a125693122
|
add support for trust_remote_code for mpt models
|
2023-05-08 12:07:27 -04:00 |
|
Wing Lian
|
709be5af81
|
use printf instead of echo in dockerfile for portability
|
2023-05-08 11:45:38 -04:00 |
|
NanoCode012
|
cc77bab526
|
Add callbacks to Trainer
|
2023-05-09 00:41:19 +09:00 |
|
NanoCode012
|
0d6708bfe4
|
Add callback save peft_model on_save
|
2023-05-09 00:38:27 +09:00 |
|
Wing Lian
|
807cca81c0
|
fix path name to sorkspace
|
2023-05-08 11:20:03 -04:00 |
|
Wing Lian
|
79deb35c68
|
setup runpod images
use github.ref_name
|
2023-05-08 10:48:32 -04:00 |
|
Wing Lian
|
7576d85c73
|
fix to cd to path in docker
|
2023-05-08 03:43:46 -04:00 |
|
Wing Lian
|
3b4b476828
|
use existing state of repo to build, not the checkout
|
2023-05-08 03:29:48 -04:00 |
|
Wing Lian
|
b5fe063687
|
fix base for dockerfile
|
2023-05-08 03:27:10 -04:00 |
|
Wing Lian
|
a12fb0a8da
|
Jeopardy bot! (#17)
* support for jeopardy dataset
* commit the final config for jeopardy bot
|
2023-05-08 03:21:40 -04:00 |
|
Wing Lian
|
a4329b1068
|
fix #16 load best model setting when using 8bit
|
2023-05-07 18:30:48 -04:00 |
|
Wing Lian
|
550502b321
|
use micro batch size for eval size if not specified
|
2023-05-07 18:26:05 -04:00 |
|
Wing Lian
|
fae36c7111
|
blah, wrong base tag
|
2023-05-07 17:54:26 -04:00 |
|
Wing Lian
|
a31746baa2
|
whoops, build from base image
|
2023-05-07 17:47:54 -04:00 |
|
Wing Lian
|
17345c8a4b
|
hanging slash typo
|
2023-05-07 17:38:56 -04:00 |
|
Wing Lian
|
9cd5d3fcfc
|
build on self hosted GPU runners
|
2023-05-07 17:25:31 -04:00 |
|
Wing Lian
|
990bec63e6
|
docker layer caching, build w axolotl from base build
|
2023-05-07 17:16:05 -04:00 |
|
Wing Lian
|
0c46806ae2
|
typo in git repo for pip
|
2023-05-07 16:00:21 -04:00 |
|
Wing Lian
|
66fa751c18
|
add huggingface packages and awscli
|
2023-05-07 11:51:57 -04:00 |
|
Wing Lian
|
21b74397de
|
fix typo and add apex
|
2023-05-07 11:48:47 -04:00 |
|
Wing Lian
|
3f11b47488
|
needs libaio-dev from apt
|
2023-05-07 11:23:43 -04:00 |
|
Wing Lian
|
ece46b2504
|
pip install packaging dep
|
2023-05-07 11:09:03 -04:00 |
|
Wing Lian
|
92d800a394
|
build dependencies and aws-cli
|
2023-05-07 11:02:26 -04:00 |
|
Wing Lian
|
2734e3f1a2
|
build base separately
fix arg order for image
fix dockerfile var excaping
move args around
|
2023-05-07 10:56:12 -04:00 |
|
Wing Lian
|
14ebd2e007
|
build base too
|
2023-05-07 09:48:41 -04:00 |
|
Wing Lian
|
4a79dabff0
|
fix push to docker hub
|
2023-05-07 08:52:49 -04:00 |
|
Wing Lian
|
47ad3890bc
|
fix whitespace and instruction on inference
|
2023-05-07 08:28:15 -04:00 |
|
Wing Lian
|
76b24bca2e
|
push to docker hub
set docker image name
|
2023-05-07 08:06:50 -04:00 |
|
Wing Lian
|
73450d9de7
|
TORCH_CUDA_ARCH_LIST should be an ARG
|
2023-05-07 07:28:57 -04:00 |
|
Wing Lian
|
97cf77891e
|
run this on self hosted runner for now
fix typo
fixes to docker build
need pip wheel
don't duplicate pip install
|
2023-05-07 07:21:25 -04:00 |
|
Wing Lian
|
e2599edab9
|
runs on larger git runner?
|
2023-05-07 04:12:47 -04:00 |
|
Wing Lian
|
75bc8561c0
|
don't push the image
|
2023-05-07 03:39:05 -04:00 |
|