Viktorius Suwandi
|
2aacf75ee1
|
Update wandb_log_model on galactica_1_3B.yml
|
2023-05-29 15:42:19 +07:00 |
|
Viktorius Suwandi
|
71871345a6
|
Update wandb_log_model on llama_7B_4bit.yml
|
2023-05-29 15:41:59 +07:00 |
|
Viktorius Suwandi
|
0d14e951a8
|
Update wandb_log_model on stability_3b.yml
|
2023-05-29 15:41:42 +07:00 |
|
Viktorius Suwandi
|
84fc217f79
|
Update wandb_log_model on gpt_neox_20b.yml
|
2023-05-29 15:41:24 +07:00 |
|
Viktorius Suwandi
|
f317296259
|
Update wandb_log_model on quickstart.yml
|
2023-05-29 15:40:58 +07:00 |
|
Viktorius Suwandi
|
42a971df32
|
Update wandb_log_model on sample.yml
|
2023-05-29 15:39:42 +07:00 |
|
Wing Lian
|
7f7fd68e8e
|
Merge pull request #104 from OpenAccess-AI-Collective/training-fixes-20230529
bnb fix, trainer debug fix
|
2023-05-29 02:19:03 -04:00 |
|
Wing Lian
|
21f17cca69
|
bnb fixes
|
2023-05-29 00:06:35 -04:00 |
|
Wing Lian
|
319e34bfb5
|
Merge pull request #101 from OpenAccess-AI-Collective/sharegpt-conv
refactor conversation plucking in sharegpt
|
2023-05-28 19:43:54 -04:00 |
|
Wing Lian
|
809ccebb38
|
use python setup install, bdist wheel is unreliable in installing extension
|
2023-05-28 15:49:13 -04:00 |
|
Wing Lian
|
21c8e2deab
|
refactor conversation plucking in sharegpt
|
2023-05-28 14:36:33 -04:00 |
|
Wing Lian
|
8fe12e3bc1
|
Merge pull request #100 from OpenAccess-AI-Collective/py310-tests
add py310 to the test matrix
|
2023-05-28 14:31:07 -04:00 |
|
Wing Lian
|
37fc85ac52
|
Merge pull request #99 from OpenAccess-AI-Collective/hf_use_auth_token
new hf_use_auth_token setting so login to hf isn't required
|
2023-05-28 14:30:04 -04:00 |
|
Wing Lian
|
658ed86cb5
|
add py310 to the test matrix
|
2023-05-28 14:25:57 -04:00 |
|
Wing Lian
|
fd5f9656a2
|
update for pr feedback
|
2023-05-28 14:23:27 -04:00 |
|
Wing Lian
|
1c33eb88a7
|
new hf_use_auth_token setting so login to hf isn't required
|
2023-05-28 13:08:49 -04:00 |
|
Wing Lian
|
a798ba1659
|
ensure libbitsandbytes*.so gets included with wheel
|
2023-05-28 12:28:37 -04:00 |
|
NanoCode012
|
666febcfb5
|
Merge pull request #97 from NanoCode012/feat/test-validation
Feat: Update validate_config and add tests
|
2023-05-29 00:38:22 +09:00 |
|
NanoCode012
|
52dd92a0cd
|
Feat: Update validate_config and add tests
|
2023-05-29 00:25:54 +09:00 |
|
Wing Lian
|
88889590ec
|
Merge pull request #90 from NanoCode012/feat/addict
Feat: Convert attrdict to addict
|
2023-05-28 10:43:07 -04:00 |
|
NanoCode012
|
f87bd20555
|
Fix incorrect syntax in test
|
2023-05-28 23:35:29 +09:00 |
|
NanoCode012
|
dd83a20c27
|
Update test to run on PR
|
2023-05-28 23:30:17 +09:00 |
|
NanoCode012
|
7bf2069afd
|
Apply black formatter
|
2023-05-28 23:14:04 +09:00 |
|
NanoCode012
|
923151ffab
|
Add test for DictDefault
|
2023-05-28 23:06:10 +09:00 |
|
NanoCode012
|
56f9ca5709
|
refactor: fix previous refactors
|
2023-05-28 23:06:10 +09:00 |
|
NanoCode012
|
8bd7a49cd7
|
Refactor to use DictDefault instead
|
2023-05-28 23:06:10 +09:00 |
|
NanoCode012
|
18d41cee4a
|
Add DictDefault
|
2023-05-28 23:06:10 +09:00 |
|
NanoCode012
|
93acb648bd
|
Fix load error
|
2023-05-28 23:06:10 +09:00 |
|
NanoCode012
|
bdfe7c9201
|
Convert attrdict to addict
|
2023-05-28 23:06:10 +09:00 |
|
Wing Lian
|
0d4a7f4c04
|
Merge pull request #67 from OpenAccess-AI-Collective/refactor-tokenizer-load
load the tokenizer separately from the model
|
2023-05-28 08:49:34 -04:00 |
|
Wing Lian
|
af3aacbe16
|
Merge pull request #93 from OpenAccess-AI-Collective/dev-base
cuda properly compiled bitsandbytes for qlora support
|
2023-05-27 19:40:29 -04:00 |
|
Wing Lian
|
cc67862dd3
|
move list not in list logic to fn
|
2023-05-27 16:42:05 -04:00 |
|
Wing Lian
|
cf37980395
|
fix missing run continuation
|
2023-05-27 15:28:54 -04:00 |
|
NanoCode012
|
ed2dd77e35
|
Merge pull request #89 from OpenAccess-AI-Collective/NanoCode012-update-action-version
Feat: Update actions version
|
2023-05-28 02:12:26 +09:00 |
|
NanoCode012
|
2b8c28bab8
|
Update actions version
|
2023-05-28 01:51:10 +09:00 |
|
Wing Lian
|
312b8d51d6
|
update docker to compile latest bnb to properly support qlora
|
2023-05-27 12:36:53 -04:00 |
|
NanoCode012
|
782996d94a
|
Merge pull request #86 from OpenAccess-AI-Collective/NanoCode012-warning-remote-code
Feat: Add warning for `trust_remote_code`
|
2023-05-28 01:29:35 +09:00 |
|
NanoCode012
|
b50d7d311c
|
Merge pull request #88 from OpenAccess-AI-Collective/NanoCode012-completion-prompter-no-inherit
Fix: Remove base class inherit for CompletionPrompter
|
2023-05-28 01:29:03 +09:00 |
|
Wing Lian
|
35af017001
|
Merge pull request #87 from OpenAccess-AI-Collective/add_prompter_tests
automated testing in github actions
|
2023-05-27 12:21:23 -04:00 |
|
Wing Lian
|
a653392287
|
use requirements file for tests
|
2023-05-27 12:17:46 -04:00 |
|
Wing Lian
|
72b6ca0d9f
|
cache pip
|
2023-05-27 12:16:54 -04:00 |
|
Wing Lian
|
7f53fd2ab6
|
alright, just local install it
|
2023-05-27 12:16:06 -04:00 |
|
Wing Lian
|
c29d33352c
|
move python path to same step as tests
|
2023-05-27 12:06:23 -04:00 |
|
Wing Lian
|
403af0b1d7
|
fix path and streamline pip installs
|
2023-05-27 11:58:37 -04:00 |
|
NanoCode012
|
9ac1884323
|
Fix: Remove base class inherit for CompletionPrompter
|
2023-05-28 00:51:35 +09:00 |
|
Wing Lian
|
d199d6c261
|
automated testing in github actions
|
2023-05-27 11:51:01 -04:00 |
|
NanoCode012
|
2824423d10
|
Add warning for trust_remote_code
|
2023-05-28 00:46:56 +09:00 |
|
NanoCode012
|
cb18856fc2
|
Merge pull request #85 from NanoCode012/fix/add-dataset-shard-readme
Feat: Add `dataset_shard_num` and `dataset_shard_idx` to Readme
|
2023-05-27 23:52:50 +09:00 |
|
NanoCode012
|
8626b54aab
|
Add dataset_shard_num and dataset_shard_idx
|
2023-05-27 23:51:17 +09:00 |
|
Wing Lian
|
87dffbc451
|
Merge pull request #75 from Thytu/refactor/rename-4b-to-gptq
refactor: change 4bit nomenclature to gptq
|
2023-05-27 09:37:57 -04:00 |
|