Wing Lian
|
d5f944ce2a
|
add example for falcon support
|
2023-05-27 09:16:43 -04:00 |
|
Wing Lian
|
c3d256271e
|
fix wheel install glob
|
2023-05-26 10:37:02 -04:00 |
|
NanoCode012
|
46c5a44003
|
Merge pull request #69 from OpenAccess-AI-Collective/NanoCode012-quickstart-disable-xformers
Fix: Disable xformers for QuickStart config
|
2023-05-26 22:40:16 +09:00 |
|
NanoCode012
|
ec3c0314bf
|
Merge pull request #65 from NanoCode012/feat/target-linear
Feat: Add `cfg.lora_target_linear`
|
2023-05-26 22:39:38 +09:00 |
|
NanoCode012
|
79560934f9
|
Disable xformers for QuickStart config
|
2023-05-26 22:23:38 +09:00 |
|
NanoCode012
|
353cebd838
|
Merge pull request #68 from OpenAccess-AI-Collective/NanoCode012-patch-1
Fix: Incorrect recommendation condition
|
2023-05-26 22:20:31 +09:00 |
|
NanoCode012
|
fe0e69f4f9
|
Fix recommendation condition
|
2023-05-26 22:19:50 +09:00 |
|
Wing Lian
|
1fc9b44e3d
|
fix wheel blobs in dockerfile
|
2023-05-26 07:40:11 -04:00 |
|
NanoCode012
|
919623793a
|
Add cfg.lora_target_linear
|
2023-05-26 14:32:30 +09:00 |
|
Wing Lian
|
bbfc333a01
|
Merge pull request #62 from OpenAccess-AI-Collective/qlora-fixes
Qlora fixes
|
2023-05-26 00:28:16 -04:00 |
|
Wing Lian
|
a5bf838685
|
add logging and make sure model unloads to float16
|
2023-05-26 00:09:55 -04:00 |
|
Wing Lian
|
a4f12415a0
|
update readme and add typehints
|
2023-05-25 23:10:11 -04:00 |
|
Wing Lian
|
48f4c0571e
|
fix validation for qlora merge
|
2023-05-25 23:02:03 -04:00 |
|
Wing Lian
|
1987e5cf56
|
qlora and 4bit check so we are able to merge and unload
|
2023-05-25 22:55:13 -04:00 |
|
Wing Lian
|
e7e1a777bd
|
fix bool args according to python fire docs
|
2023-05-25 22:45:41 -04:00 |
|
Wing Lian
|
7b5e762be2
|
fix merge conflict failure, black format
|
2023-05-25 22:40:27 -04:00 |
|
Wing Lian
|
3f6017db9e
|
qlora merge and load requires that base model isn't loaded in 4 or 8 bit
|
2023-05-25 22:39:13 -04:00 |
|
Wing Lian
|
34c99f9812
|
fixes to make qlora actually work
|
2023-05-25 22:37:23 -04:00 |
|
NanoCode012
|
3815c054b6
|
Merge pull request #61 from NanoCode012/feat/update-readme
Feat: Update readme
|
2023-05-26 11:27:31 +09:00 |
|
NanoCode012
|
85326bfbf3
|
Update quickstart config
|
2023-05-26 11:15:57 +09:00 |
|
NanoCode012
|
e689069afd
|
Add xformers error
|
2023-05-26 11:12:03 +09:00 |
|
NanoCode012
|
d7d8bc739e
|
Add strict yml
|
2023-05-26 11:10:59 +09:00 |
|
NanoCode012
|
60e32ff457
|
Fix shard config
|
2023-05-26 11:09:28 +09:00 |
|
Wing Lian
|
259262bf42
|
fix xentropy wheel name typo
|
2023-05-25 17:25:38 -04:00 |
|
Wing Lian
|
2e56203b50
|
another fix for shard and train split
|
2023-05-25 17:23:57 -04:00 |
|
Wing Lian
|
be3d3963cd
|
Merge pull request #58 from OpenAccess-AI-Collective/shards-fix
shard fix
|
2023-05-25 16:32:31 -04:00 |
|
Wing Lian
|
ac79360161
|
shard fix
|
2023-05-25 16:31:59 -04:00 |
|
Wing Lian
|
b2fb61845e
|
Merge pull request #54 from OpenAccess-AI-Collective/winglian-patch-1
add discord link to #axolotl-help channel
|
2023-05-25 12:45:19 -04:00 |
|
Wing Lian
|
71d600fc43
|
Merge branch 'main' into winglian-patch-1
|
2023-05-25 12:45:13 -04:00 |
|
Wing Lian
|
4fd0c2d1b9
|
Merge pull request #57 from OpenAccess-AI-Collective/fixes-for-basic-samples
fixes w/ example for super basic lora starter
|
2023-05-25 12:43:22 -04:00 |
|
Wing Lian
|
943961fd10
|
missed ...
|
2023-05-25 12:42:56 -04:00 |
|
Wing Lian
|
d2a6f79fd1
|
change auth token setting back
|
2023-05-25 12:41:17 -04:00 |
|
Wing Lian
|
98b1bce57e
|
pr comments addressed
|
2023-05-25 12:25:07 -04:00 |
|
Wing Lian
|
004820209d
|
Update src/axolotl/prompters.py
Co-authored-by: NanoCode012 <kevinvong@rocketmail.com>
|
2023-05-25 12:21:02 -04:00 |
|
Wing Lian
|
8d6a28953f
|
fix relative path in flash-attn build:
|
2023-05-25 12:18:28 -04:00 |
|
Wing Lian
|
e396654319
|
fix tokenizer loading, got openllama 3b working
|
2023-05-25 12:15:12 -04:00 |
|
Wing Lian
|
a5d739b66b
|
fixes w/ example for super basic lora starter
|
2023-05-25 11:59:08 -04:00 |
|
Wing Lian
|
951facbb1f
|
Merge pull request #56 from OpenAccess-AI-Collective/fix-build-flash-attn
fix cd within flash-attn
|
2023-05-25 11:29:47 -04:00 |
|
Wing Lian
|
f5fa3d131b
|
fix cd within flash-attn
|
2023-05-25 11:29:15 -04:00 |
|
NanoCode012
|
7ec105041d
|
Merge pull request #48 from NanoCode012/feat/update-readme
Feat: Minor update readme from dev changes
|
2023-05-25 23:49:58 +09:00 |
|
NanoCode012
|
a9e502ef45
|
Update 4bit notes
|
2023-05-25 23:48:18 +09:00 |
|
NanoCode012
|
68f0c71424
|
Merge pull request #49 from NanoCode012/feat/gitignore
Feat: Update gitignore using standard Python template
|
2023-05-25 23:42:49 +09:00 |
|
NanoCode012
|
52fb6d8a34
|
Update gitignore using standard Python template
|
2023-05-25 23:07:27 +09:00 |
|
NanoCode012
|
f92245dbd6
|
Fix missing closing code block
|
2023-05-25 23:06:33 +09:00 |
|
NanoCode012
|
e65c203e9e
|
Add more detail on minimum GPU
|
2023-05-25 23:06:33 +09:00 |
|
NanoCode012
|
1377400c33
|
Add info on Runtime Error
|
2023-05-25 23:06:33 +09:00 |
|
NanoCode012
|
2c34f8d0c7
|
Update dataset type
|
2023-05-25 23:06:33 +09:00 |
|
NanoCode012
|
7bc28eb8a8
|
Add more data formats
|
2023-05-25 23:06:33 +09:00 |
|
NanoCode012
|
29273b5a5b
|
Add other minor configs
|
2023-05-25 23:06:33 +09:00 |
|
NanoCode012
|
05c18340d6
|
Update scheduler configs
|
2023-05-25 23:06:33 +09:00 |
|