NanoCode012
|
288fd62431
|
Merge pull request #135 from NanoCode012/fix/grad-accu-readme
Fix: Update doc for grad_accu and add validation tests for batch size
|
2023-06-01 06:33:05 +09:00 |
|
NanoCode012
|
3c71c8debe
|
Update doc for grad_accu and add validation tests for batch size
|
2023-06-01 06:13:47 +09:00 |
|
Wing Lian
|
a6f5e5eaec
|
Merge pull request #134 from OpenAccess-AI-Collective/gas-batch-fix
fix batch size calculation
|
2023-05-31 14:24:48 -04:00 |
|
Wing Lian
|
5a631b305b
|
fix batch size calculation
|
2023-05-31 14:11:32 -04:00 |
|
Wing Lian
|
f94dd626f0
|
Merge pull request #130 from OpenAccess-AI-Collective/gas
swap batch size for gradient accumulation steps to decouple from num gpu
|
2023-05-31 13:03:51 -04:00 |
|
Wing Lian
|
5079753b7a
|
Merge pull request #131 from OpenAccess-AI-Collective/fix-packing-mask
fix packing so that concatenated sequences reset the attention
|
2023-05-31 13:03:37 -04:00 |
|
Wing Lian
|
0136f510f2
|
don't worry about duplicate code here
|
2023-05-31 12:05:43 -04:00 |
|
Wing Lian
|
9b8585dc70
|
fix packing so that concatenated sequences reset the attention
|
2023-05-31 11:38:52 -04:00 |
|
Wing Lian
|
8eb5811d4e
|
Merge pull request #129 from OpenAccess-AI-Collective/builder-badge
add badge info to readme
|
2023-05-31 10:37:59 -04:00 |
|
Wing Lian
|
e0011fdf55
|
Fix base builder, missing tags
|
2023-05-31 09:52:03 -04:00 |
|
Wing Lian
|
6e9e98720e
|
Merge pull request #127 from OpenAccess-AI-Collective/py310-docker-runpod
add py310 support from base image
|
2023-05-31 09:39:42 -04:00 |
|
Wing Lian
|
c2a0792680
|
swap batch size for gradient accumulation steps to decouple from num gpu
|
2023-05-31 09:38:12 -04:00 |
|
Wing Lian
|
b267d24a2b
|
add badge info to readme
|
2023-05-31 09:28:44 -04:00 |
|
Wing Lian
|
5c3f5db38b
|
Add files via upload
|
2023-05-31 09:22:54 -04:00 |
|
Wing Lian
|
e3d03745ba
|
add py310 support from base image
|
2023-05-31 09:07:28 -04:00 |
|
NanoCode012
|
fac46002d4
|
Merge pull request #119 from NanoCode012/feat/update-inference
Feat(inference): Swap to GenerationConfig
|
2023-05-31 14:09:18 +09:00 |
|
NanoCode012
|
33d40179ba
|
Increase max_new_tokens
Co-authored-by: Wing Lian <wing.lian@gmail.com>
|
2023-05-31 14:04:49 +09:00 |
|
Wing Lian
|
dcb03d6da4
|
Merge pull request #114 from OpenAccess-AI-Collective/accelerate-dep
Add accelerate dep
|
2023-05-31 00:47:17 -04:00 |
|
NanoCode012
|
0e4be625ae
|
Merge pull request #118 from NanoCode012/feat/torch-readme
Fix(readme): Fix torch missing from readme
|
2023-05-31 13:29:41 +09:00 |
|
NanoCode012
|
bdc4bd7d4e
|
Update README.md
|
2023-05-31 13:24:28 +09:00 |
|
Wing Lian
|
2d0ba3b818
|
Merge pull request #124 from OpenAccess-AI-Collective/xformers-fix
copy xformers attn from ooba since we removed dep on alpaca_lora_4bit
|
2023-05-31 00:11:40 -04:00 |
|
Wing Lian
|
c7021e191f
|
Merge pull request #120 from OpenAccess-AI-Collective/model-from-path
split up llama model loading so config can be loaded from base config and models can be loaded from a path
|
2023-05-31 00:08:38 -04:00 |
|
Wing Lian
|
c56818b119
|
don't worry about dupes
|
2023-05-31 00:06:47 -04:00 |
|
Wing Lian
|
2675fb756e
|
update readme for SDP
|
2023-05-31 00:04:54 -04:00 |
|
Wing Lian
|
1076bcbbca
|
Update src/axolotl/monkeypatch/llama_attn_hijack_xformers.py
Co-authored-by: NanoCode012 <kevinvong@rocketmail.com>
|
2023-05-31 00:00:19 -04:00 |
|
Wing Lian
|
2daa6835f0
|
Update src/axolotl/monkeypatch/llama_attn_hijack_xformers.py
Co-authored-by: NanoCode012 <kevinvong@rocketmail.com>
|
2023-05-30 23:59:05 -04:00 |
|
Wing Lian
|
e3c494ca7b
|
remove unused import and update readme
|
2023-05-30 23:55:45 -04:00 |
|
Wing Lian
|
ad0ea6aaab
|
black formatting
ignore copied file
fix linting
|
2023-05-30 23:50:29 -04:00 |
|
Wing Lian
|
876edd83d0
|
Merge pull request #123 from OpenAccess-AI-Collective/bas-batch
add support for gradient accumulation steps
|
2023-05-30 23:45:29 -04:00 |
|
Wing Lian
|
6cb2310592
|
copy xformers attn from ooba since we removed dep on alpaca_lora_4bit
|
2023-05-30 23:34:36 -04:00 |
|
Wing Lian
|
6fa40bf8ad
|
black formatting
|
2023-05-30 23:33:37 -04:00 |
|
Wing Lian
|
3aad5f3b3e
|
add support for gradient accumulation steps
|
2023-05-30 23:24:37 -04:00 |
|
Wing Lian
|
39a208c2bc
|
fix up tokenizer config, isort fix
|
2023-05-30 23:00:02 -04:00 |
|
Wing Lian
|
2520ecd6df
|
split up llama model loading so config can be loaded from base config and models can be loaded from a path
|
2023-05-30 22:32:44 -04:00 |
|
Wing Lian
|
c5b0af1a7e
|
define python version (3.10) explicitly as string in yaml
|
2023-05-30 22:23:35 -04:00 |
|
NanoCode012
|
988aeb9c34
|
Feat: Swap to GenerationConfig
|
2023-05-31 10:48:19 +09:00 |
|
NanoCode012
|
cf61f14bff
|
Fix(readme): Fix torch missing from readme
|
2023-05-31 10:28:49 +09:00 |
|
Wing Lian
|
0abcd71a85
|
Merge pull request #115 from OpenAccess-AI-Collective/docker-version-fixes
docker fixes: py310, fix cuda arg in deepspeed
|
2023-05-30 18:11:26 -04:00 |
|
Wing Lian
|
c43c5c84ff
|
py310, fix cuda arg in deepspeed
|
2023-05-30 18:02:34 -04:00 |
|
Wing Lian
|
36ec6e1a0e
|
Add accelerate dep
|
2023-05-30 16:36:13 -04:00 |
|
Wing Lian
|
13b80937f9
|
add release draft template for gh
v0.2.0
|
2023-05-30 15:10:19 -04:00 |
|
Wing Lian
|
bbc5bc5791
|
Merge pull request #108 from OpenAccess-AI-Collective/docker-gptq
default to qlora support, make gptq specific image
|
2023-05-30 15:07:04 -04:00 |
|
Wing Lian
|
4df9da74e3
|
Merge pull request #105 from viktoriussuwandi/viktoriussuwandi-patch
Viktoriussuwandi patch
|
2023-05-30 15:05:23 -04:00 |
|
Wing Lian
|
2531ea24c1
|
Merge pull request #106 from fearnworks/qlora-openllama-3b-example
Qlora openllama 3b example
|
2023-05-30 15:05:05 -04:00 |
|
Wing Lian
|
01a75fd027
|
Merge pull request #98 from NanoCode012/feat/pre-commit
Add pre-commit: black+flake8+pylint+mypy+isort+bandit
|
2023-05-30 14:57:15 -04:00 |
|
NanoCode012
|
b81c97ff76
|
Fix pre-commit for rebased files
|
2023-05-31 03:01:38 +09:00 |
|
NanoCode012
|
594e72b6e8
|
Fix incorrect rebase
|
2023-05-31 02:58:50 +09:00 |
|
NanoCode012
|
25eeeeba0b
|
Fix sharegpt prompt
|
2023-05-31 02:55:21 +09:00 |
|
Wing Lian
|
cfcc549f6b
|
fix relative path for fixtures
|
2023-05-31 02:55:21 +09:00 |
|
NanoCode012
|
a1f9850b91
|
Fix security issue or ignore false positives
|
2023-05-31 02:53:53 +09:00 |
|