Wing Lian | c56818b119 | don't worry about dupes | 2023-05-31 00:06:47 -04:00
Wing Lian | 1076bcbbca | Update src/axolotl/monkeypatch/llama_attn_hijack_xformers.py; Co-authored-by: NanoCode012 <kevinvong@rocketmail.com> | 2023-05-31 00:00:19 -04:00
Wing Lian | 2daa6835f0 | Update src/axolotl/monkeypatch/llama_attn_hijack_xformers.py; Co-authored-by: NanoCode012 <kevinvong@rocketmail.com> | 2023-05-30 23:59:05 -04:00
Wing Lian | e3c494ca7b | remove unused import and update readme | 2023-05-30 23:55:45 -04:00
Wing Lian | ad0ea6aaab | black formatting; ignore copied file; fix linting | 2023-05-30 23:50:29 -04:00
Wing Lian | 6cb2310592 | copy xformers attn from ooba since we removed dep on alpaca_lora_4bit | 2023-05-30 23:34:36 -04:00
Wing Lian | 3aad5f3b3e | add support for gradient accumulation steps | 2023-05-30 23:24:37 -04:00
Wing Lian | 39a208c2bc | fix up tokenizer config, isort fix | 2023-05-30 23:00:02 -04:00
Wing Lian | 2520ecd6df | split up llama model loading so config can be loaded from base config and models can be loaded from a path | 2023-05-30 22:32:44 -04:00
NanoCode012 | 594e72b6e8 | Fix incorrect rebase | 2023-05-31 02:58:50 +09:00
NanoCode012 | 25eeeeba0b | Fix sharegpt prompt | 2023-05-31 02:55:21 +09:00
Wing Lian | cfcc549f6b | fix relative path for fixtures | 2023-05-31 02:55:21 +09:00
NanoCode012 | a1f9850b91 | Fix security issue or ignore false positives | 2023-05-31 02:53:53 +09:00
NanoCode012 | c17dae6d07 | Update src/axolotl/prompt_strategies/alpaca_instruct.py; Co-authored-by: Wing Lian <wing.lian@gmail.com> | 2023-05-31 02:53:53 +09:00
NanoCode012 | 37293dce07 | Apply isort then black | 2023-05-31 02:53:53 +09:00
NanoCode012 | e9650d3ae4 | Fix mypy typing | 2023-05-31 02:53:53 +09:00
NanoCode012 | be22551435 | Fix unsupported operand type(s) for "|" | 2023-05-31 02:53:53 +09:00
NanoCode012 | b832a0ac62 | Black formatting | 2023-05-31 02:53:53 +09:00
NanoCode012 | 8e46c0fb0d | Refactor duplicate code between Prompter and Pygmalion | 2023-05-31 02:53:53 +09:00
NanoCode012 | 9c6750a075 | Lint wandb | 2023-05-31 02:53:53 +09:00
NanoCode012 | c2dbf2c526 | Lint validation | 2023-05-31 02:53:53 +09:00
NanoCode012 | e6b57decbd | Lint tokenization | 2023-05-31 02:53:53 +09:00
NanoCode012 | fe1f4c4e7d | Lint schedulers | 2023-05-31 02:53:53 +09:00
NanoCode012 | 633ff2150f | Lint dict | 2023-05-31 02:53:53 +09:00
NanoCode012 | 5d86137f70 | Lint prompt_tokenizers | 2023-05-31 02:53:53 +09:00
NanoCode012 | 01c8a333b3 | Lint pygmalion | 2023-05-31 02:53:53 +09:00
NanoCode012 | 1645a4ddd5 | Lint creative_acr | 2023-05-31 02:53:53 +09:00
NanoCode012 | 145b060cbe | Lint alpaca_instruct | 2023-05-31 02:53:53 +09:00
NanoCode012 | 8cc0aadcb8 | Lint alpaca_chat | 2023-05-31 02:53:53 +09:00
NanoCode012 | 6abb7f6a16 | Lint datasets | 2023-05-31 02:53:53 +09:00
NanoCode012 | de2406c488 | Lint convert.py | 2023-05-31 02:53:53 +09:00
NanoCode012 | ddb86ea821 | Lint trainer.py | 2023-05-31 02:53:53 +09:00
NanoCode012 | f4e5d86268 | Lint models.py | 2023-05-31 02:53:23 +09:00
NanoCode012 | 69722aeef4 | Remove fixme disable | 2023-05-31 02:53:23 +09:00
NanoCode012 | 5658717dbd | Remove disable too many arg | 2023-05-31 02:53:23 +09:00
NanoCode012 | e8717d3bef | Remove disable | 2023-05-31 02:53:23 +09:00
NanoCode012 | 5062eca069 | Lint callbacks.py | 2023-05-31 02:53:23 +09:00
NanoCode012 | cb4f0e9342 | Lint prompters.py | 2023-05-31 02:53:23 +09:00
NanoCode012 | 4c0eddb3f8 | Refactor | 2023-05-31 02:53:23 +09:00
NanoCode012 | 1c60c10e00 | Lint flash_attn.py | 2023-05-31 02:53:23 +09:00
NanoCode012 | 903ea3080d | Fix lint | 2023-05-31 02:53:23 +09:00
NanoCode012 | cb7cd3429f | Fix data.py lint | 2023-05-31 02:53:23 +09:00
NanoCode012 | 392dfd9b07 | Lint and format | 2023-05-31 02:53:22 +09:00
Wing Lian | e65aeedce7 | fix relative path for fixtures | 2023-05-30 10:38:20 -04:00
Wing Lian | 21c8e2deab | refactor conversation plucking in sharegpt | 2023-05-28 14:36:33 -04:00
Wing Lian | 1c33eb88a7 | new hf_use_auth_token setting so login to hf isn't required | 2023-05-28 13:08:49 -04:00
NanoCode012 | 52dd92a0cd | Feat: Update validate_config and add tests | 2023-05-29 00:25:54 +09:00
NanoCode012 | 7bf2069afd | Apply black formatter | 2023-05-28 23:14:04 +09:00
NanoCode012 | 56f9ca5709 | refactor: fix previous refactors | 2023-05-28 23:06:10 +09:00
NanoCode012 | 8bd7a49cd7 | Refactor to use DictDefault instead | 2023-05-28 23:06:10 +09:00