diff --git a/README.md b/README.md index 8d201e739..a31eee5fb 100644 --- a/README.md +++ b/README.md @@ -16,14 +16,14 @@ ## Axolotl supports -| | fp16/fp32 | fp16/fp32 w/ lora | qlora | gptq | gptq w/ lora | gptq w/flash attention | flash attention | xformers attention | -|----------|:----------|:------------------|-------|------|:-------------|------------------------|-----------------|--------------------| -| llama | ✅ | ✅ | ✅ | ✅ | ❓ | ✅ | ✅ | ✅ | -| Pythia | ✅ | ✅ | ✅ | ❌ | ❓ | ❌ | ❌ | ❓ | -| cerebras | ✅ | ✅ | ✅ | ❌ | ❓ | ❌ | ❌ | ✅ | -| mpt | ✅ | ❌ | ❓ | ❌ | ❓ | ❌ | ❌ | ❓ | -| falcon | ✅ | ✅ | ✅ | ❌ | ❓ | ❌ | ❌ | ✅ | -| gpt-j | ✅ | ✅ | ✅ | ❌ | ❓ | ❌ | ❓ | ✅ | +| | fp16/fp32 | lora | qlora | gptq | gptq w/ lora | gptq w/flash attn | flash attn | xformers attn | +|----------|:----------|:-----|-------|------|:-------------|-------------------|------------|---------------| +| llama | ✅ | ✅ | ✅ | ✅ | ❓ | ✅ | ✅ | ✅ | +| Pythia | ✅ | ✅ | ✅ | ❌ | ❓ | ❌ | ❌ | ❓ | +| cerebras | ✅ | ✅ | ✅ | ❌ | ❓ | ❌ | ❌ | ✅ | +| mpt | ✅ | ❌ | ❓ | ❌ | ❓ | ❌ | ❌ | ❓ | +| falcon | ✅ | ✅ | ✅ | ❌ | ❓ | ❌ | ❌ | ✅ | +| gpt-j | ✅ | ✅ | ✅ | ❌ | ❓ | ❌ | ❓ | ✅ | ## Quickstart ⚡ diff --git a/examples/gptj-qlora/config.yml b/examples/gptj/qlora.yml similarity index 100% rename from examples/gptj-qlora/config.yml rename to examples/gptj/qlora.yml diff --git a/examples/openllama-3b/README.md b/examples/openllama-3b/README.md new file mode 100644 index 000000000..9e8f3a9e8 --- /dev/null +++ b/examples/openllama-3b/README.md @@ -0,0 +1,16 @@ +# openllama-3b + +Basic full tune +```shell +accelerate launch scripts/finetune.py examples/qlora-openllama-3b/config.yml +``` + +LoRA +```shell +accelerate launch scripts/finetune.py examples/qlora-openllama-3b/lora.yml +``` + +QLoRA +```shell +accelerate launch scripts/finetune.py examples/qlora-openllama-3b/qlora.yml +``` diff --git a/examples/lora-openllama-3b/config.yml b/examples/openllama-3b/lora.yml similarity index 90% rename from examples/lora-openllama-3b/config.yml rename to examples/openllama-3b/lora.yml index 2e1644546..98e2c2adc 100644 --- a/examples/lora-openllama-3b/config.yml +++ b/examples/openllama-3b/lora.yml @@ -1,5 +1,5 @@ -base_model: openlm-research/open_llama_3b_600bt_preview -base_model_config: openlm-research/open_llama_3b_600bt_preview +base_model: openlm-research/open_llama_3b +base_model_config: openlm-research/open_llama_3b model_type: LlamaForCausalLM tokenizer_type: LlamaTokenizer load_in_8bit: true diff --git a/examples/qlora-openllama-3b/config.yml b/examples/openllama-3b/qlora.yml similarity index 90% rename from examples/qlora-openllama-3b/config.yml rename to examples/openllama-3b/qlora.yml index 87e1dfd94..83ae31f91 100644 --- a/examples/qlora-openllama-3b/config.yml +++ b/examples/openllama-3b/qlora.yml @@ -1,5 +1,5 @@ -base_model: openlm-research/open_llama_3b_600bt_preview -base_model_config: openlm-research/open_llama_3b_600bt_preview +base_model: openlm-research/open_llama_3b +base_model_config: openlm-research/open_llama_3b model_type: LlamaForCausalLM tokenizer_type: LlamaTokenizer load_in_8bit: false diff --git a/examples/qlora-openllama-3b/README.md b/examples/qlora-openllama-3b/README.md deleted file mode 100644 index d79ea7f3f..000000000 --- a/examples/qlora-openllama-3b/README.md +++ /dev/null @@ -1,6 +0,0 @@ -# qlora-openllama-3b - -```shell -accelerate launch scripts/finetune.py examples/qlora-openllama-3b/config.yml - -```