qlora merge and load requires that base model isn't loaded in 4 or 8 bit

2023-05-25 22:39:13 -04:00
parent 34c99f9812
commit 3f6017db9e
1 changed files with 11 additions and 5 deletions
--- a/README.md
+++ b/README.md
@@ -24,7 +24,7 @@

 ## Quickstart ⚡

-**Requirements**: Python 3.9. 
+**Requirements**: Python 3.9.

 ```bash
 git clone https://github.com/OpenAccess-AI-Collective/axolotl
@@ -45,7 +45,7 @@ accelerate launch scripts/finetune.py examples/4bit-lora-7b/config.yml \

 ### Environment

- Docker 
+- Docker
  ```bash
  docker run --gpus '"all"' --rm -it winglian/axolotl:main
  ```
@@ -332,7 +332,7 @@ seed:

 ### Accelerate

-Configure accelerate 
+Configure accelerate

 ```bash
 accelerate config
@@ -363,12 +363,18 @@ Pass the appropriate flag to the train command:

 ### Merge LORA to base

-Add below flag to train command above
+Add below flag to train command above (and using LoRA)

 ```bash
 --merge_lora --lora_model_dir="./completed-model"
 ```

+Add below flag to train command above (and using QLoRA)
+
+```bash
+--merge_lora --lora_model_dir="./completed-model" --load_in_8bit False --load_in_4bit False
+```
+
 ## Common Errors 🧰

 > Cuda out of memory
@@ -383,7 +389,7 @@ Please reduce any below
 Try set `fp16: true`

 ## Need help? 🙋‍♂️
-  
+
 Join our [Discord server](https://discord.gg/HhrNrHJPRb) where we can help you

 ## Contributing 🤝