feat: remove need to add load_in* during merge (#1017)
This commit is contained in:
@@ -996,7 +996,7 @@ When you include these tokens in your axolotl config, axolotl adds these tokens
|
|||||||
### Inference Playground
|
### Inference Playground
|
||||||
|
|
||||||
Axolotl allows you to load your model in an interactive terminal playground for quick experimentation.
|
Axolotl allows you to load your model in an interactive terminal playground for quick experimentation.
|
||||||
The config file is the same config file used for training.
|
The config file is the same config file used for training.
|
||||||
|
|
||||||
Pass the appropriate flag to the inference command, depending upon what kind of model was trained:
|
Pass the appropriate flag to the inference command, depending upon what kind of model was trained:
|
||||||
|
|
||||||
@@ -1027,7 +1027,7 @@ Please use `--sample_packing False` if you have it on and receive the error simi
|
|||||||
Add below flag to train command above
|
Add below flag to train command above
|
||||||
|
|
||||||
```bash
|
```bash
|
||||||
python3 -m axolotl.cli.merge_lora examples/your_config.yml --lora_model_dir="./completed-model" --load_in_8bit=False --load_in_4bit=False
|
python3 -m axolotl.cli.merge_lora examples/your_config.yml --lora_model_dir="./completed-model"
|
||||||
```
|
```
|
||||||
|
|
||||||
If you run out of CUDA memory, you can try to merge in system RAM with
|
If you run out of CUDA memory, you can try to merge in system RAM with
|
||||||
|
|||||||
@@ -18,7 +18,15 @@ def do_cli(config: Path = Path("examples/"), **kwargs):
|
|||||||
return_remaining_strings=True
|
return_remaining_strings=True
|
||||||
)
|
)
|
||||||
parsed_cli_args.merge_lora = True
|
parsed_cli_args.merge_lora = True
|
||||||
parsed_cfg = load_cfg(config, merge_lora=True, **kwargs)
|
|
||||||
|
parsed_cfg = load_cfg(
|
||||||
|
config,
|
||||||
|
merge_lora=True,
|
||||||
|
load_in_8bit=False,
|
||||||
|
load_in_4bit=False,
|
||||||
|
flash_attention=False,
|
||||||
|
**kwargs
|
||||||
|
)
|
||||||
|
|
||||||
do_merge_lora(cfg=parsed_cfg, cli_args=parsed_cli_args)
|
do_merge_lora(cfg=parsed_cfg, cli_args=parsed_cli_args)
|
||||||
|
|
||||||
|
|||||||
Reference in New Issue
Block a user