2.1 KiB
2.1 KiB
Axolotl
Go ahead and axolotl questions
Support Matrix
| fp16/fp32 | fp16/fp32 w/ lora | 4bit-quant | 4bit-quant w/flash attention | flash attention | xformers attention | |
|---|---|---|---|---|---|---|
| llama | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| Pythia | ✅ | ✅ | ❌ | ❌ | ❌ | ❓ |
| cerebras | ✅ | ✅ | ❌ | ❌ | ❌ | ❓ |
Getting Started
- Point the config you are using to a huggingface hub dataset (see configs/llama_7B_4bit.yml)
datasets:
- path: vicgalle/alpaca-gpt4
type: alpaca
-
Optionally Download some datasets, see data/README.md
-
Create a new or update the existing YAML config config/pythia_1_2B_alpaca.yml
-
Install python dependencies with ONE of the following:
pip3 install -e .[int4](recommended)pip3 install -e .[int4_triton]pip3 install -e .
-
If not using
int4orint4_triton, runpip install "peft @ git+https://github.com/huggingface/peft.git" -
Configure accelerate
accelerate configor update~/.cache/huggingface/accelerate/default_config.yaml
compute_environment: LOCAL_MACHINE
distributed_type: MULTI_GPU
downcast_bf16: 'no'
gpu_ids: all
machine_rank: 0
main_training_function: main
mixed_precision: bf16
num_machines: 1
num_processes: 4
rdzv_backend: static
same_network: true
tpu_env: []
tpu_use_cluster: false
tpu_use_sudo: false
use_cpu: false
- Train!
accelerate launch scripts/finetune.py, make sure to choose the correct YAML config file - Alternatively you can pass in the config file like:
accelerate launch scripts/finetune.py configs/llama_7B_alpaca.yml~~