Files

Morgan McGuire 7019509daa Add wandb_entity to wandb options, update example configs, update README (#361)

* Update wandb_entity and add wandb descriptions

* add wandb to config section

* remove trailing whitespace for pre-commit hook

---------

Co-authored-by: Morgan McGuire <morganmcguire@Morgans-MacBook-Pro.local>
Co-authored-by: Wing Lian <wing.lian@gmail.com>

2023-08-12 12:17:11 -04:00

lora.yml

qlora.yml

README.md

feat/llama-2 examples (#319)

2023-08-03 19:22:48 +09:00

README.md

Overview

This is an example of a llama-2 configuration for the 7b and 13b models. The YAML files (lora.yml and qlora.yml) contain the configuration for the 7b variant, but you can just as well use the same settings for 13b.

The 7b variant fits on any GPU with 24 GB of VRAM. It takes up about 17 GB of VRAM during training with qlora and about 20 GB with lora. On an RTX 4090 it trains 3 epochs of the default dataset in about 15 minutes.

The 13b variant will fit if you change these settings to these values:

gradient_accumulation_steps: 2
micro_batch_size: 1

To train with qlora:

accelerate launch scripts/finetune.py examples/llama-2/qlora.yml

To train with lora:

accelerate launch scripts/finetune.py examples/llama-2/lora.yml
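
The 13b adjustment described above can also be scripted rather than edited by hand. The following is a minimal sketch, not part of the repository: the helper name `make_13b_config` and the output file name `qlora-13b.yml` are hypothetical, and it assumes both settings appear as top-level, unindented keys in the source config. It only rewrites the two values named above; the model reference in the config (assumed here to be the `base_model` key) would still need to point at the 13b weights.

```shell
# Hypothetical helper: copy a config and apply the two 13b settings.
# Assumes both keys are top-level (no indentation) in the source file;
# if a key is missing, the corresponding line is simply not changed.
make_13b_config() {
  src="$1"   # existing 7b config, e.g. examples/llama-2/qlora.yml
  dst="$2"   # new file to write; the name is up to you
  sed -E \
    -e 's/^gradient_accumulation_steps:.*/gradient_accumulation_steps: 2/' \
    -e 's/^micro_batch_size:.*/micro_batch_size: 1/' \
    "$src" > "$dst"
}

# Usage (qlora-13b.yml is a hypothetical file name):
#   make_13b_config examples/llama-2/qlora.yml examples/llama-2/qlora-13b.yml
#   accelerate launch scripts/finetune.py examples/llama-2/qlora-13b.yml
```

Writing to a separate file leaves the original 7b config untouched, so both variants can be launched from the same checkout.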