Clarify pre-tokenize before multigpu (#359)

This commit is contained in:
NanoCode012
2023-08-11 11:27:42 +09:00
committed by GitHub
parent 11ddccb80f
commit 94d03c8402

View File

@@ -524,7 +524,14 @@ Run
accelerate launch scripts/finetune.py configs/your_config.yml
```
#### Multi-GPU Config
#### Multi-GPU
It is recommended to pre-tokenize dataset with the following before finetuning:
```bash
CUDA_VISIBLE_DEVICES="" accelerate ... --prepare_ds_only
```
##### Config
- llama FSDP
```yaml