Support device_map=sequential & max_memory config parameters (#903)

* Support device_map sequential (and others). Support max_memory in cfg.

* Update documentation in README accordingly.

* Update README.md

---------

Co-authored-by: Wing Lian <wing.lian@gmail.com>
Bryan Thornbury
2023-12-04 06:29:21 -08:00
committed by GitHub
parent a1da39cd48
commit 992e742cdc
3 changed files with 8 additions and 1 deletions


@@ -612,6 +612,12 @@ eval_sample_packing:
sample_packing_eff_est:
total_num_tokens:
# Passed through to transformers when loading the model, if launched without accelerate
# Use `sequential` when training with model parallelism to limit memory
device_map:
# Defines the max memory usage per gpu on the system. Passed through to transformers when loading the model.
max_memory:
# If you want to use 'lora' or 'qlora' or leave blank to train all parameters in original model
adapter: lora
# If you already have a lora model trained that you want to load, put that here.
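
For illustration, the two new options might be set together as in the sketch below. The values are assumptions, not taken from this commit: the per-device limits and the two-GPU layout are hypothetical, and the `max_memory` mapping follows the device-to-limit dictionary shape that transformers accepts when loading a model (integer keys for GPU indices, `cpu` for host memory).

```yaml
# Hypothetical example values; adjust to your hardware.
# Fill devices in order rather than balancing layers across them.
device_map: sequential
# Upper bound on memory used per device when the model is loaded.
max_memory:
  0: 20GiB
  1: 20GiB
  cpu: 64GiB
```

Per the comments in the diff, `device_map` only takes effect when training is launched without accelerate, so a fragment like this would apply to a plain single-process launch.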