axolotl

Files

NanoCode012 5c7e89105d Fix: modelloader handling of model_kwargs load_in*bit (#1999 )

* fix: load_in_*bit not properly read

* fix: load_*bit check

* fix: typo

* refactor: load * bit handling

* feat: add test dpo lora multi-gpu

* fix: turn off sample packing for dpo

* fix: missing warmup_steps

* fix: test to load in 8bit for lora

* skip 8bit lora on h100, add 4bit lora on h100 to multi gpu tests

* chore: reduce max_steps

---------

Co-authored-by: Wing Lian <wing.lian@gmail.com>

2024-10-30 14:41:34 -04:00

__init__.py

Attempt to run multigpu in PR CI for now to ensure it works (#1815 ) [skip ci]

2024-08-09 11:50:13 -04:00

test_eval.py

memoize dataset length for eval sample packing (#1974 )

2024-10-17 15:15:29 -04:00

test_llama.py

Fix: modelloader handling of model_kwargs load_in*bit (#1999 )

2024-10-30 14:41:34 -04:00

test_qwen2.py

Fix: modelloader handling of model_kwargs load_in*bit (#1999 )

2024-10-30 14:41:34 -04:00