Wing Lian 02af0820f7 Jamba (#1451)

* fixes for larger models

* add qlora example for deepspeed

* add readme for jamba

2024-03-28 21:03:22 -04:00

# Jamba

QLoRA with DeepSpeed: requires at least 2 GPUs, each with 35 GiB of VRAM.

QLoRA on a single GPU: training will start, but the loss is off by an order of magnitude.
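
As a rough pre-flight check, the DeepSpeed requirement above can be sketched in Python. This is a minimal sketch, not part of axolotl: the function name `can_run_qlora_deepspeed` and both constants are hypothetical, and it assumes the note means "at least two GPUs, each with 35 GiB or more". The per-GPU VRAM values are passed in as plain numbers; with PyTorch they could be read from `torch.cuda.get_device_properties(i).total_memory` (bytes) and divided by `1024**3`.

```python
from typing import Sequence

# Thresholds taken from the note above; the names are hypothetical.
MIN_GPUS = 2
MIN_VRAM_GIB = 35.0


def can_run_qlora_deepspeed(vram_gib_per_gpu: Sequence[float]) -> bool:
    """Return True if at least MIN_GPUS GPUs each have MIN_VRAM_GIB or more.

    Assumption: GPUs below the VRAM threshold are ignored rather than
    disqualifying the whole machine.
    """
    eligible = [v for v in vram_gib_per_gpu if v >= MIN_VRAM_GIB]
    return len(eligible) >= MIN_GPUS
```

For example, `can_run_qlora_deepspeed([40.0, 40.0])` passes the check, while a single 80 GiB GPU (`[80.0]`) does not, which matches the single-GPU caveat above.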