add to docs (#703)
This commit is contained in:
@@ -901,6 +901,8 @@ CUDA_VISIBLE_DEVICES="" python3 -m axolotl.cli.merge_lora ...
|
|||||||
|
|
||||||
## Common Errors 🧰
|
## Common Errors 🧰
|
||||||
|
|
||||||
|
See also the [FAQ's](./docs/faq.md).
|
||||||
|
|
||||||
> If you encounter a 'Cuda out of memory' error, it means your GPU ran out of memory during the training process. Here's how to resolve it:
|
> If you encounter a 'Cuda out of memory' error, it means your GPU ran out of memory during the training process. Here's how to resolve it:
|
||||||
|
|
||||||
Please reduce any below
|
Please reduce any below
|
||||||
|
|||||||
14
docs/faq.md
Normal file
14
docs/faq.md
Normal file
@@ -0,0 +1,14 @@
|
|||||||
|
# Axolotl FAQ's
|
||||||
|
|
||||||
|
|
||||||
|
> The trainer stopped and hasn't progressed in several minutes.
|
||||||
|
|
||||||
|
Usually an issue with the GPU's communicating with each other. See the [NCCL doc](../docs/nccl.md)
|
||||||
|
|
||||||
|
> Exitcode -9
|
||||||
|
|
||||||
|
This usually happens when you run out of system RAM.
|
||||||
|
|
||||||
|
> Exitcode -7 while using deepspeed
|
||||||
|
|
||||||
|
Try upgrading deepspeed w: `pip install -U deepspeed`
|
||||||
Reference in New Issue
Block a user