Doc fix: TORCH_ROCM_AOTRITON_ENABLE_EXPERIMENTAL not necessary to use Triton kernel patches (#2343)
* removing note about TORCH_ROCM_AOTRITON_ENABLE_EXPERIMENTAL * suggest using TORCH_ROCM_AOTRITON_ENABLE_EXPERIMENTAL for memory efficient attn
This commit is contained in:
@@ -82,7 +82,7 @@ lora_o_kernel: true
|
|||||||
## Requirements
|
## Requirements
|
||||||
|
|
||||||
- One or more NVIDIA or AMD GPUs (in order to use the Triton kernels)
|
- One or more NVIDIA or AMD GPUs (in order to use the Triton kernels)
|
||||||
- AMD can be used with experimental Triton support by setting the environment variable `TORCH_ROCM_AOTRITON_ENABLE_EXPERIMENTAL=1`
|
- Note: Set `TORCH_ROCM_AOTRITON_ENABLE_EXPERIMENTAL=1` to enable [memory-efficient attention on AMD GPUs](https://github.com/ROCm/aotriton/issues/16#issuecomment-2346675491)
|
||||||
- Targeted LoRA adapters cannot use Dropout
|
- Targeted LoRA adapters cannot use Dropout
|
||||||
- This may limit model expressivity / cause overfitting
|
- This may limit model expressivity / cause overfitting
|
||||||
- Targeted LoRA adapters cannot have bias terms
|
- Targeted LoRA adapters cannot have bias terms
|
||||||
|
|||||||
Reference in New Issue
Block a user