Act offload lora fix (#2928) [skip ci]

* fix activation offloading with lora

* update w e2e test

* add docs for error
This commit is contained in:
Wing Lian
2025-07-24 16:10:04 -04:00
committed by GitHub
parent 1407aac779
commit 0ff2f172ef
5 changed files with 275 additions and 83 deletions

View File

@@ -136,3 +136,7 @@ description: Frequently asked questions
> dynamic: false
> mode: max-autotune-no-cudagraphs
> ```
**Q: `ValueError("Backward pass should have cleared tracker of all tensors")`
> A: This may happen due to edge cases in using the modern OffloadActivations context manager for CUDA streams. If you encounter this error, you may have success using the naive implementation with `offload_activations: legacy` in your YAML.