Activation checkpointing with offloading to disk with prefetch (#2663)

* offload activations to disk instead of CPU RAM

* add prefetch

* Disco :dance:

* include offload_disk in e2e test for AC

* document and make sure to cleanup

* fix annotation to match docs

* fix docs build

* address PR feedback
This commit is contained in:
Wing Lian
2025-05-13 16:39:39 -04:00
parent 5a36b6ff2d
commit 5aefebe1fe
8 changed files with 577 additions and 11 deletions

View File

@@ -124,7 +124,8 @@ quartodoc:
- utils.optimizers.adopt
- utils.data.pretraining
- utils.data.sft
- utils.gradient_checkpointing.unsloth
- utils.gradient_checkpointing.offload_cpu
- utils.gradient_checkpointing.offload_disk
- title: Schemas
desc: Pydantic data models for Axolotl config
contents: