Dan Saunders
208fb7b8e7
basic torchao fp8 mixed precision training (#2926)
* debug
* debug
* debug
* revert unneeded change
* add accelerator config to base trainer builder
* add back accumulated_cache_size_limit setting
* lint
* accelerator constructor patch for single-GPU torch fp8
* lint
* re-using existing fp8 code
* lint
* remove accelerate patch now fixed in latest release
* fix
* docs
* add fp8 + fsdp2 example
* remove unused config
* update config
* smoke tests
* add validator
* add 2.7.0 guard for fsdp2
* fix
* add config descriptions
* add FSDP doc link
* nit
* set force_recompute_fp8_weight_in_bwd with enable_fsdp_float8_all_gather
* better cfg for smoke tests
* add test for accelerate patching
* update fp8 validator
2025-07-22 16:27:47 -04:00