Dan Saunders
208fb7b8e7
basic torchao fp8 mixed precision training (#2926)
* debug
* debug
* debug
* revert unneeded change
* add accelerator config to base trainer builder
* add back accumulated_cache_size_limit setting
* lint
* accelerator constructor patch for single-GPU torch fp8
* lint
* re-using existing fp8 code
* lint
* remove accelerate patch now fix in latest release
* fix
* docs
* add fp8 + fsdp2 example
* remove unused config
* update config
* smoke tests
* add validator
* add 2.7.0 guard for fsdp2
* fix
* add config descriptions
* add FSDP doc link
* nit
* set force_recompute_fp8_weight_in_bwd with enable_fsdp_float8_all_gather
* better cfg for smoke tests
* add test for accelerate patching
* update fp8 validator
2025-07-22 16:27:47 -04:00
..
2025-04-11 09:51:59 -04:00
2025-07-21 11:39:38 -04:00
2025-07-22 16:27:47 -04:00
2023-12-12 09:39:22 -08:00
2025-06-03 14:04:15 -07:00
2025-07-22 16:27:47 -04:00
2025-07-09 09:22:35 -04:00
2025-07-14 22:33:35 -04:00
2025-07-21 11:40:56 -04:00
2025-03-31 13:40:12 +07:00
2025-07-14 10:05:26 -04:00
2024-12-02 08:47:10 -05:00
2025-04-05 01:25:44 -04:00
2025-06-23 23:08:46 -04:00
2025-03-31 13:40:12 +07:00
2025-07-21 11:39:38 -04:00
2025-03-21 11:02:43 -04:00
2025-07-21 11:39:38 -04:00
2025-03-21 11:02:43 -04:00
2024-03-14 11:05:42 -04:00
2025-05-23 15:51:11 -04:00
2025-05-23 15:51:11 -04:00
2025-07-14 09:25:44 -04:00
2025-06-27 11:02:51 -04:00
2025-07-21 11:39:38 -04:00
2025-03-29 08:30:06 -04:00
2025-03-21 11:02:43 -04:00
2025-06-03 14:04:15 -07:00
2024-08-22 11:46:57 -04:00
2025-03-21 11:02:43 -04:00
2025-05-23 15:51:11 -04:00
2025-07-14 10:05:26 -04:00
2025-07-09 09:22:35 -04:00