Dan Saunders
208fb7b8e7
basic torchao fp8 mixed precision training (#2926)
* debug
* debug
* debug
* revert unneeded change
* add accelerator config to base trainer builder
* add back accumulated_cache_size_limit setting
* lint
* accelerator constructor patch for single-GPU torch fp8
* lint
* re-using existing fp8 code
* lint
* remove accelerate patch now fixed in latest release
* fix
* docs
* add fp8 + fsdp2 example
* remove unused config
* update config
* smoke tests
* add validator
* add 2.7.0 guard for fsdp2
* fix
* add config descriptions
* add FSDP doc link
* nit
* set force_recompute_fp8_weight_in_bwd with enable_fsdp_float8_all_gather
* better cfg for smoke tests
* add test for accelerate patching
* update fp8 validator
2025-07-22 16:27:47 -04:00