Logo
Explore Help
Sign In
tocmo0nlord/axolotl
1
0
Fork 0
You've already forked axolotl
Code Issues Pull Requests Actions 3 Packages Projects Releases Wiki Activity
Files
bc1076d8a2fe567c64be3c040df3619fe9000109
axolotl/deepspeed_configs
History
Wing Lian e207762928 fix deprecate deepspeed stage3_gather_16bit_weights_on_model_save arg (#2956) [skip ci]
* fix deprecate deepspeed stage3_gather_16bit_weights_on_model_save arg

* replace the rest of the migrated deepspeed params
2025-07-21 11:41:31 -04:00
..
zero1_torch_compile.json
add deepspeed example with torch compile enabled (#2212) [skip ci]
2024-12-22 12:11:39 -05:00
zero1.json
Set gradient_clipping to auto in DeepSpeed configs (#1382) [skip ci]
2024-03-10 20:50:12 -04:00
zero2_torch_compile.json
KD fix w/ online distillation (#2700) [skip ci]
2025-06-17 12:09:13 -04:00
zero2.json
Set gradient_clipping to auto in DeepSpeed configs (#1382) [skip ci]
2024-03-10 20:50:12 -04:00
zero3_bf16_cpuoffload_all.json
fix deprecate deepspeed stage3_gather_16bit_weights_on_model_save arg (#2956) [skip ci]
2025-07-21 11:41:31 -04:00
zero3_bf16_cpuoffload_params.json
fix deprecate deepspeed stage3_gather_16bit_weights_on_model_save arg (#2956) [skip ci]
2025-07-21 11:41:31 -04:00
zero3_bf16.json
fix deprecate deepspeed stage3_gather_16bit_weights_on_model_save arg (#2956) [skip ci]
2025-07-21 11:41:31 -04:00
zero3.json
fix deprecate deepspeed stage3_gather_16bit_weights_on_model_save arg (#2956) [skip ci]
2025-07-21 11:41:31 -04:00
Powered by Gitea Version: 1.25.4 Page: 50ms Template: 2ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API