tocmo0nlord/axolotl
b7e8f66e5a78d775f5bb9e7f0aa2f2f6a2458b71
axolotl/deepspeed_configs
Wing Lian e207762928 fix deprecate deepspeed stage3_gather_16bit_weights_on_model_save arg (#2956) [skip ci]
* fix deprecate deepspeed stage3_gather_16bit_weights_on_model_save arg

* replace the rest of the migrated deepspeed params
2025-07-21 11:41:31 -04:00
File | Last commit | Date
zero1_torch_compile.json | add deepspeed example with torch compile enabled (#2212) [skip ci] | 2024-12-22 12:11:39 -05:00
zero1.json | Set gradient_clipping to auto in DeepSpeed configs (#1382) [skip ci] | 2024-03-10 20:50:12 -04:00
zero2_torch_compile.json | KD fix w/ online distillation (#2700) [skip ci] | 2025-06-17 12:09:13 -04:00
zero2.json | Set gradient_clipping to auto in DeepSpeed configs (#1382) [skip ci] | 2024-03-10 20:50:12 -04:00
zero3_bf16_cpuoffload_all.json | fix deprecate deepspeed stage3_gather_16bit_weights_on_model_save arg (#2956) [skip ci] | 2025-07-21 11:41:31 -04:00
zero3_bf16_cpuoffload_params.json | fix deprecate deepspeed stage3_gather_16bit_weights_on_model_save arg (#2956) [skip ci] | 2025-07-21 11:41:31 -04:00
zero3_bf16.json | fix deprecate deepspeed stage3_gather_16bit_weights_on_model_save arg (#2956) [skip ci] | 2025-07-21 11:41:31 -04:00
zero3.json | fix deprecate deepspeed stage3_gather_16bit_weights_on_model_save arg (#2956) [skip ci] | 2025-07-21 11:41:31 -04:00