Logo
Explore Help
Sign In
tocmo0nlord/axolotl
1
0
Fork 0
You've already forked axolotl
Code Issues Pull Requests Actions 2 Packages Projects Releases Wiki Activity
Files
73a84ad0dd8f6736b8c1c16912e26bf15e2865c0
axolotl/deepspeed_configs
History
Wing Lian 49e2fa825d additional plugin collator kwargs, don't scale up kd loss by t^2
2025-06-05 15:18:19 -07:00
..
zero1_torch_compile.json
add deepspeed example with torch compile enabled (#2212) [skip ci]
2024-12-22 12:11:39 -05:00
zero1.json
Set gradient_clipping to auto in DeepSpeed configs (#1382) [skip ci]
2024-03-10 20:50:12 -04:00
zero2_torch_compile.json
additional plugin collator kwargs, don't scale up kd loss by t^2
2025-06-05 15:18:19 -07:00
zero2.json
Set gradient_clipping to auto in DeepSpeed configs (#1382) [skip ci]
2024-03-10 20:50:12 -04:00
zero3_bf16_cpuoffload_all.json
fix zero3 (#1994)
2024-10-28 07:32:49 -04:00
zero3_bf16_cpuoffload_params.json
fix zero3 (#1994)
2024-10-28 07:32:49 -04:00
zero3_bf16.json
fix zero3 (#1994)
2024-10-28 07:32:49 -04:00
zero3.json
Set gradient_clipping to auto in DeepSpeed configs (#1382) [skip ci]
2024-03-10 20:50:12 -04:00
Powered by Gitea Version: 1.25.4 Page: 240ms Template: 14ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API