Logo
Explore Help
Sign In
tocmo0nlord/axolotl
1
0
Fork 0
You've already forked axolotl
Code Issues Pull Requests Actions 2 Packages Projects Releases Wiki Activity
Files
c7b1db329edf0f22e680582f6a7d68a3d459d88a
axolotl/deepspeed_configs
History
Wing Lian 49e2fa825d additional plugin collator kwargs, don't scale up kd loss by t^2
2025-06-05 15:18:19 -07:00
..
zero1_torch_compile.json
add deepspeed example with torch compile enabled (#2212) [skip ci]
2024-12-22 12:11:39 -05:00
zero1.json
Set gradient_clipping to auto in DeepSpeed configs (#1382) [skip ci]
2024-03-10 20:50:12 -04:00
zero2_torch_compile.json
additional plugin collator kwargs, don't scale up kd loss by t^2
2025-06-05 15:18:19 -07:00
zero2.json
Set gradient_clipping to auto in DeepSpeed configs (#1382) [skip ci]
2024-03-10 20:50:12 -04:00
zero3_bf16_cpuoffload_all.json
fix zero3 (#1994)
2024-10-28 07:32:49 -04:00
zero3_bf16_cpuoffload_params.json
fix zero3 (#1994)
2024-10-28 07:32:49 -04:00
zero3_bf16.json
fix zero3 (#1994)
2024-10-28 07:32:49 -04:00
zero3.json
Set gradient_clipping to auto in DeepSpeed configs (#1382) [skip ci]
2024-03-10 20:50:12 -04:00
Powered by Gitea Version: 1.25.4 Page: 136ms Template: 1ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API