This website requires JavaScript.
Explore
Help
Sign In
tocmo0nlord
/
axolotl
Watch
1
Star
0
Fork
0
You've already forked axolotl
Code
Issues
Pull Requests
Actions
3
Packages
Projects
Releases
Wiki
Activity
Files
770bb0605a35020fbb90ab533ea2f6f643e1db56
axolotl
/
deepspeed_configs
History
Wing Lian
49e2fa825d
additional plugin collator kwargs, don't scale up kd loss by t^2
2025-06-05 15:18:19 -07:00
..
zero1_torch_compile.json
add deepspeed example with torch compile enabled (
#2212
) [skip ci]
2024-12-22 12:11:39 -05:00
zero1.json
Set
gradient_clipping
to
auto
in DeepSpeed configs (
#1382
) [skip ci]
2024-03-10 20:50:12 -04:00
zero2_torch_compile.json
additional plugin collator kwargs, don't scale up kd loss by t^2
2025-06-05 15:18:19 -07:00
zero2.json
Set
gradient_clipping
to
auto
in DeepSpeed configs (
#1382
) [skip ci]
2024-03-10 20:50:12 -04:00
zero3_bf16_cpuoffload_all.json
fix zero3 (
#1994
)
2024-10-28 07:32:49 -04:00
zero3_bf16_cpuoffload_params.json
fix zero3 (
#1994
)
2024-10-28 07:32:49 -04:00
zero3_bf16.json
fix zero3 (
#1994
)
2024-10-28 07:32:49 -04:00
zero3.json
Set
gradient_clipping
to
auto
in DeepSpeed configs (
#1382
) [skip ci]
2024-03-10 20:50:12 -04:00