This website requires JavaScript.
Explore
Help
Sign In
tocmo0nlord
/
axolotl
Watch
1
Star
0
Fork
0
You've already forked axolotl
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
47
Commits
319
Branches
32
Tags
097d367af6bb92d0ed30f35ddd6406f13dedc59a
Commit Graph
3 Commits
Author
SHA1
Message
Date
Wing Lian
4f2584f2dc
shuffle and split dataset after save/load
2023-04-24 09:41:35 -04:00
Wing Lian
d1aed4c8e5
deepspeed doesn't work with flash-attn, and the gpu savings w flash attn are better than the deepspeed headaches
2023-04-16 06:59:47 -04:00
Wing Lian
05fffb53b4
more logging, wandb fixes
2023-04-15 13:37:17 -04:00