Merge pull request #130 from OpenAccess-AI-Collective/gas
swap batch size for gradient accumulation steps to decouple from the number of GPUs
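A minimal sketch of the arithmetic behind this change, assuming the usual data-parallel relationship in which the effective batch size is the per-GPU micro batch size times the gradient accumulation steps times the number of GPUs. The function and parameter names below are hypothetical illustrations, not code from this repository. It shows why deriving accumulation steps from a global batch size ties the setting to the GPU count, while setting accumulation steps directly does not.

```python
def effective_batch_size(micro_batch_size: int,
                         gradient_accumulation_steps: int,
                         num_gpus: int) -> int:
    """Samples contributing to one optimizer step across all GPUs."""
    return micro_batch_size * gradient_accumulation_steps * num_gpus


def derived_accumulation_steps(batch_size: int,
                               micro_batch_size: int,
                               num_gpus: int) -> int:
    """Hypothetical 'before' style: accumulation steps are computed from a
    global batch size, so the result changes whenever num_gpus changes."""
    return max(1, batch_size // (micro_batch_size * num_gpus))


if __name__ == "__main__":
    # Derived style: the same config yields different accumulation steps
    # depending on how many GPUs the job happens to run on.
    for gpus in (1, 2, 4):
        steps = derived_accumulation_steps(batch_size=32,
                                           micro_batch_size=4,
                                           num_gpus=gpus)
        print(f"derived: gpus={gpus} accumulation_steps={steps}")

    # Direct style: accumulation steps are fixed in the config, and only the
    # effective batch size scales with the number of GPUs.
    for gpus in (1, 2, 4):
        total = effective_batch_size(micro_batch_size=4,
                                     gradient_accumulation_steps=8,
                                     num_gpus=gpus)
        print(f"direct: gpus={gpus} effective_batch_size={total}")
```

With the direct style, the per-GPU training loop is identical on any number of GPUs; the trade-off is that the effective batch size grows with the GPU count, so the learning rate may need adjusting when scaling out.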