* fix optimizer reset * set states to reset for 8bit optimizers and handle quantile runtime error for embeddings * fix relora test to check grad_norm * use flash attn for relora and tweak hyperparams for test * fix messages field for test dataset
5 lines
37 B
Plaintext
5 lines
37 B
Plaintext
pre-commit
|
|
black
|
|
mypy
|
|
types-requests
|