fix: switch to using the HuggingFace Transformers NEFT implementation (#941)

* fix: switch to using the HuggingFace Transformers NEFT implementation * linter * add support for noisy_embedding_alpha with a warning about it being renamed * restore pre/posttrain_hooks * move validation of NEFT noise alpha into validate_config() * linter
2023-12-14 07:15:34 +09:00
parent 5ea3aa31f0
commit ef24342538
5 changed files with 23 additions and 71 deletions
--- a/README.md
+++ b/README.md
@@ -774,7 +774,7 @@ max_grad_norm:
 # Augmentation techniques
 # NEFT https://arxiv.org/abs/2310.05914, set this to a number (paper default is 5) to add noise to embeddings
 # currently only supported on Llama and Mistral
-noisy_embedding_alpha:
+neftune_noise_alpha:

 # Whether to bettertransformers
 flash_optimum: