fix: switch to using the HuggingFace Transformers NEFT implementation (#941)

* fix: switch to using the HuggingFace Transformers NEFT implementation

* linter

* add support for noisy_embedding_alpha with a warning about it being renamed

* restore pre/posttrain_hooks

* move validation of NEFT noise alpha into validate_config()

* linter
This commit is contained in:
kallewoof
2023-12-14 07:15:34 +09:00
committed by GitHub
parent 5ea3aa31f0
commit ef24342538
5 changed files with 23 additions and 71 deletions

View File

@@ -774,7 +774,7 @@ max_grad_norm:
# Augmentation techniques
# NEFT https://arxiv.org/abs/2310.05914, set this to a number (paper default is 5) to add noise to embeddings
# currently only supported on Llama and Mistral
noisy_embedding_alpha:
neftune_noise_alpha:
# Whether to bettertransformers
flash_optimum: