Cosine learning rate schedule - minimum learning rate (#1062)

* Cosine min lr

* Cosine min lr - warn if using deepspeed

* cosine_min_lr_ratio readme

* chore: lint

---------

Co-authored-by: Wing Lian <wing.lian@gmail.com>
This commit is contained in:
Ricardo Dominguez-Olmedo
2024-01-09 15:29:56 +01:00
committed by GitHub
parent c3e8165f26
commit 04b978b428
3 changed files with 61 additions and 1 deletions

View File

@@ -755,6 +755,7 @@ early_stopping_patience: 3
# Specify a scheduler and kwargs to use with the optimizer
lr_scheduler: # 'one_cycle' | 'log_sweep' | empty for cosine
lr_scheduler_kwargs:
cosine_min_lr_ratio: # decay lr to some percentage of the peak lr, e.g. cosine_min_lr_ratio=0.1 for 10% of peak lr
# For one_cycle optim
lr_div_factor: # Learning rate div factor