add support for rpo_alpha (#1681)

* add support for rpo_alpha

* Add smoke test for dpo + nll loss
This commit is contained in:
Wing Lian
2024-06-04 16:09:51 -04:00
committed by GitHub
parent 1f151c0d52
commit c996881ec2
4 changed files with 58 additions and 3 deletions

View File

@@ -39,6 +39,6 @@ s3fs
gcsfs
# adlfs
trl==0.8.6
trl @ git+https://github.com/huggingface/trl.git@f18253bf2d747f68acc9cd89da95c85ebf59dbb9
zstandard==0.22.0
fastcore