Add ORPO example and e2e test (#1572)

* add example for mistral orpo

* sample_packing: false for orpo

* use load_dataset (since load_rl_datasets requires a transform_fn, which only dpo uses currently)
Motoki Wu
2024-04-27 09:07:06 -07:00
committed by GitHub
parent 68601ec6ad
commit 98c25e15cb
4 changed files with 131 additions and 1 deletions

.gitignore (vendored, 1 change)

@@ -133,6 +133,7 @@ venv/
 ENV/
 env.bak/
 venv.bak/
+venv3.10/
 # Spyder project settings
 .spyderproject