Add ORPO example and e2e test (#1572)
* add example for mistral orpo
* sample_packing: false for orpo
* go to load_dataset (since load_rl_datasets requires a transform_fn, which only dpo uses currently)
@@ -49,7 +49,7 @@ remove_unused_columns: false
 chat_template: chatml
 datasets:
   - path: argilla/ultrafeedback-binarized-preferences-cleaned
-    type: orpo.chat_template
+    type: chat_template.argilla
 ```
 
 #### Using local dataset files
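
For orientation, the changed `type` line belongs to the ORPO dataset block of a training config. The following is a minimal sketch of how the pieces might fit together, not the example file added by this commit. The `rl` and `orpo_alpha` keys and the `base_model` value are assumptions that do not appear in this diff; `sample_packing: false` comes from the commit message, and the remaining keys appear in the hunk above.

```yaml
# Hypothetical ORPO config fragment (sketch only).
base_model: mistralai/Mistral-7B-v0.1  # assumption: any Mistral checkpoint

rl: orpo          # assumption: key that selects ORPO training
orpo_alpha: 0.1   # assumption: illustrative weight, not taken from this diff
remove_unused_columns: false  # from the hunk header context

chat_template: chatml
datasets:
  - path: argilla/ultrafeedback-binarized-preferences-cleaned
    type: chat_template.argilla  # new value introduced by this diff

sample_packing: false  # per the commit message, disabled for ORPO
```

A config that still uses the old `type: orpo.chat_template` value would presumably need the same one-line change shown in the hunk.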