feat: add kto example (#2158) [skip ci]

NanoCode012
2024-12-09 20:17:27 +07:00
committed by GitHub
parent b1e8286c57
commit c78de6f214
3 changed files with 116 additions and 0 deletions

@@ -52,6 +52,26 @@ datasets:
type: chat_template.argilla
```
#### KTO
```yaml
rl: kto
rl_beta: 0.5
kto_desirable_weight: 0.2
remove_unused_columns: false
datasets:
  - path: argilla/ultrafeedback-binarized-preferences-cleaned-kto
    type: llama3.ultra
    split: train
gradient_checkpointing: true
gradient_checkpointing_kwargs:
  use_reentrant: true
```
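KTO learns from unpaired feedback: each example is a single completion marked as desirable or undesirable, rather than a chosen/rejected pair. The sketch below shows roughly what that row shape could look like when derived from a paired record. The field names `prompt`, `completion`, and `label` and the helper `to_kto_rows` are assumptions made for illustration; they are not taken from the dataset card or from the `llama3.ultra` loader, so check the actual dataset columns before relying on them.

```python
from typing import Dict, List, Union

# One unpaired KTO-style row: a prompt, a single completion, and a boolean label.
Row = Dict[str, Union[str, bool]]


def to_kto_rows(prompt: str, chosen: str, rejected: str) -> List[Row]:
    """Split one paired preference record into two unpaired rows (hypothetical helper)."""
    return [
        # The preferred answer becomes a desirable example.
        {"prompt": prompt, "completion": chosen, "label": True},
        # The rejected answer becomes an undesirable example.
        {"prompt": prompt, "completion": rejected, "label": False},
    ]


rows = to_kto_rows("What is 2 + 2?", "4", "5")
for row in rows:
    print(row)
```

Splitting pairs this way yields equal numbers of desirable and undesirable rows; datasets collected as unpaired feedback are often imbalanced, which is the situation a setting like `kto_desirable_weight` is meant to help with.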
#### Using local dataset files
```yaml
datasets: