Oliver Molenschot
|
a4b1cc6df0
|
Add example YAML file for training Mistral using DPO (#2029) [skip ci]
* Add example YAML file for training Mistral using DPO
* chore: lint
* Apply suggestions from code review
Co-authored-by: NanoCode012 <kevinvong@rocketmail.com>
* Update mistral-dpo.yml
Adding qlora and removing role-related data (unecessary)
* Rename mistral-dpo.yml to mistral-dpo-qlora.yml
* Apply suggestions from code review
Co-authored-by: NanoCode012 <kevinvong@rocketmail.com>
---------
Co-authored-by: Wing Lian <wing.lian@gmail.com>
Co-authored-by: NanoCode012 <kevinvong@rocketmail.com>
|
2024-11-13 10:06:25 -05:00 |
|