From 2bf61d8e25cc1bc47522ccf87b924181eafe1422 Mon Sep 17 00:00:00 2001 From: mhenrhcsen Date: Sun, 1 Jun 2025 22:50:17 +0200 Subject: [PATCH] fix abbriviatation spelling error --- docs/rlhf.qmd | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/rlhf.qmd b/docs/rlhf.qmd index af3fe8767..b2687a8f9 100644 --- a/docs/rlhf.qmd +++ b/docs/rlhf.qmd @@ -16,7 +16,7 @@ feedback. Various methods include, but not limited to: - [Identity Preference Optimization (IPO)](#ipo) - [Kahneman-Tversky Optimization (KTO)](#kto) - [Odds Ratio Preference Optimization (ORPO)](#orpo) -- [Group Relative Policy Optimization (GPRO)](#grpo) +- [Group Relative Policy Optimization (GRPO)](#grpo) - Proximal Policy Optimization (PPO) (not yet supported in axolotl, if you're interested in contributing, please reach out!)