chore(docs): add cookbook/blog link to docs (#2410) [skip ci]

This commit is contained in:
NanoCode012
2025-03-17 19:38:19 +07:00
committed by GitHub
parent 4f5eb42a73
commit 7235123d44
3 changed files with 12 additions and 0 deletions

View File

@@ -497,6 +497,10 @@ The input format is a simple JSON input with customizable fields based on the ab
### GRPO
::: {.callout-tip}
Check out our [GRPO cookbook](https://github.com/axolotl-ai-cloud/axolotl-cookbook/tree/main/grpo#training-an-r1-style-large-language-model-using-grpo).
:::
GRPO uses custom reward functions and transformations. Please have them ready locally.
For ex, to load OpenAI's GSM8K and use a random reward for completions: