Built site for gh-pages

Quarto GHA Workflow Runner
2025-06-14 18:56:28 +00:00
parent 84db47f3c0
commit 5b66b8e86c
6 changed files with 393 additions and 339 deletions

@@ -984,7 +984,7 @@ Tip
</div>
</div>
<div class="callout-body-container callout-body">
-<p>Check out our <a href="https://github.com/axolotl-ai-cloud/axolotl-cookbook/tree/main/grpo#training-an-r1-style-large-language-model-using-grpo">GRPO cookbook</a>.</p>
+<p>Check out our <a href="https://github.com/axolotl-ai-cloud/grpo_code">GRPO cookbook</a>.</p>
</div>
</div>
<p>In the latest GRPO implementation, <code>vLLM</code> is used to significantly speed up trajectory generation during training. This example uses 4 GPUs (2 for training and 2 for vLLM):</p>