Built site for gh-pages

Quarto GHA Workflow Runner
2024-09-05 14:12:25 +00:00
parent 253e9163db
commit 097ec6570f
5 changed files with 34 additions and 34 deletions

@@ -322,7 +322,7 @@ pre > code.sourceCode > span > a:first-child::before { text-decoration: underlin
<li>Pass an empty <code>type:</code> in your axolotl config.</li>
<li>Columns in Dataset must be exactly <code>input_ids</code>, <code>attention_mask</code>, <code>labels</code></li>
<li>To indicate that a token should be ignored during training, set its corresponding label to <code>-100</code>.</li>
-<li>Do not add BOS/EOS. Axolotl will add them for you based on the default tokenizer for the model you're using.</li>
+<li>You must add BOS and EOS, and make sure that you are training on EOS by not setting its label to -100.</li>
<li>For pretraining, do not truncate/pad documents to the context window length.</li>
<li>For instruction training, documents must be truncated/padded as desired.</li>
</ul>
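
The requirements in this hunk can be sketched as a small row builder. This is a minimal illustration, not axolotl code: `build_example` is a hypothetical helper, the token ids are made up, and masking the prompt with `-100` is one possible choice rather than something the docs require. It follows the updated wording, where BOS and EOS are added by the dataset author and EOS keeps a real label.

```python
IGNORE_INDEX = -100  # label value that marks a token as ignored during training


def build_example(prompt_ids, response_ids, bos_id, eos_id):
    """Assemble one pre-tokenized row (hypothetical helper, not part of axolotl)."""
    # BOS and EOS are added here, since they are not added for you.
    input_ids = [bos_id] + prompt_ids + response_ids + [eos_id]
    # Assumption: mask BOS and the prompt; train on the response and on EOS.
    labels = [IGNORE_INDEX] * (1 + len(prompt_ids)) + response_ids + [eos_id]
    # No padding in this sketch, so every position is attended to.
    attention_mask = [1] * len(input_ids)
    return {
        "input_ids": input_ids,
        "attention_mask": attention_mask,
        "labels": labels,
    }


# Made-up token ids, for illustration only.
row = build_example(prompt_ids=[11, 12, 13], response_ids=[21, 22], bos_id=1, eos_id=2)
```

The resulting dictionary has exactly the three columns named above, and the last label is the EOS id rather than `-100`, so EOS contributes to the loss.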