Built site for gh-pages

2025-09-25 05:12:06 +00:00
parent dc4adee7b0
commit 3c0d96db45
203 changed files with 6888 additions and 4524 deletions
--- a/docs/qat.html
+++ b/docs/qat.html
@@ -75,7 +75,7 @@ pre > code.sourceCode > span > a:first-child::before { text-decoration: underlin
 <link href="../site_libs/quarto-html/quarto-syntax-highlighting-dark-b651517ce65839d647a86e2780455cfb.css" rel="stylesheet" id="quarto-text-highlighting-styles">
 <script src="../site_libs/bootstrap/bootstrap.min.js"></script>
 <link href="../site_libs/bootstrap/bootstrap-icons.css" rel="stylesheet">
-<link href="../site_libs/bootstrap/bootstrap-f9d679a32da2b248d4ca48a0e58e089e.min.css" rel="stylesheet" append-hash="true" id="quarto-bootstrap" data-mode="dark">
+<link href="../site_libs/bootstrap/bootstrap-08d9eb451d58809f35fda8b852d737d8.min.css" rel="stylesheet" append-hash="true" id="quarto-bootstrap" data-mode="dark">
 <script id="quarto-search-options" type="application/json">{
  "location": "navbar",
  "copy-button": false,
@@ -361,6 +361,12 @@ gtag('config', 'G-9KYCVJBNMQ', { 'anonymize_ip': true});
  <a href="../docs/quantize.html" class="sidebar-item-text sidebar-link">
 <span class="menu-text">Quantization with torchao</span></a>
  </div>
+</li>
+          <li class="sidebar-item">
+  <div class="sidebar-item-container"> 
+  <a href="../docs/optimizations.html" class="sidebar-item-text sidebar-link">
+ <span class="menu-text">Optimizations Guide</span></a>
+  </div>
 </li>
      </ul>
  </li>
@@ -548,12 +554,14 @@ and the QAT documentation in the <a href="https://github.com/pytorch/ao/tree/mai
 <span id="cb1-3"><a href="#cb1-3" aria-hidden="true" tabindex="-1"></a><span class="at">  </span><span class="fu">weight_dtype</span><span class="kw">:</span><span class="co"> # Optional[str] = "int8". Fake quantization layout to use for weight quantization. Valid options are "int4", "fp8", and "nvfp4".</span></span>
 <span id="cb1-4"><a href="#cb1-4" aria-hidden="true" tabindex="-1"></a><span class="at">  </span><span class="fu">group_size</span><span class="kw">:</span><span class="co"> # Optional[int] = 32. The number of elements in each group for per-group fake quantization</span></span>
 <span id="cb1-5"><a href="#cb1-5" aria-hidden="true" tabindex="-1"></a><span class="at">  </span><span class="fu">fake_quant_after_n_steps</span><span class="kw">:</span><span class="co"> # Optional[int] = None. The number of steps to apply fake quantization after</span></span></code></pre></div><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></div>
-<p>We support the following quantization schemas:
- <code>Int4WeightOnly</code> (requires the <code>fbgemm-gpu</code> extra when installing Axolotl)
- <code>Int8DynamicActivationInt4Weight</code>
- <code>Float8DynamicActivationFloat8Weight</code>
- <code>Float8DynamicActivationInt4Weight</code>
- <code>NVFP4</code></p>
+<p>We support the following quantization schemas:</p>
+<ul>
+<li><code>Int4WeightOnly</code> (requires the <code>fbgemm-gpu</code> extra when installing Axolotl)</li>
+<li><code>Int8DynamicActivationInt4Weight</code></li>
+<li><code>Float8DynamicActivationFloat8Weight</code></li>
+<li><code>Float8DynamicActivationInt4Weight</code></li>
+<li><code>NVFP4</code></li>
+</ul>
 <p>Once you have finished training, you must quantize your model using the same quantization configuration that you used during training. You can use the <a href="../docs/quantize.html"><code>quantize</code></a> command to do this.</p>