From 46880ee977ac545f575dedcc75d0e02069f022b4 Mon Sep 17 00:00:00 2001
From: NanoCode012
Date: Mon, 21 Jul 2025 19:23:41 +0700
Subject: [PATCH] feat: add gemma3n to the docs

---
 docs/multimodal.qmd | 12 ++++++++++++
 1 file changed, 12 insertions(+)

diff --git a/docs/multimodal.qmd b/docs/multimodal.qmd
index 2be3304d8..f6460ce5f 100644
--- a/docs/multimodal.qmd
+++ b/docs/multimodal.qmd
@@ -110,6 +110,18 @@ base_model: google/gemma-3-4b-it
 chat_template: gemma3
 ```
 
+### Gemma-3n {#sec-gemma-3n}
+
+::: {.callout-note}
+The model's initial loss and grad norm will be very high. We suspect this to be due to the Conv in the vision layers.
+:::
+
+```yaml
+base_model: google/gemma-3n-E2B-it
+
+chat_template: gemma3n
+```
+
 ### Qwen2-VL {#sec-qwen2-vl}
 
 ```yaml