feat: add gemma3n to the docs

This commit is contained in:
NanoCode012
2025-07-21 19:23:41 +07:00
parent 152f5d7b67
commit 46880ee977

View File

@@ -110,6 +110,18 @@ base_model: google/gemma-3-4b-it
chat_template: gemma3
```
### Gemma-3n {#sec-gemma-3n}
::: {.callout-note}
The model's initial loss and grad norm will be very high. We suspect this to be due to the Conv in the vision layers.
:::
```yaml
base_model: google/gemma-3n-E2B-it
chat_template: gemma3n
```
### Qwen2-VL {#sec-qwen2-vl}
```yaml