fix: add dequant bf16 repo (#3507) [skip ci]
@@ -6,9 +6,6 @@ Thanks to the team at MistralAI for giving us early access to prepare for this r
 
 ## Getting started
 
-Note: Training this model requires weights in BF16 which we will link to later.
-Users interested in training can convert / descale the existing FP8 weights.
-
 1. Install Axolotl following the [installation guide](https://docs.axolotl.ai/docs/installation.html).
 
 2. Install [Cut Cross Entropy](https://docs.axolotl.ai/docs/custom_integrations.html#cut-cross-entropy) to reduce training VRAM usage
@@ -1,4 +1,4 @@
-base_model: mistralai/Mistral-Small-4-119B-2603
+base_model: axolotl-ai-co/Mistral-Small-4-119B-2603-BF16
 
 plugins:
   - axolotl.integrations.cut_cross_entropy.CutCrossEntropyPlugin
@@ -1,4 +1,4 @@
-base_model: mistralai/Mistral-Small-4-119B-2603
+base_model: axolotl-ai-co/Mistral-Small-4-119B-2603-BF16
 processor_type: AutoProcessor
 
 plugins:
@@ -1,4 +1,4 @@
-base_model: mistralai/Mistral-Small-4-119B-2603
+base_model: axolotl-ai-co/Mistral-Small-4-119B-2603-BF16
 
 plugins:
   - axolotl.integrations.cut_cross_entropy.CutCrossEntropyPlugin
@@ -1,4 +1,4 @@
-base_model: mistralai/Mistral-Small-4-119B-2603
+base_model: axolotl-ai-co/Mistral-Small-4-119B-2603-BF16
 processor_type: AutoProcessor
 
 plugins: