axolotl/examples/llama-4

Llama 4 by Meta AI

Available Examples

Llama 4 Scout: 17B active parameters x 16 experts (109B total)

Our single-GPU implementation for Llama 4 Scout needs only 68.5 GB of VRAM for post-training at a 4k context length, running at 546 tokens/second.
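The single-GPU, 4k-context setup described above can be sketched as a minimal Axolotl config. This is an illustrative assumption, not the shipped example file: the model id, dataset, and hyperparameters are placeholders, and the maintained config in this directory should be preferred.

```yaml
# Sketch of a QLoRA post-training config for Llama 4 Scout.
# Model id, dataset, and hyperparameters are illustrative assumptions.
base_model: meta-llama/Llama-4-Scout-17B-16E  # assumed Hugging Face id

load_in_4bit: true       # 4-bit quantization helps fit the 109B MoE on one GPU
adapter: qlora
lora_r: 16
lora_alpha: 32
lora_dropout: 0.05
lora_target_linear: true

sequence_len: 4096       # matches the 4k context length quoted above

datasets:
  - path: tatsu-lab/alpaca   # placeholder dataset
    type: alpaca

micro_batch_size: 1
gradient_accumulation_steps: 4
num_epochs: 1
learning_rate: 0.0002
optimizer: adamw_torch
bf16: true
flash_attention: true

output_dir: ./outputs/llama-4-scout-qlora
```

With a recent Axolotl install, a config like this would typically be launched with `axolotl train config.yaml` (or `accelerate launch -m axolotl.cli.train config.yaml` on older versions).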