# Llama 4 by Meta AI

## Available Examples

### Llama 4 Scout 17Bx16Experts (109B)
Our single-GPU implementation for Llama 4 Scout uses only 68.5 GB of VRAM for post-training with a 4k context length, at 546 tokens/second.
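As a rough sanity check on that memory figure, assuming the single-GPU path quantizes the 109B weights to 4 bits (an assumption on our part, not a statement of the repository's exact recipe), the weights alone account for most of the budget, leaving the rest for adapters, activations, and the 4k-context KV cache:

```python
# Back-of-the-envelope VRAM estimate (assumes 4-bit weight quantization).
params = 109e9                  # ~109B total parameters (17B active x 16 experts)
weight_gb = params * 0.5 / 1e9  # 4 bits = 0.5 bytes per parameter
headroom_gb = 68.5 - weight_gb  # budget left for adapters/activations/KV cache
print(f"weights: {weight_gb:.1f} GB, headroom: {headroom_gb:.1f} GB")
```

Under that assumption, ~54.5 GB goes to the quantized weights and roughly 14 GB remains for everything else, which is consistent with the 68.5 GB figure above.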