Wing Lian | dd8bad06d0 | remove strict=false from example yamls [skip ci] (#2523) [skip ci] | 2025-04-12 07:25:11 -07:00
Wing Lian | 9f986f5e71 | Add Llama4 maverick examples (#2512) | 2025-04-09 14:01:28 -04:00
Wing Lian | bf9efe2a09 | [llama4] fix the mm yaml, add scout single gpu yaml (#2510) | 2025-04-09 02:52:45 -04:00
  * [llama4] fix the mm yaml, add scout single gpu yaml
  * add README for llama4
  * rename to specify fsdp
Wing Lian | 0dac2ddeac | Llama4 linearized (#2502) | 2025-04-07 20:47:00 -04:00
  * llama4 support for linearized experts
  * clean up fsdp2 sharding to prevent hang
  * add yaml config
  * cleanup example [skip ci]