Files
axolotl/tests/e2e
Dan Saunders 5410195e0b Sequence parallelism quick follow-ups; remove ModelCallback (#2450)
* guard return if ring attn already registered

* add docs link, bits in multi-gpu docs, remove save model callback (subsumed by HF trainers)

* configurable heads_k_stride from ring-flash-attn hf adapter
2025-03-31 09:13:42 -04:00