drop old patches and code that are no longer needed (#3007) [skip ci]

This commit is contained in:
Wing Lian
2025-08-06 08:02:39 -04:00
committed by GitHub
parent 784f8c0e95
commit 97e86c6d47
11 changed files with 7 additions and 1330 deletions

View File

@@ -185,7 +185,6 @@ datasets:
| `flash_attention` | `false` | Use flash attention |
| `flash_attn_cross_entropy` | `false` | Flash attention cross entropy |
| `flash_attn_rms_norm` | `false` | Flash attention RMS norm |
| `flash_attn_fuse_qkv` | `false` | Fuse QKV operations |
| `flash_attn_fuse_mlp` | `false` | Fuse MLP operations |
| `sdp_attention` | `false` | Use scaled dot product |
| `s2_attention` | `false` | Use shifted sparse attention |