Dan Saunders
3d8425fa91
Activation function Triton kernels, LoRA custom autograd functions (#2324)
* LoRA + activation fn Triton kernels: initial commit
* implementing optims
* finalizing MLP LoRA kernels and progress on QKV / W kernels
* updates
* O projection optim
* adding monkey patching logic
* doc strings, typing, pre-commit fixes
* updates
* adding lora 8b kernels example
* working on fsdp support
* tests and fixes
* small fixes, getting tests to pass, adding doc strings
* integration tests for LoRA patching
* config.qmd
* remove unneeded pytest fixture
* fix
* review comments first pass
* improving tests, attention class agnostic patching
* adding support for more archs
* wip SiLU / GELU impls
* improved testing, small updates, etc.
* slightly updating docs
* rebase
* fixing test_attention_patching_integration
* additional review comments, fixing test in CI (hopefully)
* isolating problematic patching test
* relaxing allclose threshold to reduce flakiness
* fixing accidental change
* adding model arch agnostic attention class fetching
* removing unused activations
2025-02-17 14:23:15 -05:00
..
2025-02-13 16:01:41 -05:00
2025-01-29 00:10:19 -05:00
2024-03-21 22:28:36 -07:00
2024-12-10 16:25:25 -05:00
2024-04-04 13:43:40 +09:00
2025-01-30 11:34:02 -05:00
2025-02-17 14:23:15 -05:00
2024-04-16 19:45:46 -04:00
2024-12-10 16:25:25 -05:00
2025-02-12 10:04:26 -05:00
2024-07-11 09:19:29 -04:00
2025-01-30 12:49:21 -05:00
2025-01-30 12:49:21 -05:00
2024-10-02 21:02:48 -04:00
2025-01-30 12:49:21 -05:00
2025-02-17 14:23:15 -05:00
2025-01-24 12:56:28 -05:00
2024-03-21 22:28:36 -07:00
2025-01-30 12:49:21 -05:00
2025-02-08 06:02:02 -05:00
2024-10-02 21:02:48 -04:00
2024-03-21 22:28:36 -07:00
2024-03-21 22:28:36 -07:00
2025-01-29 00:10:19 -05:00
2025-01-29 00:08:33 -05:00
2025-02-13 16:01:41 -05:00
2024-07-19 00:47:07 -04:00
2024-11-20 14:07:54 -05:00