Activation function Triton kernels, LoRA custom autograd functions (#2324)

* LoRA + activation fn Triton kernels: initial commit

* implementing optims

* finalizing MLP LoRA kernels and progress on QKV / W kernels

* updates

* O projection optim

* adding monkey patching logic

* doc strings, typing, pre-commit fixes

* updates

* adding lora 8b kernels example

* working on fsdp support

* tests and fixes

* small fixes, getting tests to pass, adding doc strings

* integration tests for LoRA patching

* config.qmd

* remove unneeded pytest fixture

* fix

* review comments first pass

* improving tests, attention class agnostic patching

* adding support for more archs

* wip SiLU / GELU impls

* improved testing, small updates, etc.

* slightly updating docs

* rebase

* fixing test_attention_patching_integration

* additional review comments, fixing test in CI (hopefully)

* isolating problematic patching test

* relaxing allclose threshold to reduce flakiness

* fixing accidental change

* adding model arch agnostic attention class fetching

* removing unused activations

Dan Saunders, 2025-02-17 14:23:15 -05:00, committed by GitHub

parent 97a2fa2781

commit 3d8425fa91

22 changed files with 3102 additions and 22 deletions

									
cicd/tests.py (4 lines changed)

@@ -1,6 +1,4 @@
-"""
- modal application to run axolotl gpu tests in Modal
- """
+"""Modal app to run axolotl GPU tests"""
 # pylint: disable=duplicate-code
 
 import os
