Fix: add delinearization and make qlora work with fsdp2 (#2515)

* fixes for delinearization, and make qlora work with fsdp2

* Add back mistakenly removed lm_eval

* typo [skip ci]

* patch evals for torch.compile + fsdp2

* also check torch_compile w fsdp2

* lots of fixes for flex attn with llama4

* fix patch check and patch llama4 too

* attempt to make the patches stick

* use transformers 4.51.2

* update configs and README for llama4

* remove torch.compile for CI test

* cleanup any existing singletons

* set singleton cache to None instead of deleting

* use importlib reload with monkeypatch

* don't worry about transformers version, mark inputs with grads, fix regex

* make sure embeds aren't on cpu

* logging and mem improvements

* vllm version and add to docker, make sure to save processor on conversion

* fix ambiguous tensor bool check

* fix vllm to not use v1, upgrade hf transformers

* fix tests

* make flex_attn_compile_kwargs configurable, since this depends on model params

---------

Co-authored-by: Wing Lian <wing@axolotl.ai>
Co-authored-by: Salman Mohammadi <salman.mohammadi@outlook.com>

This commit is contained in:

NanoCode012

2025-04-16 13:31:39 +07:00

committed by

GitHub

parent 271b24cccc

commit 682a9cf79b

26 changed files with 629 additions and 45 deletions

									
										2

.github/workflows/main.yml
									
										vendored
									
												View File
												
				@@ -29,7 +29,7 @@ jobs:

				            cuda_version: 12.4.1

				            python_version: "3.11"

				            pytorch: 2.6.0

				            axolotl_extras:

				            axolotl_extras: vllm

				            is_latest: true

				    runs-on: axolotl-gpu-runner

				    steps:

Fix: add delinearization and make qlora work with fsdp2 (#2515)

2 .github/workflows/main.yml vendored Unescape Escape View File

2

.github/workflows/main.yml vendored

View File