axolotl

Files

Wing Lian bcc78d8fa3 bump transformers and update attention class map name (#1023)

* bump transformers and update attention class map name

* also run the tests in docker

* add mixtral e2e smoke test

* fix base name for docker image in test

* mixtral lora doesn't seem to work, at least check qlora

* add testcase for mixtral w sample packing

* check monkeypatch for flash attn multipack

* also run the e2e tests in docker

* use all gpus to run tests in docker ci

* use privileged mode too for docker w gpus

* rename the docker e2e actions for gh ci

* set privileged mode for docker and update mixtral model self attn check

* use fp16/bf16 for mixtral w fa2

* skip e2e tests on docker w gpus for now

* tests to validate mistral and mixtral patches

* fix rel import

2024-01-03 12:11:04 -08:00

base.yml

support for cuda 12.1 (#989)

2023-12-22 11:08:22 -05:00

main.yml

support for cuda 12.1 (#989)

2023-12-22 11:08:22 -05:00

pypi.yml

fix the sed command to replace the version w the tag

2023-09-11 13:44:19 -04:00

tests-docker.yml

bump transformers and update attention class map name (#1023)

2024-01-03 12:11:04 -08:00

tests.yml

support for mamba (#915)

2023-12-09 12:10:41 -05:00