Wing Lian
db51a9e4cb
use pep440 instead of semver ( #2088 ) [skip ci]
2024-11-19 15:02:10 -05:00
Wing Lian
a77c8a71cf
fix brackets on docker ci builds, add option to skip e2e builds [skip e2e] ( #2080 ) [skip ci]
2024-11-19 10:29:31 -05:00
Wing Lian
5be8e13d35
make sure to add tags for versioned tag on cloud docker images ( #2060 )
2024-11-14 10:24:49 -05:00
Wing Lian
c5eb9ea2c2
fix push to main and tag semver build for docker ci ( #2054 )
2024-11-13 14:04:28 -05:00
Wing Lian
01881c3113
make sure to tag images in docker for tagged releases ( #2051 ) [skip ci]
...
* make sure to tag images in docker for tagged releases
* fix tag event
2024-11-13 13:15:49 -05:00
Wing Lian
f68fb71005
update actions version for node16 deprecation ( #2037 ) [skip ci]
...
* update actions version for node16 deprecation
* update pre-commit/action to use 3.0.1 for actions/cache@v4 dep
* update docker/setup-buildx-action too to v3
2024-11-11 15:09:11 -05:00
Wing Lian
9bc3ee6c75
add axolotlai docker hub org to publish list ( #2031 )
...
* add axolotlai docker hub org to publish list
* fix to use latest actions docker metadata version
* fix list in yaml for expected format for action
* missed a change
2024-11-11 09:48:19 -05:00
Wing Lian
3cb2d75de1
upgrade pytorch to 2.5.1 ( #2024 )
2024-11-08 10:46:24 -05:00
Wing Lian
718cfb2dd1
revert image tagged as main-latest ( #1990 )
2024-10-22 13:54:24 -04:00
Wing Lian
5c629ee444
use torch 2.4.1 images as latest now that torch 2.5.0 is out ( #1987 )
2024-10-21 19:51:06 -04:00
Wing Lian
e12a2130e9
first pass at pytorch 2.5.0 support ( #1982 )
...
* first pass at pytorch 2.5.0 support
* attempt to install causal_conv1d with mamba
* gracefully handle missing xformers
* fix import
* fix incorrect version, add 2.5.0
* increase tests timeout
2024-10-21 11:00:45 -04:00
Wing Lian
e8d3da0081
upgrade pytorch from 2.4.0 => 2.4.1 ( #1950 )
...
* upgrade pytorch from 2.4.0 => 2.4.1
* update xformers for updated pytorch version
* handle xformers version case for torch==2.3.1
2024-10-09 11:53:56 -04:00
Wing Lian
dbf8fb549e
publish axolotl images without extras in the tag name ( #1798 )
2024-07-30 13:36:19 -04:00
Wing Lian
9a63884597
update test and main/nightly builds ( #1797 )
...
* update test and main/nightly builds
* don't install mamba-ssm on 2.4.0 since it has no wheels yet
2024-07-30 12:37:40 -04:00
Wing Lian
1e57b4c562
update to pytorch 2.3.1 ( #1746 ) [skip ci]
2024-07-13 13:28:17 -04:00
Wing Lian
a159724e44
bump trl and accelerate for latest releases ( #1730 )
...
* bump trl and accelerate for latest releases
* ensure that the CI runs on new gh org
* drop kto_pair support since removed upstream
2024-07-10 11:15:44 -04:00
Wing Lian
60113437e4
cloud image w/o tmux ( #1628 )
2024-05-15 22:27:40 -04:00
Wing Lian
3319780300
update torch 2.2.1 -> 2.2.2 ( #1622 )
2024-05-15 09:45:27 -04:00
Wing Lian
70185763f6
add torch 2.3.0 to builds ( #1593 )
2024-05-05 18:45:45 -04:00
Wing Lian
8cb127abeb
configure nightly docker builds ( #1454 ) [skip ci]
...
* configure nightly docker builds
* also test update pytorch in modal ci
2024-03-29 08:25:45 -04:00
Wing Lian
5894f0e57e
make mlflow optional ( #1317 )
...
* make mlflow optional
* fix xformers
don't patch swiglu if xformers not working
fix the check for xformers swiglu
* fix install of xformers with extra index url for docker builds
* fix docker build arg quoting
2024-02-26 11:41:33 -05:00
NanoCode012
a359579371
deprecate: pytorch 2.0.1 image ( #1315 ) [skip ci]
...
* deprecate: pytorch 2.0.1 image
* deprecate from main image
* Update main.yml
* Update tests.yml
2024-02-22 11:39:47 +09:00
Wing Lian
ea00dd0852
don't use load and push together ( #1284 )
2024-02-09 14:54:31 -05:00
Wing Lian
aaf54dc730
run the docker image builds and push on gh action gpu runners ( #1218 )
2024-02-09 10:32:54 -05:00
Wing Lian
74c72ca5eb
drop py39 docker images, add py311, upgrade pytorch to 2.1.2 ( #1205 )
...
* drop py39 docker images, add py311, upgrade pytorch to 2.1.2
* also allow the main build to be manually triggered
* fix workflow_dispatch in yaml
2024-01-26 00:38:49 -05:00
Wing Lian
0f77b8d798
add commit message option to skip docker image builds in ci ( #1168 ) [skip ci]
2024-01-22 19:55:36 -05:00
Wing Lian
ece0211996
Agnostic cloud gpu docker image and Jupyter lab ( #1097 )
2024-01-15 22:37:54 -05:00
Wing Lian
37820f6540
support for cuda 12.1 ( #989 )
2023-12-22 11:08:22 -05:00
Hamel Husain
2e61dc3180
Add tests to Docker ( #993 )
2023-12-22 06:37:20 -08:00
Hamel Husain
62ba1609b6
bump actions versions
2023-12-21 08:54:08 -08:00
Wing Lian
161bcb6517
Dockerfile torch fix ( #987 )
...
* add torch to requirements.txt at build time to force version to stick
* fix xformers check
* better handling of xformers based on installed torch version
* fix for ci w/o torch
2023-12-21 09:38:20 -05:00
Wing Lian
70157ccb8f
add a latest tag for regular axolotl image, cleanup extraneous print statement ( #746 )
2023-10-19 12:28:29 -04:00
Wing Lian
2aa1f71464
fix pytorch 2.1.0 build, add multipack docs ( #722 )
2023-10-13 08:57:28 -04:00
Wing Lian
7f2618b5f4
add docker images for pytorch 2.10 ( #697 )
2023-10-07 12:23:31 -04:00
Wing Lian
9218ebecd2
e2e testing ( #574 )
2023-09-14 21:56:11 -04:00
Wing Lian
3355706e22
Add support for GPTQ using native transformers/peft ( #468 )
...
* auto gptq support
* more tweaks and add yml
* remove old gptq docker
* don't need explicit peft install for tests
* fix setup.py to use extra index url
install torch for tests
fix cuda version for autogptq index
set torch in requirements so that it installs properly
move gptq install around to work with github cicd
* gptq doesn't play well with sample packing
* address pr feedback
* remove torch install for now
* set quantization_config from model config
* Fix the implementation for getting quant config from model config
2023-09-05 12:43:22 -04:00
mhenrichsen
cf6654769a
flash attn pip install ( #426 )
...
* flash attn pip
* add packaging
* add packaging to apt get
* install flash attn in dockerfile
* remove unused whls
* add wheel
* clean up pr
fix packaging requirement for ci
upgrade pip for ci
skip build isolation for requiremnents to get flash-attn working
install flash-attn seperately
* install wheel for ci
* no flash-attn for basic cicd
* install flash-attn as pip extras
---------
Co-authored-by: Ubuntu <mgh@mgh-vm.wsyvwcia0jxedeyrchqg425tpb.ax.internal.cloudapp.net >
Co-authored-by: mhenrichsen <some_email@hey.com >
Co-authored-by: Mads Henrichsen <mads@BrbartiendeMads.lan >
Co-authored-by: Wing Lian <wing.lian@gmail.com >
2023-08-18 19:00:27 -04:00
Wing Lian
d3d6fd6ae6
just resort to tags ans use main-latest ( #424 )
2023-08-16 00:39:57 -04:00
Wing Lian
5f80b3560b
use inputs for image rather than outputs for docker metadata ( #420 )
2023-08-15 18:26:59 -04:00
Wing Lian
7af816699e
tag with latest as well for axolotl-runpod ( #418 )
...
* tag with latest as well for axolotl-runpod
* no dev branch for now
2023-08-15 15:30:41 -04:00
Wing Lian
918f1b0dfb
revert previous change and build ax images w docker on gpu ( #371 )
2023-08-12 20:23:00 -04:00
Wing Lian
c3fde36ada
attempt to run non-base docker builds on regular cpu hosts ( #369 )
2023-08-12 19:07:38 -04:00
Wing Lian
2c37bf6c21
Prune cuda117 ( #327 )
...
* drop cuda117/torch 1.13.1 from support, pin flash attention to v2.0.1, rm torchvision/torchaudio install
* gptq base build not needed. add sm 9.0 support
2023-07-26 16:27:49 -04:00
Wing Lian
ff7f18d1ed
disable gh cache for first step of docker builds too
2023-07-22 11:46:37 -04:00
Wing Lian
cf62cfd661
add runpod envs to .bashrc, fix bnb env ( #316 )
...
* hopper support for base dockerfile, add runpod envs to .bashrc
* set BNB_CUDA_VERSION env for latest bnb
* don't support hopper yet w 118
2023-07-22 10:09:38 -04:00
Wing Lian
c5df969262
don't use the gha cache w docker
2023-07-22 08:46:21 -04:00
Wing Lian
c58034d48c
use pytorch 2.0.1
2023-07-20 00:47:13 -04:00
Wing Lian
a10da1caff
11.7.0 nvidia/cuda docker images are deprecated, move to 11.7.1
ci-cd-base / build-base (<nil>, 117, 11.7.1, 3.9, 1.13.1) (push) Has been cancelled
ci-cd-base / build-base (<nil>, 118, 11.8.0, 3.10, 2.0.0) (push) Has been cancelled
ci-cd-base / build-base (<nil>, 118, 11.8.0, 3.9, 2.0.0) (push) Has been cancelled
ci-cd-base / build-base (gptq, 118, 11.8.0, 3.9, 2.0.0) (push) Has been cancelled
pre-commit / pre-commit (push) Has been cancelled
PyTest / test (3.10) (push) Has been cancelled
PyTest / test (3.9) (push) Has been cancelled
2023-07-01 00:29:07 -04:00
Wing Lian
d35278aaf1
don't fail fast
2023-06-15 16:01:27 -04:00
Wing Lian
e3d03745ba
add py310 support from base image
2023-05-31 09:07:28 -04:00