Wing Lian
dc77b5bf42
fix arm64 builds ( #3355 )
...
* fix syntax for secrets in gha yaml
* setup env for uv too
* arm64 for base uv too
* don't build causal-conv1d or mamba for arm64 and use arm64 wheels
* fix dockerfile syntax
* fix shell syntax
2026-01-14 09:38:48 -05:00
@TT
3e0bbd33ec
feat: add ARM64/AArch64 build support to Dockerfile-base ( #3346 )
...
* Add support for capability to build arm64 image
* Fixing wrong variable TARGETPLATFORM bug
* Adding missing semicolons
* skip docker hub login if PR (no push) or no credentials
* Enabling arm64 builds for Dockerfile-base in Github actions
* TARGETARCH automatically default to platform arch under build
* Enabling arm64 builds for axolotl docker builds
* Enabling arm64 builds for axolotl-cloud docker build Github actions
---------
Co-authored-by: Wing Lian <wing@axolotl.ai>
2026-01-12 12:00:02 -05:00
Wing Lian
afe18ace35
deprecate torch 2.7.1 ( #3339 )
2026-01-01 06:52:45 -05:00
Wing Lian
0d27e14e45
Torch 2.9.1 base images ( #3268 )
...
* update torch 2.9.1 base images
* update base dockerfile image check
2025-11-20 09:04:37 -05:00
Wing Lian
0fbde69e9c
only push axolotl images, personal repo is deprecated ( #3262 )
...
* only push axolotl images, personal repo is deprecated
* cleanup
2025-11-14 07:50:03 -08:00
Wing Lian
a4b921135b
build cuda 13.0.0 base image with 2.9.0 ( #3229 )
...
* build cuda 13.0.0 base image with 2.9.0
* upgrade causal-conv1d
* 1.5.4 not in pypi yet
* pin to 1.3.0
* use github release instead of pypi
* split the logic for incompatible packages
* fix bash in dockerfile
2025-10-29 18:07:29 -04:00
Wing Lian
383f220cfd
build torch 2.9.0 base images ( #3221 )
2025-10-20 08:53:49 -04:00
Wing Lian
409cfb8a87
deprecate torch 2.6.0 support ( #3197 ) [skip ci]
2025-10-07 11:23:41 -04:00
Wing Lian
c5e5aba547
Add 2.8.0 base images and uv images ( #3034 )
2025-08-08 02:30:16 -04:00
salman
1407aac779
Skip CI for draft PRs ( #2970 )
2025-07-24 09:11:46 +01:00
Wing Lian
c6d69d5c1b
release v0.11.0 ( #2875 )
...
* release v0.11.0
* don't build vllm into release for now
* remove 2.5.1 references
* smollm3 multipack support
* fix ordering of e2e tests
2025-07-09 09:22:35 -04:00
Wing Lian
6ed501f6dc
add 2.7.0 torch images back to support vllm ( #2885 )
2025-07-08 16:28:14 -04:00
Wing Lian
a5946ff1f0
build fa2 from source for base image with torch2.6 and cu124 ( #2867 )
2025-07-05 09:21:18 -04:00
Wing Lian
581dd324cc
build base images for torch 2.7.1 ( #2764 )
...
* build base images for torch 2.7.1
* fix: update base docker to use torch 2.7.1
* fix: update doc for main base to use 2.7.1
* make sure to install fa2 in base uv too
* use no build isolation for uv+flashattn
* install psutil also for fa2
* longer timeout for flash attn build
---------
Co-authored-by: NanoCode012 <nano@axolotl.ai>
2025-06-11 17:11:06 -04:00
Wing Lian
ecc719f5c7
add support for base image with uv ( #2691 )
2025-06-02 12:48:55 -07:00
Wing Lian
fedbcc0254
remove torch 2.4.1 CI as part of support deprecation ( #2582 )
2025-04-29 08:28:32 -04:00
Wing Lian
0d691cc2a7
add base docker image with pytorch 2.7.0 and variant for cuda 12.8 ( #2551 )
...
* add base docker image with pytorch 2.7.0 and variant for cuda 12.8
* my bash is terrible
2025-04-23 14:59:03 -04:00
Wing Lian
0bfa180f7d
torch 2.7.0 base image for testing ( #2467 )
2025-04-01 15:38:26 -04:00
Wing Lian
e0aba74dd0
Release update 20250331 ( #2460 ) [skip ci]
...
* make torch 2.6.0 the default image
* fix tests against upstream main
* fix attribute access
* use fixture dataset
* fix dataset load
* correct the fixtures + tests
* more fixtures
* add accidentally removed shakespeare fixture
* fix conversion from unittest to pytest class
* nightly main ci caches
* build 12.6.3 cuda base image
* override for fix from huggingface/transformers#37162
* address PR feedback
2025-04-01 08:47:50 -04:00
Wing Lian
aae4337f40
add 12.8.1 cuda to the base matrix ( #2426 )
...
* add 12.8.1 cuda to the base matrix
* use nightly
* bump deepspeed and set no binary
* deepspeed binary fixes hopefully
* install deepspeed by itself
* multiline fix
* make sure ninja is installed
* try with reversion of packaging/setuptools/wheel install
* use license instead of license-file
* try rolling back packaging and setuptools versions
* comment out license for validation for now
* make sure packaging version is consistent
* more parity across tests and docker images for packaging/setuptools
2025-03-21 10:17:25 -04:00
NanoCode012
fd8cb32547
chore: remove redundant py310 from tests ( #2316 )
2025-02-07 21:34:16 -05:00
Wing Lian
a971eb4ce6
Torch 2.6 support for base docker image ( #2312 )
2025-02-05 09:24:02 -05:00
salman
c071a530f7
removing 2.3.1 ( #2294 )
2025-01-28 23:23:44 -05:00
Wing Lian
a4f4a56d77
build causal_conv1d and mamba-ssm into the base image ( #2113 )
...
* build causal_conv1d and mamba-ssm into the base image
* also build base images on changes to Dockerfile-base and base workflow yaml
2024-12-02 18:27:46 -05:00
Wing Lian
ba219b51a5
fix duplicate base build ( #2061 ) [skip ci]
2024-11-14 10:31:19 -05:00
Wing Lian
f68fb71005
update actions version for node16 deprecation ( #2037 ) [skip ci]
...
* update actions version for node16 deprecation
* update pre-commit/action to use 3.0.1 for actions/cache@v4 dep
* update docker/setup-buildx-action too to v3
2024-11-11 15:09:11 -05:00
Wing Lian
9bc3ee6c75
add axolotlai docker hub org to publish list ( #2031 )
...
* add axolotlai docker hub org to publish list
* fix to use latest actions docker metadata version
* fix list in yaml for expected format for action
* missed a change
2024-11-11 09:48:19 -05:00
Wing Lian
3591bcfaf9
add torch 2.5.1 for base image ( #2010 )
2024-10-31 13:27:49 -04:00
Wing Lian
67f744dc8c
add pytorch 2.5.0 base images ( #1979 )
...
* add pytorch 2.5.0 base images
* make sure num examples for debug is zero and fix comparison
2024-10-18 03:36:51 -04:00
Wing Lian
e8d3da0081
upgrade pytorch from 2.4.0 => 2.4.1 ( #1950 )
...
* upgrade pytorch from 2.4.0 => 2.4.1
* update xformers for updated pytorch version
* handle xformers version case for torch==2.3.1
2024-10-09 11:53:56 -04:00
Wing Lian
4ca0a47cfb
add 2.4.1 to base models ( #1953 )
2024-10-09 08:43:11 -04:00
Wing Lian
c5587b45ac
use 12.4.1 instead of 12.4 [skip-ci] ( #1796 )
2024-07-30 08:50:23 -04:00
Wing Lian
d4f6a6b103
fix dockerfile and base builder ( #1795 ) [skip-ci]
2024-07-30 08:34:37 -04:00
Wing Lian
d8d1788ffc
move to supporting mostly 12.1 w 2.3.1 and add new 12.4 with 2.4.0 ( #1793 )
2024-07-30 08:06:11 -04:00
Wing Lian
137d84d1b4
add torch 2.3.1 base image ( #1745 )
2024-07-13 09:41:51 -04:00
Wing Lian
a159724e44
bump trl and accelerate for latest releases ( #1730 )
...
* bump trl and accelerate for latest releases
* ensure that the CI runs on new gh org
* drop kto_pair support since removed upstream
2024-07-10 11:15:44 -04:00
Wing Lian
3319780300
update torch 2.2.1 -> 2.2.2 ( #1622 )
2024-05-15 09:45:27 -04:00
Wing Lian
70185763f6
add torch 2.3.0 to builds ( #1593 )
2024-05-05 18:45:45 -04:00
Wing Lian
da265dd796
fix for accelerate env var for auto bf16, add new base image and expand torch_cuda_arch_list support ( #1413 )
2024-03-26 16:46:19 -04:00
NanoCode012
a359579371
deprecate: pytorch 2.0.1 image ( #1315 ) [skip ci]
...
* deprecate: pytorch 2.0.1 image
* deprecate from main image
* Update main.yml
* Update tests.yml
2024-02-22 11:39:47 +09:00
Wing Lian
aaf54dc730
run the docker image builds and push on gh action gpu runners ( #1218 )
2024-02-09 10:32:54 -05:00
Wing Lian
74c72ca5eb
drop py39 docker images, add py311, upgrade pytorch to 2.1.2 ( #1205 )
...
* drop py39 docker images, add py311, upgrade pytorch to 2.1.2
* also allow the main build to be manually triggered
* fix workflow_dispatch in yaml
2024-01-26 00:38:49 -05:00
Wing Lian
37820f6540
support for cuda 12.1 ( #989 )
2023-12-22 11:08:22 -05:00
Wing Lian
161bcb6517
Dockerfile torch fix ( #987 )
...
* add torch to requirements.txt at build time to force version to stick
* fix xformers check
* better handling of xformers based on installed torch version
* fix for ci w/o torch
2023-12-21 09:38:20 -05:00
Wing Lian
7f2618b5f4
add docker images for pytorch 2.1.0 ( #697 )
2023-10-07 12:23:31 -04:00
Wing Lian
2c37bf6c21
Prune cuda117 ( #327 )
...
* drop cuda117/torch 1.13.1 from support, pin flash attention to v2.0.1, rm torchvision/torchaudio install
* gptq base build not needed. add sm 9.0 support
2023-07-26 16:27:49 -04:00
Wing Lian
c5df969262
don't use the gha cache w docker
2023-07-22 08:46:21 -04:00
Wing Lian
c58034d48c
use pytorch 2.0.1
2023-07-20 00:47:13 -04:00
Wing Lian
a10da1caff
11.7.0 nvidia/cuda docker images are deprecated, move to 11.7.1
2023-07-01 00:29:07 -04:00
Wing Lian
d35278aaf1
don't fail fast
2023-06-15 16:01:27 -04:00