axolotl

Author	SHA1	Message	Date
Wing Lian	6d4bbb877f	deprecate py 3.9 support, set min pytorch version (#1343 ) [skip ci]	2024-02-28 12:58:05 -05:00
Wing Lian	5894f0e57e	make mlflow optional (#1317 ) * make mlflow optional * fix xformers don't patch swiglu if xformers not working fix the check for xformers swiglu * fix install of xformers with extra index url for docker builds * fix docker build arg quoting	2024-02-26 11:41:33 -05:00
Wing Lian	d113331e9a	add a helpful motd for cloud image (#1235 ) [skip ci]	2024-01-31 10:26:02 -05:00
Wing Lian	8da1633124	Revert "run PR e2e docker CI tests in Modal" (#1220 ) [skip ci]	2024-01-26 16:50:44 -05:00
Wing Lian	36d053f6f0	run PR e2e docker CI tests in Modal (#1217 ) [skip ci] * wip modal for ci * handle falcon layernorms better * update * rebuild the template each time with the pseudo-ARGS * fix ref * update tests to use modal * cleanup ci script * make sure to install jinja2 also * kickoff the gh action on gh hosted runners and specify num gpus	2024-01-26 16:13:27 -05:00
Wing Lian	8a49309489	upgrade deepspeed to 0.13.1 for mixtral fixes (#1189 ) [skip ci] * upgrade deepspeed to 0.13.1 for mixtral fixes * move deepspeed-kernels install to setup.py	2024-01-24 14:26:40 -05:00
Wing Lian	eaaeefce55	jupyter lab fixes (#1139 ) [skip ci] * add a basic notebook for lab users in the root * update notebook and fix cors for jupyter * cell is code * fix eval batch size check * remove intro notebook	2024-01-22 18:42:40 -05:00
Wing Lian	729740df81	Dockerfile cloud ports (#1148 ) * explicitly expose ports 8888 and 22 * support for SSH_KEY from latitude	2024-01-18 22:04:25 -05:00
Wing Lian	ece0211996	Agnostic cloud gpu docker image and Jupyter lab (#1097 )	2024-01-15 22:37:54 -05:00
Wing Lian	23495a80af	misc fixes from #943 (#1086 ) [skip ci]	2024-01-10 22:31:36 -05:00
NanoCode012	d69ba2b0b7	fix: warn user to install mamba_ssm package (#1019 )	2024-01-10 02:50:56 -05:00
Wing Lian	788649fe95	attempt to also run e2e tests that needs gpus (#1070 ) * attempt to also run e2e tests that needs gpus * fix stray quote * checkout specific github ref * dockerfile for tests with proper checkout ensure wandb is disabled for docker pytests clear wandb env after testing clear wandb env after testing make sure to provide a default val for pop try skipping wandb validation tests explicitly disable wandb in the e2e tests explicitly report_to None to see if that fixes the docker e2e tests split gpu from non-gpu unit tests skip bf16 check in test for now build docker w/o cache since it uses branch name ref revert some changes now that caching is fixed skip bf16 check if on gpu w support * pytest skip for auto-gptq requirements * skip mamba tests for now, split multipack and non packed lora llama tests * split tests that use monkeypatches * fix relative import for prev commit * move other tests using monkeypatches to the correct run	2024-01-09 21:23:23 -05:00
Hamel Husain	2e61dc3180	Add tests to Docker (#993 )	2023-12-22 06:37:20 -08:00
Wing Lian	161bcb6517	Dockerfile torch fix (#987 ) * add torch to requirements.txt at build time to force version to stick * fix xformers check * better handling of xformers based on installed torch version * fix for ci w/o torch	2023-12-21 09:38:20 -05:00
Wing Lian	85de004dd4	fix for build for nccl in dockerfile (#970 )	2023-12-16 19:12:01 -05:00
Wing Lian	80ec7af358	update to latest nccl in docker image (#965 )	2023-12-16 18:31:25 -05:00
Wing Lian	68b227a7d8	Mixtral multipack (#928 ) * mixtral multipack * use mixtral model * sample yml * calculate cu_seqlens properly * use updated flash attention setting * attn var checks * force use of flash attention 2 for packing * lint * disable future fix for now * update support table	2023-12-09 21:26:30 -05:00
Wing Lian	f544ab2bed	don't compile deepspeed or bitsandbytes from source (#837 )	2023-11-08 19:49:55 -05:00
Fabian Preiß	8056ecd30e	add deepspeed-kernels dependency for deepspeed>=0.12.0 (#827 )	2023-11-05 07:52:56 -05:00
Wing Lian	2aa1f71464	fix pytorch 2.1.0 build, add multipack docs (#722 )	2023-10-13 08:57:28 -04:00
Wing Lian	aca0398315	apex not needed as amp is part of pytorch (#696 )	2023-10-07 12:20:45 -04:00
Wing Lian	de87ea68f6	fix multiline for docker (#694 )	2023-10-06 22:38:15 -04:00
NanoCode012	133e676bcc	Feat: Set WORKDIR to /workspace/axolotl (#679 )	2023-10-06 04:09:14 +09:00
Maxime	923eb91304	tweak: improve base builder for smaller layers (#500 )	2023-09-22 16:17:50 -04:00
Wing Lian	e85d2eb06b	let MAX_JOBS use the default since we're not resource constrained on our self-hosted runners (#427 )	2023-09-21 20:36:30 -04:00
Wing Lian	b53e77775b	update dockerfile to not build evoformer since it fails the build (#607 )	2023-09-19 16:28:29 -04:00
Wing Lian	34c0a86a11	update readme to point to direct link to runpod template, cleanup install instructions (#532 ) * update readme to point to direct link to runpod template, cleanup install instructions * default install flash-attn and auto-gptq now too * update readme w flash-attn extra * fix version in setup	2023-09-08 11:58:54 -04:00
Wing Lian	3355706e22	Add support for GPTQ using native transformers/peft (#468 ) * auto gptq support * more tweaks and add yml * remove old gptq docker * don't need explicit peft install for tests * fix setup.py to use extra index url install torch for tests fix cuda version for autogptq index set torch in requirements so that it installs properly move gptq install around to work with github cicd * gptq doesn't play well with sample packing * address pr feedback * remove torch install for now * set quantization_config from model config * Fix the implementation for getting quant config from model config	2023-09-05 12:43:22 -04:00
Aman Gupta Karmani	e356b297cb	remove --force-reinstall from Dockerfile to ensure correct pytorch version (#492 )	2023-08-29 06:17:51 -07:00
mhenrichsen	cf6654769a	flash attn pip install (#426 ) * flash attn pip * add packaging * add packaging to apt get * install flash attn in dockerfile * remove unused whls * add wheel * clean up pr fix packaging requirement for ci upgrade pip for ci skip build isolation for requirements to get flash-attn working install flash-attn separately * install wheel for ci * no flash-attn for basic cicd * install flash-attn as pip extras --------- Co-authored-by: Ubuntu <mgh@mgh-vm.wsyvwcia0jxedeyrchqg425tpb.ax.internal.cloudapp.net> Co-authored-by: mhenrichsen <some_email@hey.com> Co-authored-by: Mads Henrichsen <mads@BrbartiendeMads.lan> Co-authored-by: Wing Lian <wing.lian@gmail.com>	2023-08-18 19:00:27 -04:00
Wing Lian	ffac902c1b	bump flash-attn to 2.0.4 for the base docker image (#382 )	2023-08-13 17:55:04 -04:00
Wing Lian	db2a3586f3	add peft install back since it doesn't get installed by setup.py (#331 )	2023-07-31 16:31:53 -04:00
Wing Lian	6c9a87c8ee	pin accelerate so it works with llama2 (#330 )	2023-07-30 22:20:06 -04:00
Wing Lian	2c37bf6c21	Prune cuda117 (#327 ) * drop cuda117/torch 1.13.1 from support, pin flash attention to v2.0.1, rm torchvision/torchaudio install * gptq base build not needed. add sm 9.0 support	2023-07-26 16:27:49 -04:00
Wing Lian	cf62cfd661	add runpod envs to .bashrc, fix bnb env (#316 ) * hopper support for base dockerfile, add runpod envs to .bashrc * set BNB_CUDA_VERSION env for latest bnb * don't support hopper yet w 118	2023-07-22 10:09:38 -04:00
Wing Lian	cdf85fdbd5	pin flash attention 2 to the fix for backwards pass	2023-07-21 08:18:53 -04:00
Wing Lian	9b790d359b	flash attention 2	2023-07-21 08:17:46 -04:00
Wing Lian	b06d3e3645	explicitly pin flash attention 1 to v1.0.9	2023-07-20 01:02:08 -04:00
Wing Lian	d75adb9835	misc fixes	2023-07-17 03:00:27 -04:00
Wing Lian	f162f3c7cc	set transformers cache env var in docker image	2023-07-16 23:03:54 -04:00
Wing Lian	eca3531329	git fetch fix for docker	2023-07-16 22:25:05 -04:00
Wing Lian	71456955f5	pin pydantic so deepspeed isn't broken	2023-07-02 22:26:51 -04:00
Wing Lian	530809fd74	update pip install command for apex	2023-06-28 22:36:28 -04:00
Wing Lian	5cd2126439	shallow clone	2023-06-02 14:54:28 -04:00
Wing Lian	12620f3089	clone in docker	2023-06-02 14:52:50 -04:00
Wing Lian	c43c5c84ff	py310, fix cuda arg in deepspeed	2023-05-30 18:02:34 -04:00
Wing Lian	bbc5bc5791	Merge pull request #108 from OpenAccess-AI-Collective/docker-gptq default to qlora support, make gptq specific image	2023-05-30 15:07:04 -04:00
NanoCode012	392dfd9b07	Lint and format	2023-05-31 02:53:22 +09:00
Wing Lian	48612f8376	cleanup from pr feedback	2023-05-30 09:56:30 -04:00
Wing Lian	6ef96f569b	default to qlora support, make gptq specific image	2023-05-29 20:34:41 -04:00

81 Commits