Commit Graph

27 Commits

Wing Lian
2bb0b78975 Attention mask and position id fixes for packing (#285)
* fix attention mask with packing

* set position ids and use block diagonal attn mask

* fix expand mask for multiple batch items, make sure we pad position_ids

* don't move masks to cpu

* use multi pack dataloader w random sampler

* add position_ids back

* more fixes for dataloader integration

* est total tokens, fix field loop

* more fixes, position_ids seems broken

* more fixes for sample packing

* use distributed sampler, avoid accelerate prepare

* use accelerator prepare for dataloader

* fix for position_ids w packing

* Update src/axolotl/utils/dataloader.py

* validation for sample packing and doc

* more fixes for 4k and optimizations

* optimized expand mask fn

* better handling of variance in multipack dataloader length and trainer hanging when it runs out of data

* fix rounding of len of batches to int

* better handling so that all devices have the same dataloader len

* fix step calc for packing

* pass sample packing efficiency to training args

* add a test for the mask expansion for sequence packing

* only process eval dataset for packing if not None

* don't split batches when packing

* weighted CE losses

* weighted CEL fixes

* limit packing to sequences of max seq len

* seq_len_multiple for packing

* make sure the chunk size is an int

* sample_packing_seq_len_multiplier config

* use cumulative seq len with var len flash attn v2 w packing

* properly calculate max len

* fix flash-attn, xformers, packing, support chatml

* fix chatml system prompt for openorca, legacy tokenizer opts

* add chatml

* add unit tests for cum seq lens, add ability to build cu_seq_lens from positional ids (see the sketch after this commit), fix prompt test

* fix test and pylint checks

* more packing and dataset optimizations and fixes

* filter w multiple cpus

* more fixes and optimizations

* fixes and go back to distributed sampler since batch sampler won't work

* fix counts by accounting for num devices

* fix steps calculation

* previous accelerate is still most performant

* add numba to requirements

* use custom distributed checks

* fix sampler to prevent overfit w new epochs

* let's not clean up the cached datasets

* calculate cum seq lens with pos_ids instead of mask, simplify packing params, fix distributed barrier

* speed optimizations and set accelerate fsdp env vars

* optimize dataset concatenation?

* more optimizations for dataset handling

* fix import for annotation

* manual pre-commit fixes

* another sum optimization and bug fix for calc steps

* fix packing estimations

* fix formatting

* pylint problems

* add back flash attention branch for handling unpacked sequences separately

* Address PR feedback

* add optional sample packing config params to readme
2023-08-12 15:14:56 -04:00
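
Several of the bullets above (block diagonal attn mask, cu_seq_lens from positional ids) revolve around one trick: position_ids restart at 0 for each packed sub-sequence, so both the cumulative sequence lengths that flash-attn v2's varlen kernels expect and a block-diagonal attention mask can be recovered from them. A minimal sketch, assuming 1-D unbatched inputs; the function names are illustrative, not axolotl's actual API:

```python
# Hedged sketch, not axolotl's actual code: deriving flash-attn varlen
# cu_seqlens and a block-diagonal attention mask from packed position_ids,
# which restart at 0 for every packed sub-sequence.
import torch


def cu_seqlens_from_position_ids(position_ids: torch.Tensor) -> torch.Tensor:
    """Cumulative sequence lengths for a 1-D packed sequence.

    e.g. position_ids [0, 1, 2, 0, 1, 0, 1, 2, 3] packs sub-sequences of
    lengths 3, 2, 4 and yields cu_seqlens [0, 3, 5, 9].
    """
    # Every reset of position_ids to 0 marks a sub-sequence start.
    starts = (position_ids == 0).nonzero(as_tuple=True)[0]
    end = torch.tensor([position_ids.numel()], dtype=starts.dtype)
    return torch.cat([starts, end]).to(torch.int32)


def block_diagonal_mask(position_ids: torch.Tensor) -> torch.Tensor:
    """Boolean mask that is True only within each packed sub-sequence."""
    cu = cu_seqlens_from_position_ids(position_ids)
    seq_ids = torch.zeros(position_ids.numel(), dtype=torch.long)
    for i in range(len(cu) - 1):
        seq_ids[cu[i] : cu[i + 1]] = i
    # Queries and keys may only attend within the same sub-sequence;
    # a causal mask would be AND-ed on top of this.
    return seq_ids.unsqueeze(0) == seq_ids.unsqueeze(1)


pos = torch.tensor([0, 1, 2, 0, 1, 0, 1, 2, 3])
print(cu_seqlens_from_position_ids(pos))  # tensor([0, 3, 5, 9], dtype=torch.int32)
print(block_diagonal_mask(pos).int())     # 3x3, 2x2, 4x4 blocks of ones
```

Deriving cu_seqlens from position_ids rather than from the attention mask (as the "calculate cum seq lens with pos_ids instead of mask" bullet describes) avoids materializing the full N×N expanded mask whenever the varlen kernel path is available.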
Aman Gupta Karmani
35c8b90306 Merge pull request #355 from tmm1/bitsandbytes-fixes
bump to latest bitsandbytes release with major bug fixes
2023-08-11 15:15:38 -07:00
Aman Karmani
fce40aab23 bump to latest bitsandbytes release with major bug fixes 2023-08-09 21:47:11 +00:00
Aman Karmani
9c314101d5 use newer pynvml package 2023-08-09 21:06:28 +00:00
Aman Karmani
e303d64728 log GPU memory usage 2023-08-09 18:26:28 +00:00
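
For reference, GPU memory logging via pynvml (provided by the newer nvidia-ml-py package mentioned above) generally looks like the following; a hedged sketch with a hypothetical helper name, not the commit's actual code:

```python
# Hedged sketch using the pynvml bindings; the helper name log_gpu_memory
# is hypothetical, not axolotl's actual function.
import pynvml


def log_gpu_memory(device_index: int = 0) -> None:
    pynvml.nvmlInit()
    handle = pynvml.nvmlDeviceGetHandleByIndex(device_index)
    info = pynvml.nvmlDeviceGetMemoryInfo(handle)  # bytes: .used / .free / .total
    print(
        f"GPU {device_index}: "
        f"{info.used / 2**30:.2f} GiB used / {info.total / 2**30:.2f} GiB total"
    )
    pynvml.nvmlShutdown()


log_gpu_memory()
```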
Wing Lian
6c9a87c8ee pin accelerate so it works with llama2 (#330) 2023-07-30 22:20:06 -04:00
Wing Lian
9f69c4d8c1 latest HEAD of accelerate causes 0 loss immediately w FSDP (#321) 2023-07-24 11:23:56 -04:00
Wing Lian
6dd2e7d671 add hf_transfer to requirements for faster hf upload 2023-07-17 14:44:48 -04:00
Teknium
273b3a3aa7 Update requirements.txt
Require latest git accelerate to fix saving checkpoint issue
2023-07-16 10:24:24 -07:00
Wing Lian
1edc30c786 add support for optimum bettertransformers 2023-06-10 14:22:30 -04:00
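
optimum's BetterTransformer API is a one-call model transform; a minimal sketch of the kind of integration this commit refers to (the model id is illustrative, and this is not axolotl's actual wiring):

```python
# Hedged sketch of an optimum BetterTransformer conversion; the model id
# is illustrative.
from optimum.bettertransformer import BetterTransformer
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("gpt2")
# Swap supported modules for fused, memory-efficient attention kernels.
model = BetterTransformer.transform(model)
```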
Wing Lian
36ec6e1a0e Add accelerate dep 2023-05-30 16:36:13 -04:00
NanoCode012
1bf1f59a41 Move black to dev requirements 2023-05-31 02:53:53 +09:00
NanoCode012
bdfe7c9201 Convert attrdict to addict 2023-05-28 23:06:10 +09:00
Wing Lian
312b8d51d6 update docker to compile latest bnb to properly support qlora 2023-05-27 12:36:53 -04:00
Wing Lian
7e81ca720b Update requirements.txt
Co-authored-by: NanoCode012 <kevinvong@rocketmail.com>
2023-05-24 15:44:48 -04:00
Wing Lian
3b4d055edd integrate qlora? maybe? 2023-05-24 14:32:39 -04:00
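
QLoRA means loading the frozen base model 4-bit quantized through bitsandbytes and training LoRA adapters on top; a minimal sketch of the loading half, assuming the transformers BitsAndBytesConfig API (model id illustrative, not axolotl's actual integration):

```python
# Hedged sketch of 4-bit QLoRA-style loading via bitsandbytes.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",             # NF4 quantization from the QLoRA paper
    bnb_4bit_use_double_quant=True,        # also quantize the quantization constants
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    "huggyllama/llama-7b",  # illustrative model id
    quantization_config=bnb_config,
    device_map="auto",
)
# LoRA adapters (e.g. via peft) would then be attached to the frozen 4-bit base.
```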
Wing Lian
fa8bd14be4 update entrypoint and force min accelerate 2023-05-18 06:25:34 -04:00
NanoCode012
fe582df7d3 Fix BNB OOM by pinning version 2023-05-09 02:10:31 +09:00
Wing Lian
990bec63e6 docker layer caching, build w axolotl from base build 2023-05-07 17:16:05 -04:00
Wing Lian
7753cdee57 cleanup empty lines, tweak env for runpod setup 2023-04-19 08:24:58 -04:00
Wing Lian
0a472e1e08 quickstart instructions for starting from runpod (#5) 2023-04-18 19:22:25 -04:00
Wing Lian
4131183115 fix install to work with latest alpaca lora 4bit 2023-04-17 12:45:12 -04:00
Wing Lian
77fca25f1b 4bit quantized support (wip) 2023-04-17 11:37:39 -04:00
Wing Lian
937f44f021 helpful info output 2023-04-15 00:03:43 -04:00
Wing Lian
80b2ed29d8 various bugfixes 2023-04-14 21:37:07 -04:00
Wing Lian
f2a2029d0d config chooser, update readme instructions, device config, llama flash attention, debug out the labels, fix config key checks, other bugfixes 2023-04-14 12:18:56 -04:00
Wing Lian
ce24f5e246 WIP for axolotl trainer 2023-04-14 00:20:05 -04:00