axolotl

Author	SHA1	Message	Date
Wing Lian	010d0e7ff3	retry flaky test_packing_stream_dataset test that timesout on read (#2052 ) [skip ci]	2024-11-13 13:16:16 -05:00
Wing Lian	98af5388ba	bump flash attention 2.5.8 -> 2.6.1 (#1738 ) * bump flash attention 2.5.8 -> 2.6.1 * use triton implementation of cross entropy from flash attn * add smoke test for flash attn cross entropy patch * fix args to xentropy.apply * handle tuple from triton loss fn * ensure the patch tests run independently * use the wrapper already built into flash attn for cross entropy * mark pytest as forked for patches * use pytest xdist instead of forked, since cuda doesn't like forking * limit to 1 process and use dist loadfile for pytest * change up pytest for fixture to reload transformers w monkeypathc	2024-07-14 19:11:31 -04:00
Wing Lian	a653392287	use requirements file for tests	2023-05-27 12:17:46 -04:00
Wing Lian	403af0b1d7	fix path and streamline pip installs	2023-05-27 11:58:37 -04:00
Wing Lian	d199d6c261	automated testing in github actions	2023-05-27 11:51:01 -04:00