* bump flash attention 2.5.8 -> 2.6.1
* use triton implementation of cross entropy from flash attn
* add smoke test for flash attn cross entropy patch
* fix args to xentropy.apply
* handle tuple from triton loss fn
* ensure the patch tests run independently
* use the wrapper already built into flash attn for cross entropy (sketch after this list)
* mark pytest as forked for patches
* use pytest-xdist instead of forked, since CUDA doesn't like forking
* limit to 1 process and use dist loadfile for pytest
* change up pytest fixture to reload transformers with monkeypatch (sketch below)
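The final approach here is to use the `CrossEntropyLoss` wrapper that ships with flash-attn instead of calling the raw Triton loss function (which, per the commits above, returns a tuple that then has to be unpacked by hand). A minimal sketch of what such a patch looks like, assuming the `flash_attn.losses.cross_entropy` module and the llama modeling path; the exact patch site in this PR may differ:

```python
# Minimal sketch, not the exact patch in this PR: swap the HF llama
# module's CrossEntropyLoss for flash-attn's Triton-backed wrapper.
# Module paths here are assumptions based on the commit messages.
from flash_attn.losses.cross_entropy import CrossEntropyLoss as FlashCrossEntropyLoss
import transformers.models.llama.modeling_llama as modeling_llama


def patch_llama_cross_entropy():
    # The wrapper unpacks the (loss, z_loss) tuple returned by the raw
    # Triton loss function, so callers see a plain loss tensor and no
    # extra tuple handling is needed at the call site.
    modeling_llama.CrossEntropyLoss = FlashCrossEntropyLoss
```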
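Because the patch mutates module-level state in transformers, each patch test needs to start from an unpatched module. A rough sketch of the fixture idea, combining a reload with pytest's `monkeypatch` so the patch is automatically undone per test; fixture and attribute names are illustrative, not the ones in this PR:

```python
import importlib

import pytest


@pytest.fixture()
def patched_cross_entropy(monkeypatch):
    # Illustrative fixture: reload first so the test always starts from
    # the unpatched module, then let monkeypatch apply (and undo) the patch.
    import transformers.models.llama.modeling_llama as modeling_llama
    from flash_attn.losses.cross_entropy import CrossEntropyLoss

    importlib.reload(modeling_llama)
    monkeypatch.setattr(modeling_llama, "CrossEntropyLoss", CrossEntropyLoss)
    yield modeling_llama
```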
Test dependencies touched in the diff: `pytest`, `pytest-xdist`.
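Since CUDA contexts don't survive `fork()`, pytest-forked was dropped in favor of pytest-xdist, pinned to a single worker with file-based test distribution so tests from the same file run in the same process, e.g. `pytest -n1 --dist loadfile tests/` (test path illustrative).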