Dan Saunders
c907ac173e
adding pre-commit auto-update GH action and bumping plugin versions ( #2428 )
...
* adding pre-commit auto-update GH action and bumping plugin versions
* running updated pre-commit plugins
* sorry to revert, but pylint complained
* Update .pre-commit-config.yaml
Co-authored-by: Wing Lian <wing.lian@gmail.com >
---------
Co-authored-by: Dan Saunders <dan@axolotl.ai >
Co-authored-by: Wing Lian <wing.lian@gmail.com >
2025-03-21 11:02:43 -04:00
Wing Lian
a1790f2652
replace tensorboard checks with helper function ( #2120 ) [skip ci]
...
* replace tensorboard checks with helper function
* move helper function
* use relative
2024-12-03 21:06:20 -05:00
Wing Lian
6e0fb4a6b2
add finetome dataset to fixtures, check eval_loss in test ( #2106 ) [skip ci]
...
* add finetome dataset to fixtures, check eval_loss in test
* add qwen 0.5b to pytest session fixture
2024-11-29 20:37:32 -05:00
Wing Lian
71d4030b79
gradient accumulation tests, embeddings w pad_token fix, smaller models ( #2059 )
...
* add more test cases for gradient accumulation and fix zero3
* swap out for smaller model
* fix missing return
* fix missing pad_token in config
* support concurrency for multigpu testing
* cast empty deepspeed to empty string for zero3 check
* fix temp_dir as fixture so parametrize works properly
* fix test file for multigpu evals
* don't use default
* don't use default for fsdp_state_dict_type
* don't use llama tokenizer w smollm
* also automatically cancel multigpu for concurrency
2024-11-14 12:59:00 -05:00
Sunny Liu
f62e23737b
memoize dataset length for eval sample packing ( #1974 )
...
* wip on multimodal sample packing support
* wip on multimodal packing support
* llama-1b-yml
* setup logging for test
* yml
* yml
* yml
* fix for __len__ for eval sample packing
* reverted irrelavant changes
* reformatted, reverted log message
* reverted unnecessary changes
* added e2e multigpu testing for eval sample packing
* formatting
* fixed e2e test_eval params
* fix test_eval e2e multigpu
* fix test_eval e2e multigpu
* Update tests/e2e/multigpu/test_eval.py
Co-authored-by: Wing Lian <wing.lian@gmail.com >
* Update tests/e2e/multigpu/test_eval.py
Co-authored-by: Wing Lian <wing.lian@gmail.com >
---------
Co-authored-by: Wing Lian <wing.lian@gmail.com >
2024-10-17 15:15:29 -04:00