axolotl/requirements.txt at fc8766e502dee8d26c0eef835818d8a390ebf574

Wing Lian 76576323df add eval benchmark callback (#441)

* add mmlu callback

* use hf dataset for mmlu evals

* default to mmlu-zs

* make sure to define all the explicit positional args

* include metrics in callback

* another callback fix for collator max len attribute

* fix mmlu evals

* sample benchmarks, ensure we drop long samples

* fix the data file

* fix elif and add better messaging

* more fixes

* rename mmlu to bench

* more fixes

* dataset handling and aggregate across benchmark

* better handling when no subjects

* benchmark callback has its own dataloader and collator

* fixes

* updated dataset

* more fixes

* missing transformers import

* improve support for customized dataset for bench evals

* gather benchmarks from all ranks

* fix for gather across multiple gpus

2023-08-29 13:24:19 -07:00

500 B
