* add finetome dataset to fixtures, check eval_loss in test * add qwen 0.5b to pytest session fixture