* use smaller pretrained models for ci * more steps for loss check * fix tests * more train steps * fix losses