Split up llama model loading so the config can be loaded from the base config and models can be loaded from a path
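
A minimal sketch of the split, not the actual implementation. It assumes the config lives in a `config.json` under the base config location and the weights live in a `weights.bin` under a separate model path. `ModelConfig`, `Model`, `load_config`, `load_model`, and both file names are hypothetical and chosen only for illustration.

```python
import json
from dataclasses import dataclass
from pathlib import Path


@dataclass
class ModelConfig:
    # Hypothetical stand-in for a llama config; field names are illustrative.
    hidden_size: int
    num_layers: int


@dataclass
class Model:
    # Hypothetical stand-in for a loaded model: a config plus raw weight bytes.
    config: ModelConfig
    weights: bytes


def load_config(base_config: str) -> ModelConfig:
    """Step 1: read the config from the base config location only."""
    data = json.loads((Path(base_config) / "config.json").read_text())
    return ModelConfig(hidden_size=data["hidden_size"], num_layers=data["num_layers"])


def load_model(model_path: str, config: ModelConfig) -> Model:
    """Step 2: read weights from a separate path, reusing the config from step 1."""
    weights = (Path(model_path) / "weights.bin").read_bytes()
    return Model(config=config, weights=weights)
```

Keeping the two steps separate means the config source and the weights path no longer have to be the same location, e.g. `load_model(model_dir, load_config(base_config_dir))`.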