You can bring your own evaluation to Prem. This is useful if you have a custom evaluation that you want to use for your model.You may be coming from another platform like Modal and want to bring your own evaluation methods to Prem.
The Prem Studio server sends your evaluation endpoint three key components:
Each datapoint from your selected snapshot
The model names being tested
The corresponding responses
Your endpoint should return a numerical score and reasoning for each evaluation.The evaluation resultsโincluding scores and any additional metadataโwill be displayed in the final dashboard. This approach allows you to conduct comprehensive in-house evaluations using Premโs infrastructure and models while keeping your proprietary evaluation logic secure
In order to start evaluation by bringing your own evaluator to Prem, you need to create an evaluation endpoint on the service you are using, which Prem Studio will ping with the selected dataset you want to evaluate.You can see here for more details: