Start Here

You can only evaluate a model if you have a snapshot of your dataset.
1. Create a New Evaluation

Gif of creating a new evaluation
2. Browse and Choose Metrics

Image of filling out the fields
3. Check Your Results

Once your evaluation is done, you can check your results.

Image of checking your results

You can click on each metric to organize the results. To get more details on each datapoint, click on the percentage under the model name.

Image of checking your results

The results will look something like this:

Image of checking your details

Click the Model Results tab to see additional details of the evaluation by model and metric.

Image of checking your details

Here are some result fields to keep in mind (a sketch of how they might fit together follows the list):

  • Average Score: The model's average score across all datapoints in the evaluation.
  • Model Name: The name of the model the evaluation was run on.
  • System Prompt: The system prompt used for the evaluation.
  • User Message: The user message from the dataset.
  • Original Assistant Message: The original assistant message from the dataset.
  • Predicted Assistant Message: The predicted assistant message from the model.
  • Model Score: The score of the model chosen for the evaluation.
  • Score Reason: The reasoning behind the score.
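
To make the relationship between these fields concrete, here is a minimal sketch of how a single result row and a model's Average Score could be represented. This is a hypothetical illustration: the EvaluationResult structure, its field names, and the example data are assumptions, not the product's actual export format or API.

```python
from dataclasses import dataclass
from statistics import mean

@dataclass
class EvaluationResult:
    """Hypothetical shape of one evaluation datapoint (field names assumed)."""
    model_name: str                   # model the evaluation was run on
    system_prompt: str                # system prompt used for the evaluation
    user_message: str                 # user message from the dataset
    original_assistant_message: str   # original assistant message from the dataset
    predicted_assistant_message: str  # message predicted by the model
    model_score: float                # score assigned to this datapoint
    score_reason: str                 # reasoning behind the score

def average_score(results: list[EvaluationResult]) -> float:
    """Average Score for a model: the mean of its per-datapoint scores."""
    return mean(r.model_score for r in results)

# Example usage with made-up data.
results = [
    EvaluationResult("my-model", "You are a helpful assistant.", "What is 2 + 2?",
                     "2 + 2 is 4.", "The answer is 4.", 0.9, "Correct and concise."),
    EvaluationResult("my-model", "You are a helpful assistant.", "Name a primary color.",
                     "Red is a primary color.", "Blue.", 0.7, "Correct but very terse."),
]
print(f"Average Score: {average_score(results):.2f}")  # -> 0.80
```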

Other Options