Check out the Get Started guide to learn how to
create and use datasets.
What Are Datasets?
A Dataset is a collection of datapoints that we will use to train a model how to respond to specific types of inputs. Understanding JSONL Datasets for Fine-Tuning A dataset for fine-tuning is a collection of examples in JSONL format (JSON Lines), where each line represents a single conversation example.Dataset Structure
Each line in your JSONL file contains a JSON object with a single field called โmessagesโ. This field holds an array of 3 message objects, each with:-
A
"role"
field (identifying who is speaking) -
A
"content"
field (containing the actual text)
-
"system"
: Provides context and instructions that guide the modelโs behavior -
"user"
: Represents what a human user would say or ask -
"assistant"
: Contains the ideal response you want the model to learn to generate
Next Step: Create a Dataset
Create a Dataset
Click here to learn how to create a dataset.