Check out the Get Started guide to learn how to
create and use datasets.
What Are Datasets?
A Dataset is a collection of datapoints that we will use to train a model how to respond to specific types of inputs. Understanding JSONL Datasets for Fine-Tuning A dataset for fine-tuning is a collection of examples in JSONL format (JSON Lines), where each line represents a single conversation example.Dataset Structure
Each line in your JSONL file contains a JSON object with a single field called βmessagesβ. This field holds an array of 3 message objects, each with:-
A
"role"field (identifying who is speaking) -
A
"content"field (containing the actual text)
-
"system": Provides context and instructions that guide the modelβs behavior -
"user": Represents what a human user would say or ask -
"assistant": Contains the ideal response you want the model to learn to generate
Next Step: Create a Dataset
Create a Dataset
Click here to learn how to create a dataset.