Skip to main content
POST
/
api
/
v1
/
public
/
datasets
/
create-synthetic
cURL
curl --request POST \
  --url http://studio.premai.io/api/v1/public/datasets/create-synthetic \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: multipart/form-data' \
  --form project_id=3c90c3cc-0d44-4b50-8888-8dd25736052a \
  --form 'name=<string>' \
  --form pairs_to_generate=2 \
  --form files=@example-file
{
  "dataset_id": "3c90c3cc-0d44-4b50-8888-8dd25736052a"
}

Authorizations

Authorization
string
header
required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Body

multipart/form-data
project_id
string<uuid>
required
name
string
required
Minimum length: 1
pairs_to_generate
integer
required
Required range: x >= 1
files
file[]

Optional: PDF, DOCX, etc.

youtube_urls[]
string<uri>[]

YouTube URLs as array

website_urls[]
string<uri>[]

Website URLs as array

user_instructions
string

Generate Q&A pairs about...

system_prompt
string

You are a helpful assistant...

temperature
number | null

0.0-1.0, controls randomness

Required range: 0 <= x <= 1
chunk_size
integer

Text chunk size for processing

Required range: x >= 1000
pair_type
enum<string>

Type of pairs to generate

Available options:
qa,
cot,
summary
rules_and_constraints
string

Avoid...

question_answer_guidance
string

Focus on...

Response

Dataset creation started

dataset_id
string<uuid>
required