POST /api/v1/chat/completions
cURL
curl --request POST \
  --url http://studio.premai.io/api/v1/chat/completions \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '{
  "messages": [
    {
      "role": "system",
      "content": "<string>",
      "name": "<string>",
      "tool_calls": [
        {
          "id": "<string>",
          "type": "function",
          "function": {
            "name": "<string>",
            "arguments": "<string>"
          }
        }
      ],
      "tool_call_id": "<string>"
    }
  ],
  "model": "<string>"
}'
{
  "id": "<string>",
  "choices": [
    {
      "finish_reason": "stop",
      "index": 1,
      "message": {
        "content": "<string>",
        "role": "assistant",
        "name": "<string>",
        "tool_calls": [
          {
            "id": "<string>",
            "type": "function",
            "function": {
              "name": "<string>",
              "arguments": "<string>"
            }
          }
        ],
        "tool_call_id": "<string>"
      }
    }
  ],
  "created": 123,
  "model": "<string>",
  "system_fingerprint": "<string>",
  "object": "chat.completion",
  "usage": {
    "completion_tokens": 1,
    "prompt_tokens": 1,
    "total_tokens": 1
  }
}
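The fields in the example response above can be read with a few lines of Python; the literal values below are stand-ins mirroring the schema, not real API output:

```python
import json

# Example response body with the documented shape (placeholder values).
raw = '''
{
  "id": "cmpl-123",
  "choices": [
    {
      "finish_reason": "stop",
      "index": 0,
      "message": {"content": "Hello!", "role": "assistant"}
    }
  ],
  "created": 1700000000,
  "model": "example-model",
  "object": "chat.completion",
  "usage": {"completion_tokens": 1, "prompt_tokens": 1, "total_tokens": 2}
}
'''

resp = json.loads(raw)
message = resp["choices"][0]["message"]  # first (typically only) choice
print(message["role"], message["content"], resp["choices"][0]["finish_reason"])
```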

Authorizations

Authorization
string
header
required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Body

application/json
messages
object[]
required

An array of messages comprising the conversation so far. Must contain at least one message. System messages are only allowed as the first message.

Minimum length: 1
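Both constraints above (at least one message, system messages only first) can be checked client-side before sending a request; this helper is a sketch, not part of the API:

```python
def validate_messages(messages):
    """Enforce the documented constraints: the array must contain at
    least one message, and a system message may only appear first."""
    if not messages:
        raise ValueError("messages must contain at least one message")
    for i, msg in enumerate(messages):
        if msg.get("role") == "system" and i != 0:
            raise ValueError("system messages are only allowed as the first message")

validate_messages([
    {"role": "system", "content": "You are helpful."},
    {"role": "user", "content": "Hi"},
])  # passes
```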
model
string
required

The identifier of the model to use for generating completions. This can be a model ID or an alias.

frequency_penalty
number
default:0

A value between -2.0 and 2.0 that penalizes new tokens based on their frequency in the text so far. Higher values decrease the likelihood of the model repeating the same tokens.

Required range: -2 <= x <= 2
max_completion_tokens
integer | null

The maximum number of tokens to generate in the completion. If null, the model's maximum context length is used.

Required range: x > 0
presence_penalty
number
default:0

A value between -2.0 and 2.0 that penalizes new tokens based on whether they appear in the text so far. Higher values increase the likelihood of the model talking about new topics.

Required range: -2 <= x <= 2
seed
integer

A seed value for deterministic sampling. Using the same seed with the same parameters will generate the same completion.

stop

One or more sequences where the API will stop generating further tokens. Can be a single string or an array of strings.
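Because stop accepts either a single string or an array, a client may want to normalize the value before building the payload; a minimal sketch:

```python
def normalize_stop(stop):
    """Accept a single string or a list of strings, as the stop
    parameter does, and always return a list (or None)."""
    if stop is None:
        return None
    if isinstance(stop, str):
        return [stop]
    return list(stop)

print(normalize_stop("\n"))            # ['\n']
print(normalize_stop(["END", "###"]))  # ['END', '###']
```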

stream
boolean
default:false

If true, partial message deltas will be sent as server-sent events. Useful for showing progressive generation in real time.
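With stream set to true, the body arrives as server-sent events: a client reads the `data:` lines and stops at a terminator. The `[DONE]` sentinel and the `delta` chunk shape below follow the common OpenAI-style streaming convention and are assumptions, not confirmed by this page:

```python
import json

def parse_sse_chunks(lines):
    """Yield the JSON payload of each 'data:' line, stopping at the
    (assumed) '[DONE]' sentinel used by OpenAI-style streaming APIs."""
    for line in lines:
        if not line.startswith("data:"):
            continue
        payload = line[len("data:"):].strip()
        if payload == "[DONE]":
            break
        yield json.loads(payload)

# Simulated stream (placeholder chunks, not real API output).
stream = [
    'data: {"choices": [{"delta": {"content": "Hel"}}]}',
    'data: {"choices": [{"delta": {"content": "lo"}}]}',
    "data: [DONE]",
]
text = "".join(c["choices"][0]["delta"]["content"] for c in parse_sse_chunks(stream))
print(text)  # Hello
```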

temperature
number | null
default:0.7

Controls randomness in the model's output. Values between 0 and 2. Lower values make the output more focused and deterministic; higher values make it more random and creative.

Required range: 0 <= x <= 2
top_p
number | null
default:1

An alternative to temperature for controlling randomness: the model samples only from the tokens whose cumulative probability is within top_p. Lower values make output more focused.

Required range: 0 <= x <= 1
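The documented ranges for the sampling parameters can be validated client-side before sending a request; this sketch uses only the ranges stated above:

```python
# Documented ranges for the sampling parameters of this endpoint.
RANGES = {
    "frequency_penalty": (-2.0, 2.0),
    "presence_penalty": (-2.0, 2.0),
    "temperature": (0.0, 2.0),
    "top_p": (0.0, 1.0),
}

def check_sampling_params(params):
    """Raise if any sampling parameter falls outside its documented range."""
    for name, (lo, hi) in RANGES.items():
        value = params.get(name)
        if value is not None and not (lo <= value <= hi):
            raise ValueError(f"{name}={value} outside [{lo}, {hi}]")

check_sampling_params({"temperature": 0.7, "top_p": 1.0})  # within range
```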
response_format
object

Specifies the format of the model's output. Use "json_schema" to constrain responses to valid JSON matching the provided schema.
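The exact shape of the response_format object is not spelled out on this page; the payload below follows the common OpenAI-style json_schema convention and should be treated as an assumption, not a confirmed schema:

```python
import json

# Hypothetical response_format payload (OpenAI-style shape, assumed).
response_format = {
    "type": "json_schema",
    "json_schema": {
        "name": "weather_report",  # hypothetical schema name
        "schema": {
            "type": "object",
            "properties": {
                "city": {"type": "string"},
                "temp_c": {"type": "number"},
            },
            "required": ["city", "temp_c"],
        },
    },
}

# The object must serialize cleanly into the request body.
body_fragment = json.dumps({"response_format": response_format})
```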

tools
(any | null)[]

A list of tools the model may call. Each tool describes a function the model can invoke to accomplish specific tasks.

tool_choice

Controls how the model uses tools. "none" disables tools, "auto" lets the model decide, or specify a particular tool configuration.

Available options:
none
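When the model calls a tool, the tool_calls entries shown in the message schema above carry the function name and a JSON-encoded arguments string; decoding one looks like this (placeholder values):

```python
import json

# A tool_calls entry with the shape shown in the message schema above.
tool_call = {
    "id": "call_1",
    "type": "function",
    "function": {"name": "get_weather", "arguments": '{"city": "Paris"}'},
}

name = tool_call["function"]["name"]
# arguments arrive as a JSON string, not a parsed object
args = json.loads(tool_call["function"]["arguments"])
print(name, args["city"])  # get_weather Paris
```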

Response

chat completion response

id
string
required

A unique identifier for this chat completion response. Can be used for tracking or debugging.

choices
object[]
required

An array of completion choices. Each choice represents a possible completion for the input prompt, though currently only one choice is typically returned.

created
integer
required

The Unix timestamp (in seconds) indicating when this completion was generated by the API.

model
string
required

The specific model used to generate this completion. This will be the model's full identifier string.

object
enum<string>
required

The type of object returned, always "chat.completion" for chat completion responses.

Available options:
chat.completion
system_fingerprint
string | null

A unique identifier for the system state that generated this response. Useful for tracking model behavior across requests.

usage
object

Statistics about token usage for this request and response. May be omitted in error cases or when not available.