2. Quickstart: Run Your First Inference

This quickstart walks you through your first inference request on Suiri, from account setup to your first API call, in a few minutes.

Before You Begin

Before starting the quickstart, make sure you’ve completed the following one-time setup steps:

Create a Suiri account
Sign up at https://www.suiri.ai using your email address.
Add a billing method
An active billing method is required to run inference requests, including in demo environments. This is a one-time setup. See Step 2 below for details.

Step 1: Create an Account

Visit https://www.suiri.ai and sign up using your email address.
Suiri emails you a one-time password (OTP) each time you log in, including your first login.
Check your spam/junk folder if you don’t see the email.

Step 2: Choose a Billing Plan

In the current demo environment, billing is usage-based (pay-as-you-go) and measured by token consumption. Looking for custom models or free credits? Contact us via email at support@suiri.ai or via Slack to discuss custom arrangements.

Step 3: Create an API Key

1. Navigate to the API Keys section in your dashboard.
2. Click Generate New Key.
3. Copy your API key and store it securely. It is displayed only once.

See Security & Access → Creating API Keys for detailed steps.
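Because the key is shown only once, a common practice is to keep it out of source code and read it from the environment at runtime. The sketch below assumes the key has been exported as the API_KEY environment variable (the name the curl example in Step 5 uses); the helper name load_api_key is hypothetical and not part of any Suiri SDK.

```python
import os


def load_api_key(env_var: str = "API_KEY") -> str:
    """Return the API key stored in the given environment variable.

    The default variable name matches the $API_KEY used in the curl
    example; the function itself is an illustrative helper only.
    """
    key = os.environ.get(env_var, "").strip()
    if not key:
        # Fail early with a clear message instead of sending an empty token.
        raise RuntimeError(
            f"Set the {env_var} environment variable to your Suiri API key."
        )
    return key
```

Calling load_api_key() once at startup surfaces a missing key before any request is sent.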

Step 4: Run Inference via Playground

Test your first inference using the interactive Playground:

1. Go to the Playground tab.
2. Select a model from the dropdown.
3. Enter your prompt in the chat interface.
   a. For text models: “What is the capital of France?”
4. Click Send to see the model’s response.

Step 5: Run Inference via API

Once you’ve tested in the Playground, call Suiri programmatically using your API key.

Base API URL: https://pulse.suiri.ai/v1
Authentication: Include your API key in the request header: Authorization: Bearer YOUR_API_KEY

Example Request

# Assumes your API key is exported in the API_KEY environment variable.
curl --location https://pulse.suiri.ai/v1/chat/completions \
  --header "Content-Type: application/json" \
  --header "Authorization: Bearer $API_KEY" \
  --data '{
    "model": "gemma-2b-it-q4_0",
    "messages": [
      {"role": "system", "content": "You are a helpful assistant."},
      {"role": "user", "content": "What is the capital of France?"}
    ]
  }'
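The same request can be issued from Python. The following is a minimal sketch using only the standard library; it mirrors the curl call above (same URL, headers, model, and messages). The function names build_chat_request and send are hypothetical helpers, not part of a Suiri client library, and the 30-second timeout is an arbitrary choice.

```python
import json
import urllib.request

BASE_URL = "https://pulse.suiri.ai/v1"  # base API URL from this quickstart


def build_chat_request(api_key, prompt, model="gemma-2b-it-q4_0"):
    """Build (but do not send) a POST request to /chat/completions."""
    payload = {
        "model": model,
        "messages": [
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": prompt},
        ],
    }
    return urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def send(request, timeout=30):
    """Send the request and return the decoded JSON response body."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

To run it, pass your key and a prompt, for example send(build_chat_request(api_key, "What is the capital of France?")). Keeping request construction separate from sending makes the payload and headers easy to inspect before any network call is made.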

Example Response

{
  "id": "chatcmpl-abc123",
  "object": "chat.completion",
  "created": 1234567890,
  "model": "gemma-2b-it-q4_0",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "The capital of France is Paris."
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 20,
    "completion_tokens": 8,
    "total_tokens": 28
  }
}
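To use the response in code, you typically need two things: the assistant's reply and the token counts, since billing is measured by token consumption. The sketch below parses a trimmed copy of the example response above; the helper names extract_reply and usage_is_consistent are hypothetical, and the code assumes every response has the same shape as this example.

```python
import json

# Trimmed copy of the example response shown above.
SAMPLE_RESPONSE = """
{
  "model": "gemma-2b-it-q4_0",
  "choices": [
    {
      "index": 0,
      "message": {"role": "assistant", "content": "The capital of France is Paris."},
      "finish_reason": "stop"
    }
  ],
  "usage": {"prompt_tokens": 20, "completion_tokens": 8, "total_tokens": 28}
}
"""


def extract_reply(response: dict) -> str:
    """Return the text of the first choice's assistant message."""
    return response["choices"][0]["message"]["content"]


def usage_is_consistent(response: dict) -> bool:
    """Check that prompt and completion tokens add up to the reported total."""
    usage = response["usage"]
    return usage["prompt_tokens"] + usage["completion_tokens"] == usage["total_tokens"]


response = json.loads(SAMPLE_RESPONSE)
print(extract_reply(response))            # The capital of France is Paris.
print(response["usage"]["total_tokens"])  # 28
```

In the example, 20 prompt tokens plus 8 completion tokens give the reported total of 28.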
