Skip to main content

2. Quickstart: Run Your First Inference

Follow these steps to run your first inference request on Suiri. This quickstart will help you make your first API call in minutes.

Before You Begin

Before starting the quickstart, make sure you’ve completed the following one-time setup steps:
  • Create a Suiri account
    Sign up at https://www.suiri.ai using your email address.
  • Add a billing method
    An active billing method is required to run inference requests, including in demo environments. This is a one-time setup. Check Step 2 below.

Step 1: Create an Account

  1. Visit www.suiri.ai and sign up using your email address
  2. You’ll receive a one-time password (OTP) via email for both first login and subsequent logins
  3. Check your spam/junk folder if you don’t see the email

Step 2: Choose a Billing Plan

In the current demo environment, billing is usage-based (pay-as-you-go) and measured by token consumption. Looking for custom models or free credits? Contact us via email at support@suiri.ai or via Slack to discuss custom arrangements.

Step 3: Create an API Key

  1. Navigate to the API Keys section in your dashboard
  2. Click Generate New Key
  3. Copy and save your API key securely. Note that it will only be displayed once
See Security & Access → Creating API Keys for detailed steps.

Step 4: Run Inference via Playground

Test your first inference using the interactive Playground:
  1. Go to the Playground tab
  2. Select a model from the dropdown
  3. Enter your prompt in the chat interface
    a. For text models: “What is the capital of France?”
  4. Click Send to see the model’s response

Step 5: Run Inference via API

Once you’ve tested in the Playground, call Suiri programmatically using your API key. Base API URL: https://pulse.suiri.ai/v1 Authentication: Include your API key in the request header: Authorization: Bearer YOUR_API_KEY

Example Request

curl --location https://pulse.suiri.ai/v1/chat/completions \
 --header "Content-Type: application/json" \
 --header "Authorization: Bearer $API_KEY" \
 --data '{
 "model": "gemma-2b-it-q4_0",
 "messages": [
 {"role": "system", "content": "You are a helpful assistant."},
 {"role": "user", "content": "What is the capital of France?"}
 ]
 }'

Example Response

{
  "id": "chatcmpl-abc123",
  "object": "chat.completion",
  "created": 1234567890,
  "model": "gemma-2b-it-q4_0",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "The capital of France is Paris."
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 20,
    "completion_tokens": 8,
    "total_tokens": 28
  }
}