Kluster.ai Free Batch Inference Tier
Source: https://www.kluster.ai/
Description
Kluster.ai is an AI cloud platform specializing in batch inference for open-source models like DeepSeek-R1, Llama 4 Maverick/Scout, and Qwen3. The free tier supports up to 1,000 batch requests per file with a 100MB max file size. The platform also offers adaptive real-time inference with sub-second latency. Kluster.ai uses an OpenAI-compatible API (api.kluster.ai/v1), so you can use the standard OpenAI Python SDK. Batch inference pricing starts as low as $0.10/M tokens for smaller models with 72-hour completion windows — up to 50% cheaper than competitors.
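Because the endpoint is OpenAI-compatible, no custom SDK is required; with the `openai` package you would simply construct `OpenAI(base_url="https://api.kluster.ai/v1", api_key=...)`. The sketch below shows the same wire format using only the standard library, so nothing extra needs to be installed. The model identifier and the `KLUSTER_API_KEY` environment variable are illustrative assumptions, not confirmed names.

```python
import json
import os
import urllib.request

# kluster.ai exposes an OpenAI-compatible API; the request below targets
# the standard /v1/chat/completions route on its base URL.
BASE_URL = "https://api.kluster.ai/v1"

def chat_request(model: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) a chat-completion request for kluster.ai."""
    payload = {
        "model": model,  # example identifier; check the supported-models list
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{BASE_URL}/chat/completions",
        data=json.dumps(payload).encode(),
        headers={
            "Authorization": f"Bearer {os.environ.get('KLUSTER_API_KEY', '')}",
            "Content-Type": "application/json",
        },
        method="POST",
    )

req = chat_request("deepseek-ai/DeepSeek-R1", "Say hello")
```

Sending the request is then `urllib.request.urlopen(req)` with a valid API key, or the equivalent `client.chat.completions.create(...)` call in the OpenAI SDK.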
Getting started:

1. Go to kluster.ai and click Sign Up
2. Create your account
3. Navigate to the API section to generate your API key
4. Set your base URL to https://api.kluster.ai/v1
5. Use the OpenAI Python SDK with your kluster.ai API key
6. Prepare your batch request as a JSONL file (one request per line)
7. Submit your batch job via the API

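The JSONL file in step 6 carries exactly one request per line. A sketch, assuming the per-line shape mirrors the OpenAI Batch API (`custom_id`/`method`/`url`/`body`); the default model identifier here is an example, not a confirmed name:

```python
import json

def write_batch_file(path, prompts, model="klusterai/Meta-Llama-3.1-8B-Instruct-Turbo"):
    """Write one chat-completion request per line, respecting the free-tier cap."""
    if len(prompts) > 1000:
        raise ValueError("free tier allows at most 1,000 requests per batch file")
    with open(path, "w") as f:
        for i, prompt in enumerate(prompts):
            request = {
                "custom_id": f"req-{i}",  # your own ID, echoed back in the results
                "method": "POST",
                "url": "/v1/chat/completions",
                "body": {
                    "model": model,  # example ID; check the supported-models list
                    "messages": [{"role": "user", "content": prompt}],
                },
            }
            f.write(json.dumps(request) + "\n")

write_batch_file("batch.jsonl", ["Summarize document A", "Summarize document B"])
```

Submission (step 7) then follows the OpenAI SDK's batch flow: upload the file with `client.files.create(file=open("batch.jsonl", "rb"), purpose="batch")` and start the job with `client.batches.create(...)`, assuming kluster.ai mirrors those endpoints as its docs describe.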
Important:
• Free tier has a hard limit of 1,000 requests per batch file
• Max file size is 100MB per batch file (applies to all tiers)
• The API is OpenAI-compatible — use `from openai import OpenAI` and change the base URL
• Batch jobs are processed asynchronously — you submit and poll for results

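Because batch jobs finish asynchronously, the usual pattern is submit-then-poll. A minimal sketch with the status lookup injected as a callable so the loop can run without a live API call; in practice `get_status` would wrap a batch-retrieve call (e.g. `client.batches.retrieve(batch_id).status` in the OpenAI SDK) for your job ID:

```python
import time

def wait_for_batch(get_status, interval_s=30.0, max_polls=1000):
    """Poll get_status() until the job reaches a terminal state."""
    terminal = {"completed", "failed", "expired", "cancelled"}
    for _ in range(max_polls):
        status = get_status()
        if status in terminal:
            return status
        time.sleep(interval_s)
    raise TimeoutError("batch did not reach a terminal state")

# Simulated job lifecycle: "validating" -> "in_progress" -> "completed"
states = iter(["validating", "in_progress", "completed"])
print(wait_for_batch(lambda: next(states), interval_s=0))  # prints "completed"
```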
Supported models:

| Model | Category | Notes |
| --- | --- | --- |
| DeepSeek-R1 | Reasoning | 671B parameter reasoning model |
| Llama 4 Maverick | Chat | Meta's latest MoE model |
| Llama 4 Scout | Chat | Meta's efficient MoE model |
| Llama 3.3 70B Instruct | Chat | Meta's instruction-following model |
| Llama 3.1 405B Instruct Turbo | Chat | Meta's largest Llama model |
| Llama 3.1 8B | Chat | Fast, lightweight model |
| Qwen3-235B-A22B | Chat | Alibaba's flagship MoE |
| Gemma 3 | Chat | Google's open model |

Free tier vs. standard tier:

| Feature | Free Tier | Standard Tier |
| --- | --- | --- |
| Max batch requests per file | 1,000 | Unlimited |
| Max file size | 100MB | 100MB |
| Models available | All supported | All supported |
| Completion windows | 24h, 48h, 72h | 24h, 48h, 72h |
| Real-time inference | Limited | Yes |

Batch pricing (per 1M tokens):

| Model | 24-Hour | 48-Hour | 72-Hour |
| --- | --- | --- | --- |
| DeepSeek-R1 | $3.50 | $3.00 | $2.50 |
| Llama 4 Scout 17Bx16E | $0.15 | $0.12 | $0.10 |
| Llama 3.1 405B Turbo | Higher | Mid | Lower |

Longer completion windows = lower cost. Choose 72h for maximum savings on non-urgent workloads.

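The window/cost trade-off is simple arithmetic. Using the DeepSeek-R1 per-1M-token prices from the table and a hypothetical 40M-token job:

```python
# DeepSeek-R1 batch prices per 1M tokens, taken from the pricing table.
PRICES_PER_M = {"24h": 3.50, "48h": 3.00, "72h": 2.50}

def batch_cost(total_tokens: int, window: str) -> float:
    """Cost in dollars for a batch job at a given completion window."""
    return total_tokens / 1_000_000 * PRICES_PER_M[window]

tokens = 40_000_000  # hypothetical 40M-token workload
for window in ("24h", "48h", "72h"):
    print(f"{window}: ${batch_cost(tokens, window):.2f}")
# 24h: $140.00, 48h: $120.00, 72h: $100.00 -- the 72h window saves $40 here
```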
Inference modes:

| Mode | Latency | Best For |
| --- | --- | --- |
| Real-time | Sub-second | Interactive apps, chat |
| Asynchronous | Minutes | Flexible timing, moderate volume |
| Batch | 24-72 hours | High-volume, bulk processing |

Tips:

• Batch inference is the killer feature — if you have large-scale processing jobs (data labeling, content generation, analysis), batch mode at 72h completion can be 50%+ cheaper than real-time
• JSONL format — each line in your batch file is a separate request in JSON format. Make sure your file is valid JSONL before submitting
• Use with Bespoke Curator — kluster.ai integrates with Bespoke Labs' data curation tool for efficient large-scale inference pipelines
• OpenAI SDK compatible — no custom SDK needed. Just `pip install openai`, set the base URL to api.kluster.ai/v1, and use your API key
• Free tier is per-file, not per-month — you can submit multiple batch files with up to 1,000 requests each
• Promotional credits — kluster.ai has periodically offered $100 in free credits (e.g., for DeepSeek-R1 usage). Watch their blog for announcements
• Adaptive inference — the platform dynamically adjusts computing resources based on workload, which helps keep costs low
• Compare with alternatives — for batch inference, also consider the OpenAI Batch API (50% discount), Together.ai, and Fireworks.ai

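A quick pre-flight check covers the JSONL-validity tip and the free-tier limits in one pass: every non-empty line must parse as JSON, and the file must stay within 1,000 requests and 100MB. A minimal sketch:

```python
import json

def validate_jsonl(path, max_requests=1000, max_bytes=100 * 1024 * 1024):
    """Return a list of problems; an empty list means the file looks submittable."""
    errors, size, count = [], 0, 0
    with open(path, "rb") as f:
        for lineno, raw in enumerate(f, start=1):
            size += len(raw)
            if not raw.strip():
                continue  # tolerate blank lines, though strict JSONL forbids them
            count += 1
            try:
                json.loads(raw)
            except json.JSONDecodeError as e:
                errors.append(f"line {lineno}: {e}")
    if count > max_requests:
        errors.append(f"{count} requests exceeds the {max_requests}-request limit")
    if size > max_bytes:
        errors.append("file exceeds the 100MB limit")
    return errors

# Demo file with a deliberately broken second line.
with open("sample.jsonl", "w") as f:
    f.write('{"custom_id": "req-0"}\n')
    f.write('not json\n')

problems = validate_jsonl("sample.jsonl")  # one error: line 2 is not JSON
```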
Sources:
• Kluster.ai
• Kluster.ai Documentation
• Kluster.ai Supported Models
• Kluster.ai Batch Inference Guide
• Kluster.ai Adaptive Inference Blog
• Kluster.ai on Artificial Analysis
• Using kluster.ai with Bespoke Curator