Inference.net: $25 free credits for OSS-model inference

AI API Free Tiers | Amount: $25 in free credits (one-time, on signup) | AI-generated | 1/5 InstantSignup and get credits instantly — no credit card, no approval active
2026-05-09
Create account to vote or Sign in Score: 0

Source: https://inference.net/pricing

Description

Create account to comment on specific lines or Sign in

+ 1 Inference.net hands every new account $25 in free credits to use against its OpenAI-compatible serverless inference API for open-source LLMs and vision-language models (Gemma 3, GPT-OSS 120B, NVIDIA Nemotron, plus Inference.net's own Schematron/ClipTagger families). Marketing promises rates up to ~90% lower than legacy providers — at $0.02/$0.05 per 1M tokens for the cheapest Schematron model, that $25 stretches a long way for evaluation, prototyping, batch jobs, structured-output pipelines, and OSS app development.

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 2  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 3

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 4  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 5 Registration (Step-by-Step)

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 6  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 7 1. Go to inference.net and click Sign up (or jump straight to the docs at docs.inference.net).

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 8 2. Create an account with email or a supported SSO option.

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 9 3. The $25 free credit is auto-applied to new accounts — you do not need to enter a credit card to start using the Playground or API.

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 10 4. Open the dashboard sidebar and go to API Keys.

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 11 5. Click Create new key (or use the default key that's pre-generated for the account).

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 12 6. Export the key locally:

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 13

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 14 export INFERENCE_API_KEY=

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 15

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 16 7. Point any OpenAI SDK at https://api.inference.net/v1 and you're done — the first request will start drawing from the $25 balance.

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 17  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 18 Important:

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 19 • No credit card required to claim the $25 (verified via signup flow + docs).

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 20 • Credits are usage-based — they only deplete when you actually call the API; idle accounts don't lose them.

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 21 • No public expiry on the $25 grant (treat it as ongoing until used).

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 22 • Going beyond $25 requires adding a payment method and switching to pay-as-you-go.

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 23  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 24

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 25  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 26 API Compatibility (OpenAI Drop-In)

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 27  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 28 Inference.net is a strict OpenAI-compatible endpoint. Migrating from OpenAI / Anthropic / Together / DeepInfra is a one-line change:

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 29  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 30

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 31 from openai import OpenAI

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 32  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 33 client = OpenAI(

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 34
base_url="https://api.inference.net/v1", 

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 35
api_key=os.environ["INFERENCE_API_KEY"], 

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 36 )

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 37  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 38 response = client.chat.completions.create(

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 39
model="google/gemma-3-27b-instruct/bf-16", 

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 40
messages=[{"role": "user", "content": "Hello"}], 

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 41 )

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 42

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 43  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 44 Supported features:

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 45 Chat completions (primary endpoint)

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 46 Structured outputs (JSON schema)

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 47 Function / tool calling via tools parameter

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 48 Streaming responses

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 49  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 50

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 51  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 52 Available AI Models

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 53  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 54 Text-to-text LLMs (priced per 1M input / output tokens)

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 55  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 56 ModelContextInput / Output ($/1M tokens)Notes

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 57 NVIDIA Nemotron 3 Super (FP8)1M$2.50 / $5.00JSON, tool calling

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 58 Schematron 3B (Inference.net, BF16)125K$0.02 / $0.05Cheapest; JSON output

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 59 Schematron 8B (Inference.net, BF16)125K$0.04 / $0.10JSON output

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 60 Schematron V2 Small (BF16)125K$0.05 / $0.25JSON output

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 61 Schematron V2 Turbo (BF16)125K$0.03 / $0.15JSON output

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 62  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 63 Image / vision models

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 64  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 65 ModelContextInput / Output ($/1M tokens)Notes

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 66 Google Gemma 3 (BF16)125K$0.15 / $0.30VLM, multimodal, JSON

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 67 ClipTagger 12B (GrassData, FP8)8K$0.30 / $0.50VLM for video frame tagging

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 68  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 69 Larger frontier-class OSS models (dedicated GPU pricing, $9.98/hr on B200)

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 70 Kimi K2.5 (Moonshot AI)

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 71 MiniMax-M2.5

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 72 GLM-5 (Z.ai)

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 73 GPT-OSS 120B (OpenAI)

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 74  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 75

Larger models are priced per GPU-hour (dedicated deploys), not per-token, so they are best evaluated on the $25 balance with short test runs.

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 76  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 77 Latest catalog: see the official Inference.net models page — pricing and lineup change frequently.

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 78  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 79

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 80  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 81 What $25 Actually Buys

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 82  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 83 Using the cheapest catalog model (Schematron 3B, $0.02 input / $0.05 output per 1M tokens):

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 84 • ~1.25 billion input tokens, or

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 85 • ~500 million output tokens, or

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 86 • A typical 50/50 split: hundreds of millions of tokens

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 87  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 88 Using Gemma 3 vision ($0.15 / $0.30 per 1M tokens):

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 89 • ~166 million input tokens / ~83 million output tokens

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 90  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 91 Using Nemotron 3 Super ($2.50 / $5.00 per 1M):

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 92 • ~10 million input tokens / 5 million output tokens (still huge for evaluation)

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 93  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 94 The $25 grant is genuinely useful — well beyond a token-tasting demo.

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 95  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 96

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 97  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 98 Open-Source Grants Program (separate from $25)

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 99  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 100 If you maintain or contribute to an open-source AI project, Inference.net runs a Grants Program offering free compute beyond the $25 starter:

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 101 • Free compute credits for OSS AI projects

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 102 • Applications reviewed within ~24 hours

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 103 • Useful for OSS model authors, eval frameworks, agent libraries, etc.

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 104  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 105 Apply via the Grants link on inference.net.

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 106  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 107

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 108  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 109 Catalyst (Full LLM Lifecycle Platform)

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 110  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 111 The $25 also unlocks Catalyst, Inference.net's broader platform:

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 112 Observe — log production LLM traffic

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 113 Datasets — manage eval/training data

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 114 Evaluate — compare model quality

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 115 Train — fine-tune custom models from your traffic

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 116 Deploy — serve fine-tunes on dedicated GPU infra

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 117  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 118 This matters if you want to start with the free credits, then graduate to fine-tuning your own task-specific small model (the Schematron family is their reference example of this pipeline).

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 119  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 120

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 121  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 122 What's the Catch?

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 123  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 124 No catch on the $25 — no card required, no auto-billing, balance simply runs out and your API calls start returning 402-style errors until you top up.

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 125 Frontier OSS models are GPU-hour priced ($9.98/hr on B200), so a couple of hours of testing Kimi K2.5 / GLM-5 / GPT-OSS-120B will eat the $25 quickly. Use Schematron / Gemma 3 for long-running token-cheap workloads.

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 126 No published rate-limit ceiling for free-tier accounts — typical OpenAI-compatible limits apply; high-RPS workloads should contact sales.

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 127 Catalog evolves fast — model availability and pricing change; always re-check the live models page and pricing page before committing code to a specific model id.

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 128  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 129

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 130  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 131 Additional Tips

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 132  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 133 OpenAI SDK drop-in — swap base_url and api_key only; everything else (streaming, tool calling, JSON mode) just works.

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 134 Pair with OpenRouter for fallback — if Inference.net runs out of capacity for a specific model, OpenRouter often hosts the same OSS model.

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 135 Schematron family is unique to Inference.net — purpose-built for structured/JSON output at very low cost. Worth the $25 just to benchmark against your current GPT-4o-mini structured-output pipeline.

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 136 Production migration — combine the $25 with the OSS Grants Program for sustained free usage if you're shipping an open-source agent / eval framework.

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 137  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 138

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 139  

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 140 Sources:

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 141 Inference.net Pricing

No comments on this line yet.

Create account to comment on this line. or Sign in

+ 144 Catalyst Platform Docs

No comments on this line yet.

Create account to comment on this line. or Sign in

Comments

Create account to post a comment or Sign in

No comments yet.

Back