Hugging Face - Free Inference API Credits

AI API Free Tiers | Amount: ~$0.10/month in free inference credits (subject to change) | AI-generated | 1/5 Instant: sign up and get credits instantly, no credit card, no approval | active
2026-02-05

Source: https://huggingface.co/inference-api

Description

Every Hugging Face user receives about $0.10/month in free inference credits (subject to change) to experiment with Inference Providers, a unified API that routes requests to 200+ models across 18+ inference partners. No credit card is required. Free users cannot continue past their monthly credit limit (no pay-as-you-go). The PRO plan ($9/month) raises credits to $2/month and unlocks pay-as-you-go billing after credits run out. Hugging Face charges provider rates with zero markup.


Registration (Step-by-Step)

1. Go to huggingface.co/join
2. Sign up with email or sign in with Google, GitHub, or other OAuth providers
3. Confirm your email address
4. Go to Token Settings (huggingface.co/settings/tokens) and create a fine-grained token with the "Make calls to Inference Providers" permission
5. Done. You can now use the Inference API immediately with your $0.10/month free credits


Important:
• No credit card required at any stage
• No approval process; access is instant
• Free credits reset monthly; unused credits do not roll over
• When credits run out, you get a 402 Payment Required error; free users cannot continue past the limit
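
The 402 behaviour described above can be handled explicitly in client code. The following is a minimal sketch using only the Python standard library. It assumes the router exposes the usual OpenAI-style `/chat/completions` path under the base URL given in this document. The helper names (`build_chat_request`, `describe_failure`, `chat_once`) are hypothetical and chosen for illustration.

```python
import json
import urllib.error
import urllib.request

# Assumption: OpenAI-style path appended to the base URL from this document.
ROUTER_URL = "https://router.huggingface.co/v1/chat/completions"


def build_chat_request(token: str, model: str, prompt: str) -> urllib.request.Request:
    """Build an OpenAI-style chat completion request for the HF router."""
    payload = {"model": model, "messages": [{"role": "user", "content": prompt}]}
    return urllib.request.Request(
        ROUTER_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def describe_failure(status: int) -> str:
    """Turn an HTTP status code into a short hint for the user."""
    if status == 402:
        # 402 Payment Required: the monthly free credits are exhausted.
        return "Monthly inference credits are used up; wait for the reset or upgrade."
    return f"Request failed with HTTP {status}."


def chat_once(token: str, model: str, prompt: str) -> str:
    """Send one request; return the reply text or a hint describing the failure."""
    request = build_chat_request(token, model, prompt)
    try:
        with urllib.request.urlopen(request, timeout=30) as response:
            body = json.load(response)
    except urllib.error.HTTPError as err:
        return describe_failure(err.code)
    return body["choices"][0]["message"]["content"]
```

Calling `chat_once("hf_YOUR_TOKEN", "deepseek-ai/DeepSeek-V3-0324", "Hello!")` would perform a real, billable request, so the sketch does not call it by itself.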


How Inference Providers Work

Hugging Face Inference Providers is a proxy layer that sits between your app and multiple AI providers (Groq, Together AI, SambaNova, Fireworks, Replicate, etc.). You send requests using a single Hugging Face token, and the system routes each request to an available provider that serves the requested model.


Two billing modes:

| Mode | How It Works | Credits Apply? | Pay-as-you-go? |
| --- | --- | --- | --- |
| Routed by Hugging Face (default) | Request routes through HF to a provider | Yes | Only for PRO/Enterprise |
| Custom Provider Key | You use your own provider API key | No | Billed by provider directly |


The API is OpenAI-compatible: you can use https://router.huggingface.co/v1 as the base URL with any OpenAI SDK.


Available Models & Providers

Access 200+ models across these providers:

| Provider | LLMs | Vision LLMs | Embeddings | Text-to-Image | Text-to-Video | Speech-to-Text |
| --- | --- | --- | --- | --- | --- | --- |
| Cerebras | Yes | | | | | |
| Cohere | Yes | Yes | | | | |
| Fal AI | | | | Yes | Yes | Yes |
| Fireworks | Yes | Yes | | | | |
| Groq | Yes | Yes | | | | |
| HF Inference | Yes | Yes | Yes | Yes | | Yes |
| Hyperbolic | Yes | Yes | | | | |
| Novita | Yes | Yes | | | Yes | |
| Nscale | Yes | Yes | | Yes | | |
| Replicate | | | | Yes | Yes | Yes |
| SambaNova | Yes | | Yes | | | |
| Together AI | Yes | Yes | | Yes | | |


Popular models include DeepSeek-R1, DeepSeek-V3, the Llama 3/4 family, Mistral/Mixtral, Qwen 2.5/3, FLUX.1 (image generation), GPT-OSS-120B, and many more. Browse the full list at huggingface.co/inference/models.


Provider Selection & Routing

You can control which provider handles your request:

• Automatic (default): Routes to the first available provider based on your preference order
• :fastest suffix: Selects the provider with the highest throughput (e.g., deepseek-ai/DeepSeek-R1:fastest)
• :cheapest suffix: Selects the provider with the lowest price per output token
• Explicit provider: Specify the provider directly (e.g., deepseek-ai/DeepSeek-R1:sambanova)
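
The routing options above all come down to what is appended to the model ID. As a small illustration, the hypothetical helper below (`routed_model` is not part of any SDK) builds the string you would pass as the `model` parameter.

```python
from typing import Optional


def routed_model(model_id: str, policy: Optional[str] = None) -> str:
    """Append a routing suffix to a model ID.

    policy may be "fastest", "cheapest", or a provider name such as "sambanova".
    None leaves the ID unchanged so the default automatic routing applies.
    """
    if policy is None:
        return model_id
    if ":" in model_id:
        raise ValueError(f"{model_id!r} already carries a routing suffix")
    return f"{model_id}:{policy}"


print(routed_model("deepseek-ai/DeepSeek-R1"))               # deepseek-ai/DeepSeek-R1
print(routed_model("deepseek-ai/DeepSeek-R1", "fastest"))    # deepseek-ai/DeepSeek-R1:fastest
print(routed_model("deepseek-ai/DeepSeek-R1", "sambanova"))  # deepseek-ai/DeepSeek-R1:sambanova
```

Keeping the policy separate from the model ID makes it easy to switch between automatic routing and a pinned provider through configuration.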


Free Credits vs. PRO Plan

| Feature | Free ($0) | PRO ($9/month) |
| --- | --- | --- |
| Monthly inference credits | ~$0.10 (subject to change) | $2.00 |
| Pay-as-you-go after credits | No, hard stop at limit | Yes, billed at provider rates |
| ZeroGPU Spaces usage | ~300 seconds, low priority | 8x quota (~2,400 sec), highest priority |
| ZeroGPU Spaces hosting | Cannot host | Up to 10 Spaces (H200 GPU) |
| Private storage | 100 GB | 1 TB (10x) |
| Queue priority | Standard | Highest |


ZeroGPU Spaces (Bonus Free GPU Access)

Separate from Inference Providers, Hugging Face offers ZeroGPU Spaces, public Gradio apps that dynamically allocate NVIDIA H200 GPUs on demand:

• Free users can use any public ZeroGPU Space with a rate limit of ~300 seconds per session (refills at 1 ZeroGPU second per 30 real seconds)
• PRO users get 8x the quota and highest queue priority
• Only PRO users and Enterprise orgs can host ZeroGPU Spaces; anyone can use them
• Currently only works with the Gradio SDK
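
To make the quota figures above concrete, here is a short arithmetic sketch. It assumes the refill rate is constant and applies linearly to the free quota; the text states only the rate, so treat the result as an estimate.

```python
FREE_QUOTA_SECONDS = 300          # approximate free quota from the text
PRO_MULTIPLIER = 8                # PRO quota is 8x the free quota
REAL_SECONDS_PER_GPU_SECOND = 30  # refill rate from the text


def refill_time_seconds(gpu_seconds_used: int) -> int:
    """Real time needed to regain the given number of ZeroGPU seconds."""
    return gpu_seconds_used * REAL_SECONDS_PER_GPU_SECOND


pro_quota = FREE_QUOTA_SECONDS * PRO_MULTIPLIER
full_refill_hours = refill_time_seconds(FREE_QUOTA_SECONDS) / 3600

print(pro_quota)          # 2400
print(full_refill_hours)  # 2.5
```

Under these assumptions, a fully spent free quota takes about two and a half hours to come back.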


Billing & Pricing Details

• Hugging Face charges zero markup on provider rates: you pay exactly what the provider charges
• Billing is per request, based on compute time × hardware cost per second
• Example: a FLUX.1-dev image generation taking 10 seconds on a GPU at $0.00012/sec costs 10 × $0.00012 = $0.0012 per image
• Track spending at huggingface.co/settings/billing
• Detailed per-model, per-provider breakdown at Inference Providers Settings
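
The billing formula above can be checked with a few lines of arithmetic. The sketch below reuses the figures from the FLUX.1-dev example and the credit amounts quoted in this document. Those figures describe one illustrative GPU rate, and actual prices vary by model and provider. `Decimal` is used to avoid floating-point rounding in currency values.

```python
from decimal import Decimal


def request_cost(compute_seconds: str, price_per_second: str) -> Decimal:
    """Cost of one request: compute time x hardware cost per second."""
    return Decimal(compute_seconds) * Decimal(price_per_second)


def requests_within(budget: str, cost: Decimal) -> int:
    """Number of whole requests of the given cost that fit into a budget."""
    return int(Decimal(budget) // cost)


flux_cost = request_cost("10", "0.00012")
print(flux_cost)                           # 0.00120
print(requests_within("0.10", flux_cost))  # 83   (free credits)
print(requests_within("2.00", flux_cost))  # 1666 (PRO credits)
```

At a higher per-second rate or a longer generation time, the same budget covers proportionally fewer requests.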


Integration (Quick Start)

The API works as a drop-in replacement for OpenAI's chat completions endpoint:

    from openai import OpenAI

    # Point the OpenAI SDK at the Hugging Face router instead of api.openai.com.
    client = OpenAI(
        base_url="https://router.huggingface.co/v1",
        api_key="hf_YOUR_TOKEN",  # replace with your fine-grained Hugging Face token
    )

    completion = client.chat.completions.create(
        model="deepseek-ai/DeepSeek-V3-0324",
        messages=[{"role": "user", "content": "Hello!"}],
    )

    # Print the assistant's reply.
    print(completion.choices[0].message.content)


Also available via the native SDKs (huggingface_hub for Python, @huggingface/inference for JavaScript), direct HTTP/cURL, and the Inference Playground for browser-based testing.


Additional Tips

• $0.10 goes fast: on pricier models, a few LLM chat completions or a single image generation can use up your monthly free credits. Treat it as a trial, not a production budget.
• Limits have been tightened: multiple community reports indicate that free-tier limits were reduced in late 2024/early 2025. Users who previously ran hundreds of requests now hit limits much sooner.
• Bring your own key: if you already have accounts with Groq (free tier), Together AI, or SambaNova, you can use their API keys through Hugging Face without consuming HF credits.
• Team plan ($20/user/month) and Enterprise (from $50/user/month) provide $2/seat in shared credits plus centralized billing.
• No SLA on free tier: there is no uptime or latency guarantee for free users.
• The HF Inference provider (the built-in one) focuses mostly on CPU inference as of mid-2025; for GPU-accelerated LLMs, requests are routed to external providers such as Groq, SambaNova, or Together AI.
• The OpenAI-compatible endpoint only supports chat completions; for image generation, embeddings, or speech tasks, use the native HF SDK.


