Inference.net: $25 free credits for OSS-model inference
Source: https://inference.net/pricing
Description
Inference.net hands every new account $25 in free credits to use against its OpenAI-compatible serverless inference API for open-source LLMs and vision-language models (Gemma 3, GPT-OSS 120B, NVIDIA Nemotron, plus Inference.net's own Schematron/ClipTagger families). Marketing promises rates up to ~90% lower than legacy providers — at $0.02/$0.05 per 1M tokens for the cheapest Schematron model, that $25 stretches a long way for evaluation, prototyping, batch jobs, structured-output pipelines, and OSS app development.
1. Go to inference.net and click Sign up (or jump straight to the docs at docs.inference.net).
2. Create an account with email or a supported SSO option.
3. The $25 free credit is auto-applied to new accounts — you do not need to enter a credit card to start using the Playground or API.
4. Open the dashboard sidebar and go to API Keys.
5. Click Create new key (or use the default key that's pre-generated for the account).
6. Export the key locally:

export INFERENCE_API_KEY=
7. Point any OpenAI SDK at https://api.inference.net/v1 and you're done — the first request will start drawing from the $25 balance.
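Before wiring up any SDK, the endpoint can be smoke-tested from the shell. This is a sketch: it assumes the endpoint follows the standard OpenAI /chat/completions request shape and reuses the model id from this page's Python example; it spends a few tokens of the $25 balance.

```shell
# Assumes INFERENCE_API_KEY was exported in step 6.
# Request shape follows the standard OpenAI chat-completions convention.
curl https://api.inference.net/v1/chat/completions \
  -H "Authorization: Bearer $INFERENCE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model": "google/gemma-3-27b-instruct/bf-16",
       "messages": [{"role": "user", "content": "Hello"}]}'
```

A successful response is a standard OpenAI-style JSON object with a `choices` array.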
Important:
• No credit card required to claim the $25 (verified via signup flow + docs).
• Credits are usage-based — they only deplete when you actually call the API; idle accounts don't lose them.
• No public expiry on the $25 grant (treat it as ongoing until used).
• Going beyond $25 requires adding a payment method and switching to pay-as-you-go.
Inference.net is a strict OpenAI-compatible endpoint. Migrating from OpenAI / Anthropic / Together / DeepInfra is a one-line change:

import os
from openai import OpenAI

client = OpenAI(
    base_url="https://api.inference.net/v1",
    api_key=os.environ["INFERENCE_API_KEY"],
)

response = client.chat.completions.create(
    model="google/gemma-3-27b-instruct/bf-16",
    messages=[{"role": "user", "content": "Hello"}],
)
Supported features:
• Chat completions (primary endpoint)
• Structured outputs (JSON schema)
• Function / tool calling via tools parameter
• Streaming responses
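The structured-outputs feature can be made concrete with the request body the SDK would send. A minimal sketch, assuming Inference.net mirrors OpenAI's `response_format` / `json_schema` field layout; the invoice schema and the model id here are made-up illustrations, not values from this page.

```python
import json

# Hypothetical structured-output request body, OpenAI "json_schema" style.
# The invoice schema and "schematron-3b" id are illustrative placeholders.
payload = {
    "model": "schematron-3b",  # placeholder; check the live models page for real ids
    "messages": [
        {"role": "user", "content": "Extract fields from: Invoice #42, total $19.99"}
    ],
    "response_format": {
        "type": "json_schema",
        "json_schema": {
            "name": "invoice",
            "schema": {
                "type": "object",
                "properties": {
                    "invoice_number": {"type": "integer"},
                    "total": {"type": "number"},
                },
                "required": ["invoice_number", "total"],
            },
        },
    },
}

body = json.dumps(payload)  # what gets POSTed to /v1/chat/completions
```

With the OpenAI SDK, the same `response_format` dict is passed directly to `client.chat.completions.create(...)`.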
Model | Context | Input / Output ($/1M tokens) | Notes
NVIDIA Nemotron 3 Super (FP8) | 1M | $2.50 / $5.00 | JSON, tool calling
Schematron 3B (Inference.net, BF16) | 125K | $0.02 / $0.05 | Cheapest; JSON output
Schematron 8B (Inference.net, BF16) | 125K | $0.04 / $0.10 | JSON output
Schematron V2 Small (BF16) | 125K | $0.05 / $0.25 | JSON output
Schematron V2 Turbo (BF16) | 125K | $0.03 / $0.15 | JSON output
Model | Context | Input / Output ($/1M tokens) | Notes
Google Gemma 3 (BF16) | 125K | $0.15 / $0.30 | VLM, multimodal, JSON
ClipTagger 12B (GrassData, FP8) | 8K | $0.30 / $0.50 | VLM for video frame tagging
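A small helper makes the per-1M rates above concrete. The numbers are copied from the tables on this page and will drift, so treat them as a snapshot, not ground truth.

```python
# Per-1M-token rates (input, output), copied from the tables above.
# Snapshot only: re-check the live pricing page before depending on them.
RATES = {
    "schematron-3b": (0.02, 0.05),
    "gemma-3": (0.15, 0.30),
    "nemotron-3-super": (2.50, 5.00),
}

def job_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Dollar cost of a job at the listed per-1M-token rates."""
    in_rate, out_rate = RATES[model]
    return input_tokens / 1e6 * in_rate + output_tokens / 1e6 * out_rate

# A 10M-input / 2M-output extraction batch on Schematron 3B costs about $0.30.
```

The same function, fed your real traffic counts, tells you how far the $25 grant goes for your workload.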
• Kimi K2.5 (Moonshot AI)
• MiniMax-M2.5
• GLM-5 (Z.ai)
• GPT-OSS 120B (OpenAI)
Larger models are priced per GPU-hour (dedicated deploys), not per-token, so they are best evaluated on the $25 balance with short test runs.
Latest catalog: see the official Inference.net models page — pricing and lineup change frequently.
Using the cheapest catalog model (Schematron 3B, $0.02 input / $0.05 output per 1M tokens):
• ~1.25 billion input tokens, or
• ~500 million output tokens, or
• A typical 50/50 split: hundreds of millions of tokens

Using Gemma 3 vision ($0.15 / $0.30 per 1M tokens):
• ~166 million input tokens / ~83 million output tokens

Using Nemotron 3 Super ($2.50 / $5.00 per 1M):
• ~10 million input tokens / ~5 million output tokens (still huge for evaluation)
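The figures above are just $25 divided by the per-1M rate. A quick sanity check of the arithmetic:

```python
# Tokens purchasable for a dollar budget at a per-1M-token rate.
def tokens_for_budget(budget_usd: float, rate_per_1m_usd: float) -> float:
    return budget_usd / rate_per_1m_usd * 1_000_000

assert round(tokens_for_budget(25, 0.02)) == 1_250_000_000  # Schematron 3B input: 1.25B
assert round(tokens_for_budget(25, 0.05)) == 500_000_000    # Schematron 3B output: 500M
assert int(tokens_for_budget(25, 0.15) / 1e6) == 166        # Gemma 3 input: ~166M
assert round(tokens_for_budget(25, 2.50)) == 10_000_000     # Nemotron input: 10M
```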
The $25 grant is genuinely useful — well beyond a token-tasting demo.
If you maintain or contribute to an open-source AI project, Inference.net runs a Grants Program offering free compute beyond the $25 starter:
• Free compute credits for OSS AI projects
• Applications reviewed within ~24 hours
• Useful for OSS model authors, eval frameworks, agent libraries, etc.
Apply via the Grants link on inference.net.
The $25 also unlocks Catalyst, Inference.net's broader platform:
• Observe — log production LLM traffic
• Datasets — manage eval/training data
• Evaluate — compare model quality
• Train — fine-tune custom models from your traffic
• Deploy — serve fine-tunes on dedicated GPU infra
This matters if you want to start with the free credits, then graduate to fine-tuning your own task-specific small model (the Schematron family is their reference example of this pipeline).
• No catch on the $25 — no card required, no auto-billing; the balance simply runs out and your API calls start returning 402-style errors until you top up.
• Frontier OSS models are GPU-hour priced ($9.98/hr on B200), so a couple of hours of testing Kimi K2.5 / GLM-5 / GPT-OSS-120B will eat the $25 quickly. Use Schematron / Gemma 3 for long-running token-cheap workloads.
• No published rate-limit ceiling for free-tier accounts — typical OpenAI-compatible limits apply; high-RPS workloads should contact sales.
• Catalog evolves fast — model availability and pricing change; always re-check the live models page and pricing page before committing code to a specific model id.
• OpenAI SDK drop-in — swap base_url and api_key only; everything else (streaming, tool calling, JSON mode) just works.
• Pair with OpenRouter for fallback — if Inference.net runs out of capacity for a specific model, OpenRouter often hosts the same OSS model.
• Schematron family is unique to Inference.net — purpose-built for structured/JSON output at very low cost. Worth the $25 just to benchmark against your current GPT-4o-mini structured-output pipeline.
• Production migration — combine the $25 with the OSS Grants Program for sustained free usage if you're shipping an open-source agent / eval framework.
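The OpenRouter-fallback tip boils down to trying providers in order. A minimal, provider-agnostic sketch: the callables stand in for real SDK calls, so no specific client API is assumed here.

```python
from typing import Callable, Sequence

def complete_with_fallback(providers: Sequence[tuple[str, Callable[[], str]]]) -> str:
    """Try each (name, call) pair in order; return the first success."""
    errors = []
    for name, call in providers:
        try:
            return call()
        except Exception as exc:  # capacity, quota, and network errors all fall through
            errors.append(f"{name}: {exc}")
    raise RuntimeError("all providers failed: " + "; ".join(errors))

# In practice each callable wraps client.chat.completions.create(...) with a
# client pointed at that provider's base_url (api.inference.net, openrouter.ai, ...).
```

Keeping provider order in one list also gives you a single place to log which backend actually served each request.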
Sources:
• Inference.net Pricing
• Inference.net Homepage & Models
• Inference.net API Quickstart
• Catalyst Platform Docs
• Keywords AI: Introducing Inference.net