Providers
Cerebras
Provider
Wafer-scale inference engine. Serves frontier open-weight models at 2000+ tok/sec.
4 active deals
Cloudflare Workers AI
Cloud
Edge-deployed inference from inside Cloudflare Workers. Free allowance of compute units (Neurons) every day.
1 active deals
Cohere
Provider
Enterprise-focused LLM provider with a non-commercial trial tier covering chat, embeddings, and reranking.
1 active deals
GitHub Models
Cloud
Free experimentation tier for OpenAI, Meta, Mistral, and xAI models, authenticated with a GitHub personal access token.
1 active deals
Google AI Studio
Cloud
Google's AI playground with a generous free tier for Gemini models — with a data-training catch.
1 active deals
Groq
Provider
Ultra-fast inference on custom LPU hardware. Runs open-weight models at hundreds of tokens per second.
3 active deals
Hugging Face
Aggregator
Open model hub with a routed Inference Providers API covering 200+ models. Free tier is symbolic.
1 active deals
Microsoft Azure
Cloud
Microsoft's cloud platform. Hosts Azure OpenAI Service with GPT-4o, GPT-4.1, GPT-5 and the o-series, plus Azure AI Foundry models.
1 active deals
Nous Research
Provider
Research lab behind the Hermes and DeepHermes model families. Runs Nous Portal — an OpenAI-compatible API with signup credits.
1 active deals
NVIDIA NIM
Cloud
100+ models behind one OpenAI-compatible endpoint. Free inference credits for NVIDIA Developer Program members.
1 active deals
OpenRouter
Aggregator
Aggregator that routes to 200+ models across providers through a single OpenAI-compatible API. Hosts free variants of many models.
1 active deals
SambaNova Cloud
Provider
Inference on custom RDU silicon. One of the few providers serving 405B Llama models free on signup credits.
1 active deals