Llama 3.1 70B
free_tier
1,000 inference credits on signup, 40 RPM
status
CREDITS
rate_limit
40 RPM, one-time 1K credit pool
restrictions
Requires free NVIDIA Developer Program membership (separate signup form). Credits are one-time — no recurring free tier.
[API]
Our take
NVIDIA hands out 1,000 free inference credits to Developer Program members, 40 RPM, OpenAI-compatible, 100+ models behind one endpoint — Llama 3.1 70B, Nemotron, Kimi K2.5, MiniMax. The Dev Program form is 5 minutes of friction for 1,000 free calls. Good trade. Underused because nobody talks about it.
Apr 5, 2026