<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
    <channel>
        <title>Freetokens.dev</title>
        <link>https://freetokens.dev</link>
        <description>Free AI token deals — tracked, curated, opinionated.</description>
        <language>en</language>
        <atom:link href="https://freetokens.dev/rss" rel="self" type="application/rss+xml"/>
                    <item>
                <title>Gemini 2.5 Pro on Google AI Studio</title>
                <description><![CDATA[<p>Gemini 2.5 Pro is free on AI Studio. 1M context, Google's flagship, no credit card — the headline free-tier deal in AI right now. The catch: Google trains on everything you send through the free tier. Perfect for prototypes. Disastrous if you ship user data through it.</p>
]]></description>
                                    <link>https://freetokens.dev/deals/google-ai-studio-gemini-25-pro-apr-2026</link>
                    <guid isPermaLink="true">https://freetokens.dev/deals/google-ai-studio-gemini-25-pro-apr-2026</guid>
                                <pubDate>Thu, 23 Apr 2026 16:30:00 +0000</pubDate>
            </item>
                    <item>
                <title>Llama 3.1 8B on Groq</title>
                <description><![CDATA[<p>14,400 free requests a day on Llama 3.1 8B via Groq. That's 10 per minute sustained, zero dollars, zero credit card. Stop hunting 70Bs and build your side project on this — it's the most usable free tier in the game.</p>
]]></description>
                                    <link>https://freetokens.dev/deals/groq-llama-31-8b-apr-2026</link>
                    <guid isPermaLink="true">https://freetokens.dev/deals/groq-llama-31-8b-apr-2026</guid>
                                <pubDate>Wed, 22 Apr 2026 10:15:00 +0000</pubDate>
            </item>
                    <item>
                <title>Qwen 3 235B A22B on Cerebras</title>
                <description><![CDATA[<p>This is the most underrated free deal in AI. Qwen 3 235B A22B — Alibaba's frontier MoE — running at ~2000 tok/sec on Cerebras wafer-scale, 1M tokens/day free. A flagship model for zero dollars. Stop reading and go sign up.</p>
]]></description>
                                    <link>https://freetokens.dev/deals/cerebras-qwen-3-235b-a22b-apr-2026</link>
                    <guid isPermaLink="true">https://freetokens.dev/deals/cerebras-qwen-3-235b-a22b-apr-2026</guid>
                                <pubDate>Mon, 20 Apr 2026 09:45:00 +0000</pubDate>
            </item>
                    <item>
                <title>Llama 3.1 8B on Cerebras</title>
                <description><![CDATA[<p>Cerebras hands you 1 million free tokens a day on Llama 3.1 8B at 2000+ tok/sec on wafer-scale silicon. Literally nobody talks about this. Get a key, point your SDK at it, move on. Treat it as free speed and stop overthinking.</p>
]]></description>
                                    <link>https://freetokens.dev/deals/cerebras-llama-31-8b-apr-2026</link>
                    <guid isPermaLink="true">https://freetokens.dev/deals/cerebras-llama-31-8b-apr-2026</guid>
                                <pubDate>Fri, 17 Apr 2026 11:20:00 +0000</pubDate>
            </item>
                    <item>
                <title>Kimi K2 Instruct on Groq</title>
                <description><![CDATA[<p>Moonshot's Kimi K2 is quietly on Groq's free tier at 60 RPM. Faster than anyone else serves it and completely free. If you've never run a Chinese frontier model on LPU silicon, this is your on-ramp — five minutes from signup to first call.</p>
]]></description>
                                    <link>https://freetokens.dev/deals/groq-kimi-k2-instruct-apr-2026</link>
                    <guid isPermaLink="true">https://freetokens.dev/deals/groq-kimi-k2-instruct-apr-2026</guid>
                                <pubDate>Wed, 15 Apr 2026 17:00:00 +0000</pubDate>
            </item>
                    <item>
                <title>Nemotron 3 Super 120B A12B on OpenRouter</title>
                <description><![CDATA[<p>NVIDIA's Nemotron 3 Super — a 120B hybrid Mamba-Transformer MoE — is free on OpenRouter. Weird architecture worth poking at, long context, zero dollars. Prompts may be logged for upstream training, so keep it to experiments and synthetic data.</p>
]]></description>
                                    <link>https://freetokens.dev/deals/openrouter-nemotron-3-super-120b-a12b-apr-2026</link>
                    <guid isPermaLink="true">https://freetokens.dev/deals/openrouter-nemotron-3-super-120b-a12b-apr-2026</guid>
                                <pubDate>Mon, 13 Apr 2026 13:15:00 +0000</pubDate>
            </item>
                    <item>
                <title>GPT-4o on Microsoft Azure</title>
                <description><![CDATA[<p>Microsoft throws $200 of Azure credit at new accounts and — as of March 2026 — the old Azure OpenAI access-request form is finally dead. Anyone with an account can deploy GPT-4o now. Catches: credit card required, 30-day clock on the credit, and free-tier subs still choke on Foundry deployments sometimes. But $200 is $200 for a serious throughput test.</p>
]]></description>
                                    <link>https://freetokens.dev/deals/microsoft-azure-gpt-4o-apr-2026</link>
                    <guid isPermaLink="true">https://freetokens.dev/deals/microsoft-azure-gpt-4o-apr-2026</guid>
                                <pubDate>Fri, 10 Apr 2026 14:00:00 +0000</pubDate>
            </item>
                    <item>
                <title>MiMo V2 Pro on Nous Research</title>
                <description><![CDATA[<p>Nous and Xiaomi just dropped a 2-week free window on MiMo V2 Pro — Xiaomi's ~1T-parameter MoE flagship — routed through Hermes Agent on the Nous Portal. Install Hermes, run <code>hermes update</code>, sign into a free Nous account, and you're calling a trillion-parameter model for nothing until ~April 21. Not OpenAI-compatible (Hermes Agent CLI only), but you're not going to get another shot at 1T free any time soon. Clock's ticking.</p>
]]></description>
                                    <link>https://freetokens.dev/deals/nous-research-mimo-v2-pro-apr-2026</link>
                    <guid isPermaLink="true">https://freetokens.dev/deals/nous-research-mimo-v2-pro-apr-2026</guid>
                                <pubDate>Tue, 07 Apr 2026 10:00:00 +0000</pubDate>
            </item>
                    <item>
                <title>Llama 3.1 70B on NVIDIA NIM</title>
                <description><![CDATA[<p>NVIDIA hands out 1,000 free inference credits to Developer Program members, 40 RPM, OpenAI-compatible, 100+ models behind one endpoint — Llama 3.1 70B, Nemotron, Kimi K2.5, MiniMax. The Dev Program form is 5 minutes of friction for 1,000 free calls. Good trade. Underused because nobody talks about it.</p>
]]></description>
                                    <link>https://freetokens.dev/deals/nvidia-nim-llama-31-70b-apr-2026</link>
                    <guid isPermaLink="true">https://freetokens.dev/deals/nvidia-nim-llama-31-70b-apr-2026</guid>
                                <pubDate>Sun, 05 Apr 2026 12:00:00 +0000</pubDate>
            </item>
                    <item>
                <title>Llama 3.3 70B on SambaNova Cloud</title>
                <description><![CDATA[<p>SambaNova gives you $5 of credits on signup — enough to touch Llama 3.1 405B on their RDU silicon, one of the only places you can run a 405B model free. Credits expire in 30 days, rate-limited free tier continues after. No credit card. If you haven't tried RDU inference yet, this is the cheapest test drive there is.</p>
]]></description>
                                    <link>https://freetokens.dev/deals/sambanova-cloud-llama-33-70b-apr-2026</link>
                    <guid isPermaLink="true">https://freetokens.dev/deals/sambanova-cloud-llama-33-70b-apr-2026</guid>
                                <pubDate>Fri, 03 Apr 2026 09:30:00 +0000</pubDate>
            </item>
                    <item>
                <title>Llama 3.3 70B on Groq</title>
                <description><![CDATA[<p>Llama 3.3 70B free on Groq at ~275 tok/sec. The 1K-request-a-day ceiling stops you running a SaaS on it, but for agent loops, evals, and weekend builds, it's the fastest free 70B on the planet. Go.</p>
]]></description>
                                    <link>https://freetokens.dev/deals/groq-llama-33-70b-apr-2026</link>
                    <guid isPermaLink="true">https://freetokens.dev/deals/groq-llama-33-70b-apr-2026</guid>
                                <pubDate>Wed, 01 Apr 2026 14:30:00 +0000</pubDate>
            </item>
                    <item>
                <title>Qwen 3.6 Plus Preview on OpenRouter</title>
                <description><![CDATA[<p>Alibaba's Qwen 3.6 Plus Preview just landed free on OpenRouter. 1M context. The 'Preview' label means the second Alibaba flips it to GA, the free endpoint dies — and nobody knows when that drops. This is exactly the kind of deal you check your inbox for. Use it now, not next week.</p>
]]></description>
                                    <link>https://freetokens.dev/deals/openrouter-qwen-36-plus-preview-apr-2026</link>
                    <guid isPermaLink="true">https://freetokens.dev/deals/openrouter-qwen-36-plus-preview-apr-2026</guid>
                                <pubDate>Tue, 31 Mar 2026 12:00:00 +0000</pubDate>
            </item>
                    <item>
                <title>Llama 3.1 8B on Cloudflare Workers AI</title>
                <description><![CDATA[<p>Cloudflare Workers AI gives you 10,000 free Neurons a day across Llama, Mistral, Qwen, and more — edge-deployed in a one-line Worker call. Neurons aren't tokens, so small models stretch way further than you'd think. If you already live on CF, this is effectively free inference co-located with your app.</p>
]]></description>
                                    <link>https://freetokens.dev/deals/cloudflare-workers-ai-llama-31-8b-apr-2026</link>
                    <guid isPermaLink="true">https://freetokens.dev/deals/cloudflare-workers-ai-llama-31-8b-apr-2026</guid>
                                <pubDate>Sat, 28 Mar 2026 10:00:00 +0000</pubDate>
            </item>
                    <item>
                <title>GPT-4.1 on GitHub Models</title>
                <description><![CDATA[<p>GitHub Models lets you hit GPT-4.1, Llama, Mistral, and xAI behind your GitHub PAT for free. Read the fine print: the free tier caps context at 8K/4K — even on GPT-4.1 — and tops out at 50 req/day on the big models. Wrong tool for production. Right tool for one-token multi-provider experiments.</p>
]]></description>
                                    <link>https://freetokens.dev/deals/github-models-gpt-41-apr-2026</link>
                    <guid isPermaLink="true">https://freetokens.dev/deals/github-models-gpt-41-apr-2026</guid>
                                <pubDate>Thu, 26 Mar 2026 15:45:00 +0000</pubDate>
            </item>
                    <item>
                <title>DeepSeek V3 on Hugging Face</title>
                <description><![CDATA[<p>HuggingFace's free Inference tier is ten cents a month. Yes, ten cents. It's a tasting flight across DeepSeek V3, Llama, Qwen routed through HF's Inference Providers — you get a handful of calls, then you either upgrade to PRO at $9/mo or bounce. At least it's honest about what it is.</p>
]]></description>
                                    <link>https://freetokens.dev/deals/hugging-face-deepseek-v3-apr-2026</link>
                    <guid isPermaLink="true">https://freetokens.dev/deals/hugging-face-deepseek-v3-apr-2026</guid>
                                <pubDate>Mon, 23 Mar 2026 18:00:00 +0000</pubDate>
            </item>
                    <item>
                <title>Command A on Cohere</title>
                <description><![CDATA[<p>Cohere's trial tier gives you Command A, Command R+, embeddings, and rerank for free. Non-commercial only, and the 1,000-calls-per-month hard cap will bite fast. Useless for an app — perfect for testing Cohere's rerank against your RAG pipeline before you commit.</p>
]]></description>
                                    <link>https://freetokens.dev/deals/cohere-command-a-apr-2026</link>
                    <guid isPermaLink="true">https://freetokens.dev/deals/cohere-command-a-apr-2026</guid>
                                <pubDate>Fri, 20 Mar 2026 11:30:00 +0000</pubDate>
            </item>
                    <item>
                <title>DeepHermes 3 8B Preview on Nous Research</title>
                <description><![CDATA[<p>Nous dropped their own portal with a $5 signup credit and an OpenAI-compatible endpoint. Get DeepHermes 3 8B, Hermes 3 70B, and Hermes 4 behind one key without paying upfront. Rate limits are a little flaky right now — that's the Nous energy, lean in.</p>
]]></description>
                                    <link>https://freetokens.dev/deals/nous-research-deephermes-3-8b-preview-apr-2026</link>
                    <guid isPermaLink="true">https://freetokens.dev/deals/nous-research-deephermes-3-8b-preview-apr-2026</guid>
                                <pubDate>Tue, 17 Mar 2026 11:00:00 +0000</pubDate>
            </item>
            </channel>
</rss>
