GPT-4.1
free_tier
Free tier: 10 RPM, 50 requests/day on high-tier models (GPT-4.1)
status
FOREVER
rate_limit
High-tier: 10 RPM / 50 RPD / 2 concurrent. Low-tier: 15 RPM / 150 RPD / 5 concurrent. Token caps: 8K input / 4K output per request.
restrictions
Free GitHub account + PAT required. In public preview. Free tier caps context at 8K input / 4K output regardless of model capability — GPT-4.1 is served with a drastically reduced context window.
[PERMANENT]
[API]
Our take
GitHub Models lets you hit GPT-4.1, Llama, Mistral, and xAI behind your GitHub PAT for free. Read the fine print: the free tier caps context at 8K/4K — even on GPT-4.1 — and tops out at 50 req/day on the big models. Wrong tool for production. Right tool for one-token multi-provider experiments.
Mar 26, 2026