freetokens.dev

Cerebras

Provider

Wafer-scale inference engine. Serves frontier open-weight models at 2,000+ tokens/sec.

https://cerebras.ai

Active deals (4)

GLM 4.7
Free preview: 1M tokens/day, 10 RPM, 60K TPM. Daily request cap: 100
FOREVER
GPT-OSS 120B
Free tier: 1M tokens/day, 30 RPM, 64K TPM
FOREVER
Llama 3.1 8B
Free tier: 1M tokens/day, 30 RPM, 60K TPM
FOREVER
Qwen 3 235B A22B
Free tier: 1M tokens/day, 30 RPM, 60K TPM
FOREVER
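
To get a feel for how these limits interact, here is a rough sketch that works out how quickly each free tier runs dry at full throttle. The numbers are copied from the listings above; the class, field, and function names are made up for illustration and are not part of any Cerebras API. It assumes "1M" means 1,000,000 and "60K" means 60,000, and that the daily request cap applies only to GLM 4.7, since it is the only deal that lists one.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class FreeTierLimits:
    """Limits as shown in the listing (hypothetical container, not an official type)."""
    tokens_per_day: int
    requests_per_minute: int
    tokens_per_minute: int
    requests_per_day: Optional[int] = None  # only listed for GLM 4.7


# Assumes decimal units: 1M = 1_000_000, 60K = 60_000.
LIMITS = {
    "GLM 4.7": FreeTierLimits(1_000_000, 10, 60_000, requests_per_day=100),
    "GPT-OSS 120B": FreeTierLimits(1_000_000, 30, 64_000),
    "Llama 3.1 8B": FreeTierLimits(1_000_000, 30, 60_000),
    "Qwen 3 235B A22B": FreeTierLimits(1_000_000, 30, 60_000),
}


def minutes_to_exhaust_tokens(limits: FreeTierLimits) -> float:
    """Minutes until the daily token budget is gone when sending at the full TPM rate."""
    return limits.tokens_per_day / limits.tokens_per_minute


def minutes_to_exhaust_requests(limits: FreeTierLimits) -> Optional[float]:
    """Minutes until the daily request cap is hit at the full RPM rate, if a cap is listed."""
    if limits.requests_per_day is None:
        return None
    return limits.requests_per_day / limits.requests_per_minute


def avg_tokens_per_request(limits: FreeTierLimits) -> int:
    """Average request size (in tokens) that uses up RPM and TPM at the same time."""
    return limits.tokens_per_minute // limits.requests_per_minute


if __name__ == "__main__":
    for model, limits in LIMITS.items():
        print(
            model,
            round(minutes_to_exhaust_tokens(limits), 1),
            minutes_to_exhaust_requests(limits),
            avg_tokens_per_request(limits),
        )
```

Under those assumptions, the 1M daily token budget lasts roughly 15 to 17 minutes at the maximum per-minute rate, and the GLM 4.7 request cap (100 per day at 10 RPM) can be used up in about 10 minutes. How the provider actually meters and resets these limits is not stated here, so treat this as back-of-the-envelope math only.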
© 2026 freetokens.dev
// built by degenerates who hate paying for inference