gpt-oss-120b — Free AI Model & API
cerebras/gpt-oss-120b Overview
Cerebras offers GPT-oss-120b, a powerful text-based LLM ideal for chat and coding, generating up to 8,000 tokens from 128,000-token contexts at 30 RPM, 14,400 RPD, and 1M TPD, without requiring a credit card and compatible with OpenAI.
Quick Start
Integrate gpt-oss-120b with 3 lines of code. See the config generator for Claude Code, Cursor, and more.
Other Free Models from Cerebras
Rate Limits & Constraints
Cerebras Platform Limitations
- 8K context window on free tier (vs 128K on paid)
- Limited model selection — Llama and GPT-OSS only
- 1M tokens/day shared across models
Features & Use Cases
Best For
Modality Support
Cerebras Highlights
- Ultra-fast inference on WSE chips
- 1M tokens/day free
- No credit card required
- Llama 3.1 8B + GPT-OSS 120B available
Playground — Test gpt-oss-120b
Test gpt-oss-120b directly in your browser. Your API key is sent directly to Cerebras — never stored.
🔒 Your key is never stored — sent directly to the model provider via our server proxy.
Ready to chat with gpt-oss-120b.
Frequently Asked Questions
How do I get an API key for gpt-oss-120b?
Sign up at Cerebras to get your API key. No credit card is required — just an email sign-up. Once you have the key, use the code snippets in the Quick Start section above.
Is gpt-oss-120b really free?
Yes. gpt-oss-120b is available on Cerebras's free tier and has been free since May 10, 2026. Rate limits apply: 30 RPM, 14,400 RPD, 1M TPD. Always check the provider's terms for any changes to the free tier.
What are gpt-oss-120b's rate limits?
30 RPM, 14,400 RPD, 1M TPD Context window: 128K. Max output: 8K. No credit card required.
What are the best free alternatives to gpt-oss-120b?
Popular free alternatives include inclusionAI: Ring-2.6-1T, Baidu Qianfan: CoBuddy (free), Owl Alpha. You can also browse all 147+ free models on our site.
More questions? See our full FAQ →