How to Get a Free Cloudflare Workers AI API Key (2026)
10 free models available — no credit card required. Get your Cloudflare Workers AI API key →
Overview
Edge AI inference — 10,000 neurons/day, 50+ models.
Cloudflare Workers AI runs open-weight models directly on Cloudflare's global edge network. The free tier allocates 10,000 Neurons (compute units) per day, supporting 50+ models including Llama, Mistral, Gemma, DeepSeek, and Qwen. Unlike other providers, billing is based on Neurons rather than tokens, making it hard to predict exact request counts. Ideal for low-latency edge deployments.
- 50+ models on the free tier
- 10,000 Neurons/day
- Global edge network for low latency
- Text, image, audio, and embedding models
API Compatibility: OpenAI SDK-compatible (via REST)
Quick Start Guide
- 1 Sign up at dash.cloudflare.com Free account. No credit card.
- 2 Go to Workers & Pages → AI
- 3 Create an API token with Workers AI permissions
- 4 Pick a model Llama 3.2 3B and Mistral 7B are reliable choices.
- 5 Configure OpenAI client Base URL: https://api.cloudflare.com/client/v4/accounts/YOUR_ACCOUNT_ID/ai/run
All Free Cloudflare Workers AI Models — Context Windows & Rate Limits
| Model | Context | Max Output | Modality | Rate Limit | Released | Status | |
|---|---|---|---|---|---|---|---|
| @cf/meta/llama-3.3-70b-instruct-fp8-fast | 131K | 131K | 10K neurons/day (shared) | — | Online | Details | |
| @cf/meta/llama-3.1-8b-instruct-fp8-fast | 131K | 131K | 10K neurons/day (shared) | — | Online | Details | |
| @cf/meta/llama-3.2-11b-vision-instruct | 131K | 131K | 10K neurons/day (shared) | — | Online | Details | |
| @cf/meta/llama-4-scout-17b-16e-instruct | 10.0M | 131K | 10K neurons/day (shared) | — | Online | Details | |
| @cf/mistralai/mistral-small-3.1-24b-instruct | 128K | 131K | 10K neurons/day (shared) | — | Online | Details | |
| @cf/google/gemma-4-26b-a4b-it | 256K | 131K | 10K neurons/day (shared) | — | Online | Details | |
| @cf/qwen/qwq-32b | 32K | 131K | 10K neurons/day (shared) | — | Online | Details | |
| @cf/deepseek-ai/deepseek-r1-distill-qwen-32b | 32K | 131K | 10K neurons/day (shared) | — | Online | Details | |
| Mistral 7B | 33K | 0 | See provider page | — | Online | Details | |
| Qwen 1.5 7B | 33K | 0 | See provider page | — | Online | Details |
Pricing & Limits
Credit Card Not required
Free Tier Permanently free
Context Range 32K – 10.0M
Total Models 10 free
Rate Limits 10K neurons/day (shared)
API Compatibility OpenAI SDK-compatible (via REST)
Use Cases
What Cloudflare Workers AI's free models are best for, based on aggregated model capabilities:
Limitations & Caveats
- Neurons billing is opaque — hard to predict exact request counts
- Model availability varies by Cloudflare region
- 10,000 Neurons/day shared across all models