How to Get a Free Cloudflare Workers AI API Key (2026)

10 free models available — no credit card required. Get your Cloudflare Workers AI API key →

Overview

Edge AI inference — 10,000 neurons/day, 50+ models.

Cloudflare Workers AI runs open-weight models directly on Cloudflare's global edge network. The free tier allocates 10,000 Neurons (compute units) per day, supporting 50+ models including Llama, Mistral, Gemma, DeepSeek, and Qwen. Unlike other providers, billing is based on Neurons rather than tokens, making it hard to predict exact request counts. Ideal for low-latency edge deployments.

  • 50+ models on the free tier
  • 10,000 Neurons/day
  • Global edge network for low latency
  • Text, image, audio, and embedding models

API Compatibility: OpenAI SDK-compatible (via REST)

Quick Start Guide

  1. 1
    Sign up at dash.cloudflare.com Free account. No credit card.
  2. 2
    Go to Workers & Pages → AI
  3. 3
    Create an API token with Workers AI permissions
  4. 4
    Pick a model Llama 3.2 3B and Mistral 7B are reliable choices.
  5. 5
    Configure OpenAI client Base URL: https://api.cloudflare.com/client/v4/accounts/YOUR_ACCOUNT_ID/ai/run

All Free Cloudflare Workers AI Models — Context Windows & Rate Limits

Model Context Max Output Modality Rate Limit Released Status
@cf/meta/llama-3.3-70b-instruct-fp8-fast 131K 131K text 10K neurons/day (shared) Online Details
@cf/meta/llama-3.1-8b-instruct-fp8-fast 131K 131K text 10K neurons/day (shared) Online Details
@cf/meta/llama-3.2-11b-vision-instruct 131K 131K textimage 10K neurons/day (shared) Online Details
@cf/meta/llama-4-scout-17b-16e-instruct 10.0M 131K text 10K neurons/day (shared) Online Details
@cf/mistralai/mistral-small-3.1-24b-instruct 128K 131K text 10K neurons/day (shared) Online Details
@cf/google/gemma-4-26b-a4b-it 256K 131K text 10K neurons/day (shared) Online Details
@cf/qwen/qwq-32b 32K 131K text 10K neurons/day (shared) Online Details
@cf/deepseek-ai/deepseek-r1-distill-qwen-32b 32K 131K text 10K neurons/day (shared) Online Details
Mistral 7B 33K 0 text See provider page Online Details
Qwen 1.5 7B 33K 0 text See provider page Online Details

Pricing & Limits

Credit Card Not required
Free Tier Permanently free
Context Range 32K – 10.0M
Total Models 10 free
Rate Limits 10K neurons/day (shared)
API Compatibility OpenAI SDK-compatible (via REST)

Use Cases

What Cloudflare Workers AI's free models are best for, based on aggregated model capabilities:

Chat 10 models Vision 1 model Reasoning 1 model

Limitations & Caveats

  • Neurons billing is opaque — hard to predict exact request counts
  • Model availability varies by Cloudflare region
  • 10,000 Neurons/day shared across all models
See our FAQ for common questions about free LLM APIs