44 models verified via live API · refreshed May 20, 2026how we verify
Provider Model Context Max Output Modality Rate Limit Released Weekly Tokens Status
OpenRouter Baidu Qianfan: CoBuddy (free)Verified 131K 66K text See provider page May 6, 2026 23.2B Online Details
OpenRouter Owl AlphaVerified 1.0M 262K text See provider page Apr 28, 2026 1.2T Online Details
OpenRouter NVIDIA: Nemotron 3 Nano Omni (free)Verified 256K 66K textimageaudio See provider page Apr 28, 2026 17.7B Online Details
OpenRouter Poolside: Laguna XS.2 (free)Verified 131K 8K text See provider page Apr 28, 2026 41.1B Online Details
OpenRouter Poolside: Laguna M.1 (free)Verified 131K 8K text See provider page Apr 28, 2026 247.3B Online Details
OpenRouter DeepSeek: DeepSeek V4 Flash (free)Verified 1.0M 384K text See provider page Apr 24, 2026 77.9B Online Details
NVIDIA NIM moonshotai/kimi-k2.6 262K 8K text Up to 40 RPM Apr 20, 2026 4.3B Unavailable Details
OpenRouter Z.ai: GLM 5.1Verified 203K 203K text See provider page Apr 7, 2026 412.3B Online Details
NVIDIA NIM z-ai/glm-5.1 203K 8K text Up to 40 RPM Apr 7, 2026 412.3B Unavailable Details
OpenRouter Google: Gemma 4 26B A4B (free)Verified 262K 33K textimage See provider page Apr 3, 2026 4.3B Online Details
OpenRouter Google: Gemma 4 31B (free)Verified 262K 33K textimage See provider page Apr 2, 2026 11.9B Online Details
OpenRouter Arcee AI: Trinity Large Thinking (free)Verified 262K 80K textreasoning See provider page Apr 1, 2026 45.4B Online Details
OpenRouter Google: Lyria 3 Pro PreviewVerified 1.0M 66K textimage See provider page Mar 30, 2026 4.7M Online Details
OpenRouter Google: Lyria 3 Clip PreviewVerified 1.0M 66K textimage See provider page Mar 30, 2026 3.3M Online Details
OpenRouter NVIDIA: Nemotron 3 Super (free)Verified 1.0M 262K text See provider page Mar 11, 2026 624.3B Online Details
NVIDIA NIM qwen/qwen3.5-122b-a10bVerified 262K 66K textimage Up to 40 RPM Feb 25, 2026 10.1B Online Details
OpenRouter NVIDIA: Llama Nemotron Embed VL 1B V2 (free)Verified 131K 8K textimageembeddings See provider page Feb 25, 2026 Online Details
NVIDIA NIM qwen/qwen3.5-397b-a17bVerified 262K 66K textimage Up to 40 RPM Feb 16, 2026 134.6B Online Details
OpenRouter MiniMax: MiniMax M2.5 (free)Verified 205K 8K text See provider page Feb 12, 2026 40.1B Online Details
OpenRouter Qwen: Qwen3 Coder 480B A35B (free)Verified 1.0M 262K textcode See provider page Feb 4, 2026 9.9B Online Details
OpenRouter Free Models RouterVerified 200K 8K textimage See provider page Feb 1, 2026 Online Details
OpenRouter LiquidAI: LFM2.5-1.2B-Thinking (free)Verified 33K 8K textreasoning See provider page Jan 20, 2026 1.1B Online Details
OpenRouter LiquidAI: LFM2.5-1.2B-Instruct (free)Verified 33K 8K text See provider page Jan 20, 2026 710.9M Online Details
OpenRouter NVIDIA: Nemotron 3 Nano 30B A3B (free)Verified 256K 8K text See provider page Dec 14, 2025 35.8B Online Details
OpenRouter NVIDIA: Nemotron Nano 12B 2 VL (free)Verified 128K 128K textimage See provider page Oct 28, 2025 10.5B Online Details
NVIDIA NIM nvidia/llama-3.3-nemotron-super-49b-v1.5Verified 131K 16K text Up to 40 RPM Oct 10, 2025 220.4M Online Details
OpenRouter Qwen: Qwen3 Next 80B A3B Instruct (free)Verified 262K 8K text See provider page Sep 11, 2025 905.6M Online Details
OpenRouter NVIDIA: Nemotron Nano 9B V2 (free)Verified 128K 8K text See provider page Sep 5, 2025 10.8B Online Details
OpenRouter OpenAI: gpt-oss-120b (free)Verified 131K 131K text See provider page Aug 5, 2025 145.5B Online Details
OpenRouter OpenAI: gpt-oss-20b (free)Verified 131K 8K text See provider page Aug 5, 2025 30.8B Online Details
OpenRouter Z.ai: GLM 4.5 Air (free)Verified 131K 96K text See provider page Jul 25, 2025 84.5B Online Details
OpenRouter Meta: Llama 3.3 70B Instruct (free)Verified 131K 8K text See provider page Dec 6, 2024 865.4M Online Details
OpenRouter Meta: Llama 3.2 3B Instruct (free)Verified 131K 8K text See provider page Sep 25, 2024 59.5M Online Details
OpenRouter Nous: Hermes 3 405B Instruct (free)Verified 131K 8K text See provider page Aug 16, 2024 62.8M Online Details
NVIDIA NIM mistralai/mistral-large-2-instruct 131K 8K text Up to 40 RPM Feb 26, 2024 667.6M Unavailable Details
Cloudflare Workers AI @cf/meta/llama-4-scout-17b-16e-instruct 10.0M 131K text 10K neurons/day (shared) Online Details
NVIDIA NIM deepseek-ai/deepseek-v4-flashVerified 1.0M 384K text Up to 40 RPM Online Details
Google Gemini Gemini 2.5 Flash 1.0M 65K text 10 RPM, 250 RPD Online Details
Google Gemini Gemini 2.5 Flash-Lite 1.0M 65K text 15 RPM, 1,000 RPD Online Details
GitHub Models gpt-4.1 1.0M 32K text 10 RPM, 50 RPD Online Details
GitHub Models gpt-4.1-mini 1.0M 32K text 15 RPM, 150 RPD Online Details
GitHub Models Llama-4-Scout-17B-16E 512K 4K text 15 RPM, 150 RPD Online Details
NVIDIA NIM stepfun-ai/step-3.5-flashVerified 262K 66K text Up to 40 RPM Online Details
Groq kimi-k2-instruct 262K 262K text 30 RPM, 14,400 RPD Online Details
Kilo Code nvidia/nemotron-3-super-120b-a12b:free 262K 32K text ~200 req/hr Online Details
OVHcloud AI Endpoints Qwen3-Coder-30B-A3B-Instruct 262K 32K textcode 2 RPM (anonymous) Online Details
Cohere Command A (111B) 256K 4K text 20 RPM Online Details
Mistral AI Mistral Small 4 256K 256K text ~1 RPS, 500K TPM Online Details
Mistral AI Mistral Large 3 256K 256K text ~1 RPS, 500K TPM Online Details
Mistral AI Codestral 256K 256K textcode ~1 RPS, 500K TPM Online Details
Cloudflare Workers AI @cf/google/gemma-4-26b-a4b-it 256K 131K text 10K neurons/day (shared) Online Details
GitHub Models Llama-4-Maverick-17B-128E 256K 4K text 10 RPM, 50 RPD Online Details
GitHub Models AI21 Jamba 1.5 Large 256K 0 text See provider page Online Details
NVIDIA NIM minimaxai/minimax-m2.7Verified 205K 131K text Up to 40 RPM Online Details
Z AI (Zhipu AI) GLM-4.7-Flash 200K 128K text 1 concurrent request Online Details
GitHub Models o3-mini 200K 100K text 10 RPM, 50 RPD Online Details
GitHub Models o4-mini 200K 100K text 10 RPM, 50 RPD Online Details
NVIDIA NIM meta/llama-guard-4-12bVerified 164K 16K textimage Up to 40 RPM Online Details
Cohere Embed 4 131K 131K text 2,000 inputs/min Online Details
Cohere Rerank 3.5 131K 131K text 10 RPM Online Details
Groq whisper-large-v3 131K 131K text 20 RPM, 2,000 RPD Online Details
Groq whisper-large-v3-turbo 131K 131K text 20 RPM, 2,000 RPD Online Details
Kilo Code bytedance-seed/dola-seed-2.0-pro:free 131K 131K text ~200 req/hr Online Details
Kilo Code x-ai/grok-code-fast-1:optimized:free 131K 131K textcode ~200 req/hr Online Details
Kilo Code arcee-ai/trinity-large-thinking:free 131K 131K text ~200 req/hr Online Details
LLM7.io deepseek-r1-0528 131K 131K text 30 RPM (120 with token) Online Details
LLM7.io deepseek-v3-0324 131K 131K text 30 RPM (120 with token) Online Details
LLM7.io gpt-4o-mini 131K 131K text 30 RPM (120 with token) Online Details
LLM7.io qwen2.5-coder-32b 131K 131K textcode 30 RPM (120 with token) Online Details
ModelScope Qwen/Qwen3.5-35B-A3B 131K 131K text 2,000 RPD total; <=500 RPD/model (dynamic) Online Details
ModelScope Qwen/Qwen3.5-27B 131K 131K text 2,000 RPD total; <=500 RPD/model (dynamic) Online Details
ModelScope Qwen/Qwen-Image 131K 131K text 2,000 RPD total; model/AIGC-specific caps Online Details
SiliconFlow deepseek-ai/DeepSeek-OCR 131K 8K text 1,000 RPM, 50K TPM Online Details
SiliconFlow Abbreviation 131K 8K text See provider page Online Details
NVIDIA NIM meta/llama-3.1-70b-instructVerified 131K 16K text Up to 40 RPM Online Details
NVIDIA NIM meta/llama-3.2-11b-vision-instructVerified 131K 16K textimage Up to 40 RPM Online Details
NVIDIA NIM meta/llama-3.2-1b-instructVerified 131K 60K text Up to 40 RPM Online Details
NVIDIA NIM meta/llama-3.2-3b-instructVerified 131K 8K text Up to 40 RPM Online Details
Chutes.ai DeepSeek-R1 131K 0 text Community-powered, no hard cap Online Details
Chutes.ai Llama 3.1 70B 131K 0 text Community-powered, no hard cap Online Details
Glhf.chat Llama 3.1 70B 131K 0 text Unlimited for free models Online Details
Groq Moonshot Kimi K2 131K 0 text See provider page Online Details
Groq Moonshot Kimi K2 0905 131K 0 text See provider page Online Details
Groq GPT-OSS 120B 131K 0 text See provider page Online Details
Groq GPT-OSS 20B 131K 0 text See provider page Online Details
Groq GPT-OSS Safeguard 20B 131K 0 text See provider page Online Details
GitHub Models Phi-4 131K 0 text See provider page Online Details
GitHub Models Mistral Large (24.11) 131K 0 text See provider page Online Details
Cerebras Llama 3.1 70B 131K 0 text See provider page Online Details
Cerebras qwen-3-235b-a22b-instruct-2507 131K 8K text 30 RPM, 14,400 RPD, 1M TPD Online Details
Cloudflare Workers AI @cf/meta/llama-3.3-70b-instruct-fp8-fast 131K 131K text 10K neurons/day (shared) Online Details
Cloudflare Workers AI @cf/meta/llama-3.1-8b-instruct-fp8-fast 131K 131K text 10K neurons/day (shared) Online Details
Cloudflare Workers AI @cf/meta/llama-3.2-11b-vision-instruct 131K 131K textimage 10K neurons/day (shared) Online Details
GitHub Models Meta-Llama-3.3-70B 131K 4K text 15 RPM, 150 RPD Online Details
Groq llama-3.3-70b-versatile 131K 32K text 30 RPM, 14,400 RPD Online Details
Groq llama-3.1-8b-instant 131K 131K text 30 RPM, 14,400 RPD Online Details
Groq llama-4-scout-17b-16e-instruct 131K 8K text 30 RPM, 14,400 RPD Online Details
Groq llama-4-maverick-17b-128e-instruct 131K 8K text 15 RPM, 500 RPD Online Details
Groq qwen3-32b 131K 131K text 30 RPM, 14,400 RPD Online Details
Groq deepseek-r1-distill-70b 131K 8K text 30 RPM, 14,400 RPD Online Details
Hugging Face Qwen2.5-7B-Instruct 131K 4K text ~1,000 RPD Online Details
OVHcloud AI Endpoints Meta-Llama-3_3-70B-Instruct 131K 4K text 2 RPM (anonymous) Online Details
OVHcloud AI Endpoints DeepSeek-R1-Distill-Llama-70B 131K 32K text 2 RPM (anonymous) Online Details
SiliconFlow deepseek-ai/DeepSeek-R1-Distill-Qwen-7B 131K 131K text 1,000 RPM, 50K TPM Online Details
Cohere Command R+ 128K 4K text 20 RPM Online Details
Cohere Command R7B 128K 4K text 20 RPM Online Details
Mistral AI Mistral Medium 3 128K 128K text ~1 RPS, 500K TPM Online Details
Mistral AI Mistral Nemo (12B) 128K 128K text ~1 RPS, 500K TPM Online Details
Mistral AI Pixtral Large 128K 128K textimage ~1 RPS, 500K TPM Online Details
Z AI (Zhipu AI) GLM-4.5-Flash 128K 8K text 1 concurrent request Online Details
Z AI (Zhipu AI) GLM-4.6V-Flash 128K 4K text 1 concurrent request Online Details
Cerebras llama3.1-8b 128K 8K text 30 RPM, 14,400 RPD, 1M TPD Online Details
Cerebras gpt-oss-120b 128K 8K text 30 RPM, 14,400 RPD, 1M TPD Online Details
Cerebras zai-glm-4.7 128K 8K text 10 RPM, 100 RPD, 1M TPD Online Details
Cloudflare Workers AI @cf/mistralai/mistral-small-3.1-24b-instruct 128K 131K text 10K neurons/day (shared) Online Details
GitHub Models gpt-4o 128K 16K text 10 RPM, 50 RPD Online Details
GitHub Models Mistral-Small-3.1 128K 4K text 15 RPM, 150 RPD Online Details
Hugging Face Meta-Llama-3.1-8B-Instruct 128K 4K text ~1,000 RPD Online Details
Hugging Face Phi-3.5-mini-instruct 128K 4K text ~1,000 RPD Online Details
Ollama Cloud llama3.1:cloud 128K 131K text Session/weekly limits (unpublished) Online Details
Ollama Cloud deepseek-r1:cloud 128K 131K text Session/weekly limits (unpublished) Online Details
Ollama Cloud qwen2.5:cloud 128K 131K text Session/weekly limits (unpublished) Online Details
OVHcloud AI Endpoints Qwen2.5-VL-72B-Instruct 128K 8K textimage 2 RPM (anonymous) Online Details
OVHcloud AI Endpoints Mistral-Nemo-Instruct-2407 128K 4K text 2 RPM (anonymous) Online Details
SiliconFlow THUDM/GLM-4.1V-9B-Thinking 66K 66K text 1,000 RPM, 50K TPM Online Details
GitHub Models DeepSeek-R1 64K 8K text 15 RPM, 150 RPD Online Details
SiliconFlow deepseek-ai/DeepSeek-R1-0528-Qwen3-8B 33K 16K text 1,000 RPM, 50K TPM Online Details
OpenRouter Venice: Uncensored (free)Verified 33K 8K text See provider page Online Details
Glhf.chat Mixtral 8x7B 33K 0 text Unlimited for free models Online Details
Mistral AI Mistral 7B 33K 0 text See provider page Online Details
Mistral AI Mixtral 8x7B 33K 0 text See provider page Online Details
Cloudflare Workers AI Mistral 7B 33K 0 text See provider page Online Details
Cloudflare Workers AI Qwen 1.5 7B 33K 0 text See provider page Online Details
Cloudflare Workers AI @cf/qwen/qwq-32b 32K 131K text 10K neurons/day (shared) Online Details
Cloudflare Workers AI @cf/deepseek-ai/deepseek-r1-distill-qwen-32b 32K 131K text 10K neurons/day (shared) Online Details
Hugging Face Mistral-7B-Instruct-v0.3 32K 4K text ~1,000 RPD Online Details
Hugging Face Mixtral-8x7B-Instruct-v0.1 32K 4K text ~1,000 RPD Online Details
LLM7.io mistral-small-3.1-24b 32K 131K text 30 RPM (120 with token) Online Details
Ollama Cloud mistral:cloud 32K 131K text Session/weekly limits (unpublished) Online Details
OVHcloud AI Endpoints Qwen3Guard-Gen-8B 32K 4K text 2 RPM (anonymous) Online Details
OVHcloud AI Endpoints Qwen3Guard-Gen-0.6B 32K 4K text 2 RPM (anonymous) Online Details
SiliconFlow THUDM/glm-4-9b-chat 32K 32K text 1,000 RPM, 50K TPM Online Details
Ollama Cloud gemma2:cloud 8K 131K text Session/weekly limits (unpublished) Online Details
Grok (xAI) Grok-2 131K 0 text $25/month free credits, resets monthly Online Details
Grok (xAI) Grok-2 Mini 131K 0 text $25/month free credits, resets monthly Online Details
NVIDIA NIM deepseek-ai/deepseek-v4-pro 131K 8K text Up to 40 RPM Unavailable Details
NVIDIA NIM nvidia/llama-3.1-nemotron-ultra-253b-v1 131K 8K text Up to 40 RPM Unavailable Details

How to Use Free LLM API Resources

  1. Pick a model — Click any model name to see details, rate limits, and API key signup link.
  2. Get your API key — Sign up on the provider's website (most require no credit card).
  3. Copy the config — Go to the Config Generator, pick your tool and backend, copy the ready-to-use snippet.
  4. Test it — Use the Playground to test your API key before integrating.

New to LLM terminology? Check the Glossary — 22 terms explained in plain English →

See our FAQ for common questions about free LLM APIs