Verified Free LLM APIs — 147+ Tested Models, No Credit Card

44 models verified via live API · refreshed May 20, 2026 — how we verify

Free Only No Credit Card Required Verified only

Provider	Model	Context	Max Output	Modality	Rate Limit	Released	Weekly Tokens	Status
OpenRouter	Baidu Qianfan: CoBuddy (free)Verified	131K	66K	text	See provider page	May 6, 2026	23.2B	Online	Details
OpenRouter	Owl AlphaVerified	1.0M	262K	text	See provider page	Apr 28, 2026	1.2T	Online	Details
OpenRouter	NVIDIA: Nemotron 3 Nano Omni (free)Verified	256K	66K	textimageaudio	See provider page	Apr 28, 2026	17.7B	Online	Details
OpenRouter	Poolside: Laguna XS.2 (free)Verified	131K	8K	text	See provider page	Apr 28, 2026	41.1B	Online	Details
OpenRouter	Poolside: Laguna M.1 (free)Verified	131K	8K	text	See provider page	Apr 28, 2026	247.3B	Online	Details
OpenRouter	DeepSeek: DeepSeek V4 Flash (free)Verified	1.0M	384K	text	See provider page	Apr 24, 2026	77.9B	Online	Details
NVIDIA NIM	moonshotai/kimi-k2.6	262K	8K	text	Up to 40 RPM	Apr 20, 2026	4.3B	Unavailable	Details
OpenRouter	Z.ai: GLM 5.1Verified	203K	203K	text	See provider page	Apr 7, 2026	412.3B	Online	Details
NVIDIA NIM	z-ai/glm-5.1	203K	8K	text	Up to 40 RPM	Apr 7, 2026	412.3B	Unavailable	Details
OpenRouter	Google: Gemma 4 26B A4B (free)Verified	262K	33K	textimage	See provider page	Apr 3, 2026	4.3B	Online	Details
OpenRouter	Google: Gemma 4 31B (free)Verified	262K	33K	textimage	See provider page	Apr 2, 2026	11.9B	Online	Details
OpenRouter	Arcee AI: Trinity Large Thinking (free)Verified	262K	80K	textreasoning	See provider page	Apr 1, 2026	45.4B	Online	Details
OpenRouter	Google: Lyria 3 Pro PreviewVerified	1.0M	66K	textimage	See provider page	Mar 30, 2026	4.7M	Online	Details
OpenRouter	Google: Lyria 3 Clip PreviewVerified	1.0M	66K	textimage	See provider page	Mar 30, 2026	3.3M	Online	Details
OpenRouter	NVIDIA: Nemotron 3 Super (free)Verified	1.0M	262K	text	See provider page	Mar 11, 2026	624.3B	Online	Details
NVIDIA NIM	qwen/qwen3.5-122b-a10bVerified	262K	66K	textimage	Up to 40 RPM	Feb 25, 2026	10.1B	Online	Details
OpenRouter	NVIDIA: Llama Nemotron Embed VL 1B V2 (free)Verified	131K	8K	textimageembeddings	See provider page	Feb 25, 2026	—	Online	Details
NVIDIA NIM	qwen/qwen3.5-397b-a17bVerified	262K	66K	textimage	Up to 40 RPM	Feb 16, 2026	134.6B	Online	Details
OpenRouter	MiniMax: MiniMax M2.5 (free)Verified	205K	8K	text	See provider page	Feb 12, 2026	40.1B	Online	Details
OpenRouter	Qwen: Qwen3 Coder 480B A35B (free)Verified	1.0M	262K	textcode	See provider page	Feb 4, 2026	9.9B	Online	Details
OpenRouter	Free Models RouterVerified	200K	8K	textimage	See provider page	Feb 1, 2026	—	Online	Details
OpenRouter	LiquidAI: LFM2.5-1.2B-Thinking (free)Verified	33K	8K	textreasoning	See provider page	Jan 20, 2026	1.1B	Online	Details
OpenRouter	LiquidAI: LFM2.5-1.2B-Instruct (free)Verified	33K	8K	text	See provider page	Jan 20, 2026	710.9M	Online	Details
OpenRouter	NVIDIA: Nemotron 3 Nano 30B A3B (free)Verified	256K	8K	text	See provider page	Dec 14, 2025	35.8B	Online	Details
OpenRouter	NVIDIA: Nemotron Nano 12B 2 VL (free)Verified	128K	128K	textimage	See provider page	Oct 28, 2025	10.5B	Online	Details
NVIDIA NIM	nvidia/llama-3.3-nemotron-super-49b-v1.5Verified	131K	16K	text	Up to 40 RPM	Oct 10, 2025	220.4M	Online	Details
OpenRouter	Qwen: Qwen3 Next 80B A3B Instruct (free)Verified	262K	8K	text	See provider page	Sep 11, 2025	905.6M	Online	Details
OpenRouter	NVIDIA: Nemotron Nano 9B V2 (free)Verified	128K	8K	text	See provider page	Sep 5, 2025	10.8B	Online	Details
OpenRouter	OpenAI: gpt-oss-120b (free)Verified	131K	131K	text	See provider page	Aug 5, 2025	145.5B	Online	Details
OpenRouter	OpenAI: gpt-oss-20b (free)Verified	131K	8K	text	See provider page	Aug 5, 2025	30.8B	Online	Details
OpenRouter	Z.ai: GLM 4.5 Air (free)Verified	131K	96K	text	See provider page	Jul 25, 2025	84.5B	Online	Details
OpenRouter	Meta: Llama 3.3 70B Instruct (free)Verified	131K	8K	text	See provider page	Dec 6, 2024	865.4M	Online	Details
OpenRouter	Meta: Llama 3.2 3B Instruct (free)Verified	131K	8K	text	See provider page	Sep 25, 2024	59.5M	Online	Details
OpenRouter	Nous: Hermes 3 405B Instruct (free)Verified	131K	8K	text	See provider page	Aug 16, 2024	62.8M	Online	Details
NVIDIA NIM	mistralai/mistral-large-2-instruct	131K	8K	text	Up to 40 RPM	Feb 26, 2024	667.6M	Unavailable	Details
Cloudflare Workers AI	@cf/meta/llama-4-scout-17b-16e-instruct	10.0M	131K	text	10K neurons/day (shared)	—	—	Online	Details
NVIDIA NIM	deepseek-ai/deepseek-v4-flashVerified	1.0M	384K	text	Up to 40 RPM	—	—	Online	Details
Google Gemini	Gemini 2.5 Flash	1.0M	65K	text	10 RPM, 250 RPD	—	—	Online	Details
Google Gemini	Gemini 2.5 Flash-Lite	1.0M	65K	text	15 RPM, 1,000 RPD	—	—	Online	Details
GitHub Models	gpt-4.1	1.0M	32K	text	10 RPM, 50 RPD	—	—	Online	Details
GitHub Models	gpt-4.1-mini	1.0M	32K	text	15 RPM, 150 RPD	—	—	Online	Details
GitHub Models	Llama-4-Scout-17B-16E	512K	4K	text	15 RPM, 150 RPD	—	—	Online	Details
NVIDIA NIM	stepfun-ai/step-3.5-flashVerified	262K	66K	text	Up to 40 RPM	—	—	Online	Details
Groq	kimi-k2-instruct	262K	262K	text	30 RPM, 14,400 RPD	—	—	Online	Details
Kilo Code	nvidia/nemotron-3-super-120b-a12b:free	262K	32K	text	~200 req/hr	—	—	Online	Details
OVHcloud AI Endpoints	Qwen3-Coder-30B-A3B-Instruct	262K	32K	textcode	2 RPM (anonymous)	—	—	Online	Details
Cohere	Command A (111B)	256K	4K	text	20 RPM	—	—	Online	Details
Mistral AI	Mistral Small 4	256K	256K	text	~1 RPS, 500K TPM	—	—	Online	Details
Mistral AI	Mistral Large 3	256K	256K	text	~1 RPS, 500K TPM	—	—	Online	Details
Mistral AI	Codestral	256K	256K	textcode	~1 RPS, 500K TPM	—	—	Online	Details
Cloudflare Workers AI	@cf/google/gemma-4-26b-a4b-it	256K	131K	text	10K neurons/day (shared)	—	—	Online	Details
GitHub Models	Llama-4-Maverick-17B-128E	256K	4K	text	10 RPM, 50 RPD	—	—	Online	Details
GitHub Models	AI21 Jamba 1.5 Large	256K	0	text	See provider page	—	—	Online	Details
NVIDIA NIM	minimaxai/minimax-m2.7Verified	205K	131K	text	Up to 40 RPM	—	—	Online	Details
Z AI (Zhipu AI)	GLM-4.7-Flash	200K	128K	text	1 concurrent request	—	—	Online	Details
GitHub Models	o3-mini	200K	100K	text	10 RPM, 50 RPD	—	—	Online	Details
GitHub Models	o4-mini	200K	100K	text	10 RPM, 50 RPD	—	—	Online	Details
NVIDIA NIM	meta/llama-guard-4-12bVerified	164K	16K	textimage	Up to 40 RPM	—	—	Online	Details
Cohere	Embed 4	131K	131K	text	2,000 inputs/min	—	—	Online	Details
Cohere	Rerank 3.5	131K	131K	text	10 RPM	—	—	Online	Details
Groq	whisper-large-v3	131K	131K	text	20 RPM, 2,000 RPD	—	—	Online	Details
Groq	whisper-large-v3-turbo	131K	131K	text	20 RPM, 2,000 RPD	—	—	Online	Details
Kilo Code	bytedance-seed/dola-seed-2.0-pro:free	131K	131K	text	~200 req/hr	—	—	Online	Details
Kilo Code	x-ai/grok-code-fast-1:optimized:free	131K	131K	textcode	~200 req/hr	—	—	Online	Details
Kilo Code	arcee-ai/trinity-large-thinking:free	131K	131K	text	~200 req/hr	—	—	Online	Details
LLM7.io	deepseek-r1-0528	131K	131K	text	30 RPM (120 with token)	—	—	Online	Details
LLM7.io	deepseek-v3-0324	131K	131K	text	30 RPM (120 with token)	—	—	Online	Details
LLM7.io	gpt-4o-mini	131K	131K	text	30 RPM (120 with token)	—	—	Online	Details
LLM7.io	qwen2.5-coder-32b	131K	131K	textcode	30 RPM (120 with token)	—	—	Online	Details
ModelScope	Qwen/Qwen3.5-35B-A3B	131K	131K	text	2,000 RPD total; <=500 RPD/model (dynamic)	—	—	Online	Details
ModelScope	Qwen/Qwen3.5-27B	131K	131K	text	2,000 RPD total; <=500 RPD/model (dynamic)	—	—	Online	Details
ModelScope	Qwen/Qwen-Image	131K	131K	text	2,000 RPD total; model/AIGC-specific caps	—	—	Online	Details
SiliconFlow	deepseek-ai/DeepSeek-OCR	131K	8K	text	1,000 RPM, 50K TPM		—	Online	Details
SiliconFlow	Abbreviation	131K	8K	text	See provider page	—	—	Online	Details
NVIDIA NIM	meta/llama-3.1-70b-instructVerified	131K	16K	text	Up to 40 RPM	—	—	Online	Details
NVIDIA NIM	meta/llama-3.2-11b-vision-instructVerified	131K	16K	textimage	Up to 40 RPM	—	—	Online	Details
NVIDIA NIM	meta/llama-3.2-1b-instructVerified	131K	60K	text	Up to 40 RPM	—	—	Online	Details
NVIDIA NIM	meta/llama-3.2-3b-instructVerified	131K	8K	text	Up to 40 RPM	—	—	Online	Details
Chutes.ai	DeepSeek-R1	131K	0	text	Community-powered, no hard cap	—	—	Online	Details
Chutes.ai	Llama 3.1 70B	131K	0	text	Community-powered, no hard cap	—	—	Online	Details
Glhf.chat	Llama 3.1 70B	131K	0	text	Unlimited for free models	—	—	Online	Details
Groq	Moonshot Kimi K2	131K	0	text	See provider page	—	—	Online	Details
Groq	Moonshot Kimi K2 0905	131K	0	text	See provider page	—	—	Online	Details
Groq	GPT-OSS 120B	131K	0	text	See provider page	—	—	Online	Details
Groq	GPT-OSS 20B	131K	0	text	See provider page	—	—	Online	Details
Groq	GPT-OSS Safeguard 20B	131K	0	text	See provider page	—	—	Online	Details
GitHub Models	Phi-4	131K	0	text	See provider page	—	—	Online	Details
GitHub Models	Mistral Large (24.11)	131K	0	text	See provider page	—	—	Online	Details
Cerebras	Llama 3.1 70B	131K	0	text	See provider page	—	—	Online	Details
Cerebras	qwen-3-235b-a22b-instruct-2507	131K	8K	text	30 RPM, 14,400 RPD, 1M TPD	—	—	Online	Details
Cloudflare Workers AI	@cf/meta/llama-3.3-70b-instruct-fp8-fast	131K	131K	text	10K neurons/day (shared)	—	—	Online	Details
Cloudflare Workers AI	@cf/meta/llama-3.1-8b-instruct-fp8-fast	131K	131K	text	10K neurons/day (shared)	—	—	Online	Details
Cloudflare Workers AI	@cf/meta/llama-3.2-11b-vision-instruct	131K	131K	textimage	10K neurons/day (shared)	—	—	Online	Details
GitHub Models	Meta-Llama-3.3-70B	131K	4K	text	15 RPM, 150 RPD	—	—	Online	Details
Groq	llama-3.3-70b-versatile	131K	32K	text	30 RPM, 14,400 RPD	—	—	Online	Details
Groq	llama-3.1-8b-instant	131K	131K	text	30 RPM, 14,400 RPD	—	—	Online	Details
Groq	llama-4-scout-17b-16e-instruct	131K	8K	text	30 RPM, 14,400 RPD	—	—	Online	Details
Groq	llama-4-maverick-17b-128e-instruct	131K	8K	text	15 RPM, 500 RPD	—	—	Online	Details
Groq	qwen3-32b	131K	131K	text	30 RPM, 14,400 RPD	—	—	Online	Details
Groq	deepseek-r1-distill-70b	131K	8K	text	30 RPM, 14,400 RPD	—	—	Online	Details
Hugging Face	Qwen2.5-7B-Instruct	131K	4K	text	~1,000 RPD	—	—	Online	Details
OVHcloud AI Endpoints	Meta-Llama-3_3-70B-Instruct	131K	4K	text	2 RPM (anonymous)	—	—	Online	Details
OVHcloud AI Endpoints	DeepSeek-R1-Distill-Llama-70B	131K	32K	text	2 RPM (anonymous)	—	—	Online	Details
SiliconFlow	deepseek-ai/DeepSeek-R1-Distill-Qwen-7B	131K	131K	text	1,000 RPM, 50K TPM	—	—	Online	Details
Cohere	Command R+	128K	4K	text	20 RPM	—	—	Online	Details
Cohere	Command R7B	128K	4K	text	20 RPM	—	—	Online	Details
Mistral AI	Mistral Medium 3	128K	128K	text	~1 RPS, 500K TPM	—	—	Online	Details
Mistral AI	Mistral Nemo (12B)	128K	128K	text	~1 RPS, 500K TPM	—	—	Online	Details
Mistral AI	Pixtral Large	128K	128K	textimage	~1 RPS, 500K TPM	—	—	Online	Details
Z AI (Zhipu AI)	GLM-4.5-Flash	128K	8K	text	1 concurrent request	—	—	Online	Details
Z AI (Zhipu AI)	GLM-4.6V-Flash	128K	4K	text	1 concurrent request	—	—	Online	Details
Cerebras	llama3.1-8b	128K	8K	text	30 RPM, 14,400 RPD, 1M TPD	—	—	Online	Details
Cerebras	gpt-oss-120b	128K	8K	text	30 RPM, 14,400 RPD, 1M TPD	—	—	Online	Details
Cerebras	zai-glm-4.7	128K	8K	text	10 RPM, 100 RPD, 1M TPD	—	—	Online	Details
Cloudflare Workers AI	@cf/mistralai/mistral-small-3.1-24b-instruct	128K	131K	text	10K neurons/day (shared)	—	—	Online	Details
GitHub Models	gpt-4o	128K	16K	text	10 RPM, 50 RPD	—	—	Online	Details
GitHub Models	Mistral-Small-3.1	128K	4K	text	15 RPM, 150 RPD	—	—	Online	Details
Hugging Face	Meta-Llama-3.1-8B-Instruct	128K	4K	text	~1,000 RPD	—	—	Online	Details
Hugging Face	Phi-3.5-mini-instruct	128K	4K	text	~1,000 RPD	—	—	Online	Details
Ollama Cloud	llama3.1:cloud	128K	131K	text	Session/weekly limits (unpublished)	—	—	Online	Details
Ollama Cloud	deepseek-r1:cloud	128K	131K	text	Session/weekly limits (unpublished)	—	—	Online	Details
Ollama Cloud	qwen2.5:cloud	128K	131K	text	Session/weekly limits (unpublished)	—	—	Online	Details
OVHcloud AI Endpoints	Qwen2.5-VL-72B-Instruct	128K	8K	textimage	2 RPM (anonymous)	—	—	Online	Details
OVHcloud AI Endpoints	Mistral-Nemo-Instruct-2407	128K	4K	text	2 RPM (anonymous)	—	—	Online	Details
SiliconFlow	THUDM/GLM-4.1V-9B-Thinking	66K	66K	text	1,000 RPM, 50K TPM	—	—	Online	Details
GitHub Models	DeepSeek-R1	64K	8K	text	15 RPM, 150 RPD	—	—	Online	Details
SiliconFlow	deepseek-ai/DeepSeek-R1-0528-Qwen3-8B	33K	16K	text	1,000 RPM, 50K TPM	—	—	Online	Details
OpenRouter	Venice: Uncensored (free)Verified	33K	8K	text	See provider page	—	—	Online	Details
Glhf.chat	Mixtral 8x7B	33K	0	text	Unlimited for free models	—	—	Online	Details
Mistral AI	Mistral 7B	33K	0	text	See provider page	—	—	Online	Details
Mistral AI	Mixtral 8x7B	33K	0	text	See provider page	—	—	Online	Details
Cloudflare Workers AI	Mistral 7B	33K	0	text	See provider page	—	—	Online	Details
Cloudflare Workers AI	Qwen 1.5 7B	33K	0	text	See provider page	—	—	Online	Details
Cloudflare Workers AI	@cf/qwen/qwq-32b	32K	131K	text	10K neurons/day (shared)	—	—	Online	Details
Cloudflare Workers AI	@cf/deepseek-ai/deepseek-r1-distill-qwen-32b	32K	131K	text	10K neurons/day (shared)	—	—	Online	Details
Hugging Face	Mistral-7B-Instruct-v0.3	32K	4K	text	~1,000 RPD	—	—	Online	Details
Hugging Face	Mixtral-8x7B-Instruct-v0.1	32K	4K	text	~1,000 RPD	—	—	Online	Details
LLM7.io	mistral-small-3.1-24b	32K	131K	text	30 RPM (120 with token)	—	—	Online	Details
Ollama Cloud	mistral:cloud	32K	131K	text	Session/weekly limits (unpublished)	—	—	Online	Details
OVHcloud AI Endpoints	Qwen3Guard-Gen-8B	32K	4K	text	2 RPM (anonymous)	—	—	Online	Details
OVHcloud AI Endpoints	Qwen3Guard-Gen-0.6B	32K	4K	text	2 RPM (anonymous)	—	—	Online	Details
SiliconFlow	THUDM/glm-4-9b-chat	32K	32K	text	1,000 RPM, 50K TPM	—	—	Online	Details
Ollama Cloud	gemma2:cloud	8K	131K	text	Session/weekly limits (unpublished)	—	—	Online	Details
Grok (xAI)	Grok-2	131K	0	text	$25/month free credits, resets monthly	—	—	Online	Details
Grok (xAI)	Grok-2 Mini	131K	0	text	$25/month free credits, resets monthly	—	—	Online	Details
NVIDIA NIM	deepseek-ai/deepseek-v4-pro	131K	8K	text	Up to 40 RPM	—	—	Unavailable	Details
NVIDIA NIM	nvidia/llama-3.1-nemotron-ultra-253b-v1	131K	8K	text	Up to 40 RPM	—	—	Unavailable	Details

How to Use Free LLM API Resources

Pick a model — Click any model name to see details, rate limits, and API key signup link.
Get your API key — Sign up on the provider's website (most require no credit card).
Copy the config — Go to the Config Generator, pick your tool and backend, copy the ready-to-use snippet.
Test it — Use the Playground to test your API key before integrating.

New to LLM terminology? Check the Glossary — 22 terms explained in plain English →

See our FAQ for common questions about free LLM APIs

Free LLM API Models — Browse & Filter 147+ Models

How to Use Free LLM API Resources