Phi-4 — Free AI Model & API
github-models/phi-4 Overview
Quick Start
Integrate Phi-4 with 3 lines of code. See the config generator for Claude Code, Cursor, and more.
Other Free Models from GitHub Models
Rate Limits & Constraints
GitHub Models Platform Limitations
- Low per-request token limits (8K input / 4K output)
- Rate limits tied to GitHub Copilot subscription tier
- Not suitable for large-context or long-generation tasks
Features & Use Cases
Best For
Modality Support
GitHub Models Highlights
- 45+ models including GPT-4.1 and o3
- Free for all GitHub accounts
- Includes Llama 4, DeepSeek-R1, Mistral
- Base URL: models.inference.ai.azure.com
Playground — Test Phi-4
Test Phi-4 directly in your browser. Your API key is sent directly to GitHub Models — never stored.
🔒 Your key is never stored — sent directly to the model provider via our server proxy.
Ready to chat with Phi-4.
Frequently Asked Questions
How do I get an API key for Phi-4?
Sign up at GitHub Models to get your API key. No credit card is required — just an email sign-up. Once you have the key, use the code snippets in the Quick Start section above.
Is Phi-4 really free?
Yes. Phi-4 is available on GitHub Models's free tier and has been free since May 20, 2026. Rate limits apply: See provider page. Always check the provider's terms for any changes to the free tier.
What are Phi-4's rate limits?
See provider page Context window: 131K. Max output: 0. No credit card required.
What are the best free alternatives to Phi-4?
Popular free alternatives include NVIDIA: Nemotron 3 Nano Omni (free), Arcee AI: Trinity Large Thinking (free), NVIDIA: Nemotron 3 Super (free). You can also browse all 147+ free models on our site.
More questions? See our full FAQ →