o3-mini — Free AI Model & API

github-models/o3-mini

Context Window: 200K
Max Output: 100K
Rate Limit: 10 RPM, 50 RPD
Cost: $0.00 (free)
Free Period: Since May 10, 2026
Credit Card: Not required
Status: Online

Overview

o3-mini is OpenAI's lightweight reasoning model, free on GitHub Models for all GitHub users. Unlike standard chat models that generate responses in a single pass, o3-mini uses internal chain-of-thought deliberation to work through multi-step problems, which makes it particularly strong at math, competitive programming, and logic puzzles. The model itself supports a 200K context window and up to 100K output tokens, and because the endpoint is OpenAI SDK compatible, existing OpenAI client code works after changing only the base URL and API key. The free tier allows 10 requests per minute (RPM) and 50 requests per day (RPD), and GitHub Models applies much lower per-request token caps (8K input / 4K output) than the model's own limits. If your task needs explicit step-by-step reasoning rather than fast conversational responses, o3-mini is a strong free option.

Model ID

o3-mini

Base URL

https://models.inference.ai.azure.com

Specifications

Context: 200K · Output: 100K · Modality: text · OpenAI Compat: Yes

Quick Start

Each snippet below sends a single chat request to o3-mini: first with the OpenAI Python SDK, then with the OpenAI JavaScript SDK, then with curl. Replace YOUR_API_KEY with your GitHub Models key.

Python

from openai import OpenAI

# Point the OpenAI SDK at the GitHub Models endpoint.
client = OpenAI(
    base_url="https://models.inference.ai.azure.com",
    api_key="YOUR_API_KEY",
)

# Send one user message and print the model's reply.
response = client.chat.completions.create(
    model="o3-mini",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)

JavaScript

import OpenAI from "openai";

// Point the OpenAI SDK at the GitHub Models endpoint.
const openai = new OpenAI({
  baseURL: "https://models.inference.ai.azure.com",
  apiKey: "YOUR_API_KEY",
});

// Top-level await requires an ES module (for example, a .mjs file).
const completion = await openai.chat.completions.create({
  model: "o3-mini",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

curl

curl https://models.inference.ai.azure.com/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "o3-mini",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
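If you would rather not install an SDK, the curl request above can be reproduced with Python's standard library alone. The following is a minimal sketch, not an official client: the helper names (`build_chat_request`, `send`, `key_from_env`) and the environment variable name `GITHUB_MODELS_API_KEY` are assumptions made for illustration, and the response shape is assumed to match the OpenAI chat completions format used by the SDK snippets.

```python
import json
import os
import urllib.request

BASE_URL = "https://models.inference.ai.azure.com"  # base URL listed on this page


def build_chat_request(prompt, api_key, model="o3-mini"):
    """Build (but do not send) a POST request equivalent to the curl example."""
    body = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=body,
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def send(request):
    """Send the request and return the assistant's reply text.

    Assumes an OpenAI-style response: choices[0].message.content.
    """
    with urllib.request.urlopen(request) as resp:
        payload = json.load(resp)
    return payload["choices"][0]["message"]["content"]


def key_from_env(name="GITHUB_MODELS_API_KEY"):
    """Read the key from an environment variable (the name is hypothetical)."""
    return os.environ[name]
```

A call would look like `print(send(build_chat_request("Hello!", key_from_env())))`. Reading the key from the environment keeps it out of source control, which is the main reason to prefer this over a hard-coded string.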

Other Free Models from GitHub Models

gpt-4.1

1.0M context · No card

gpt-4.1-mini

1.0M context · No card

gpt-4o

128K context · No card

o4-mini

200K context · No card

Llama-4-Scout-17B-16E

512K context · No card

Rate Limits & Constraints

Rate Limit: 10 RPM, 50 RPD
Context Window: 200K
Max Output Tokens: 100K
Cost: Free since May 10, 2026
Credit Card: Not required
OpenAI Compatible: Yes (drop-in replacement)
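With only 50 requests per day, a script that sends requests at the full 10 RPM would use up the daily allowance in five minutes, so it can help to track both budgets on the client side. The sketch below is one way to do that under stated assumptions: the class name `FreeTierLimiter` is hypothetical, and both limits are modelled as rolling windows (60 seconds and 24 hours), which may not match how GitHub Models actually resets its counters.

```python
import time
from collections import deque

SECONDS_PER_MINUTE = 60
SECONDS_PER_DAY = 86_400


class FreeTierLimiter:
    """Client-side guard for a per-minute and a per-day request budget."""

    def __init__(self, per_minute=10, per_day=50, clock=time.monotonic):
        self.per_minute = per_minute
        self.per_day = per_day
        self.clock = clock      # injectable so the limiter can be tested
        self.sent = deque()     # timestamps of requests from the last 24 hours

    def wait_time(self):
        """Return seconds to wait before the next request (0.0 if allowed now)."""
        now = self.clock()
        # Forget requests that have left the 24-hour window.
        while self.sent and now - self.sent[0] >= SECONDS_PER_DAY:
            self.sent.popleft()
        if len(self.sent) >= self.per_day:
            return self.sent[0] + SECONDS_PER_DAY - now
        recent = [t for t in self.sent if now - t < SECONDS_PER_MINUTE]
        if len(recent) >= self.per_minute:
            return recent[0] + SECONDS_PER_MINUTE - now
        return 0.0

    def record(self):
        """Call once after each request is actually sent."""
        self.sent.append(self.clock())
```

Typical use is to call `time.sleep(limiter.wait_time())` before each request and `limiter.record()` after it. The server remains the source of truth, so a request can still be rejected even when this local count says it is allowed.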

GitHub Models Platform Limitations

Low per-request token limits (8K input / 4K output)
Rate limits tied to GitHub Copilot subscription tier
Not suitable for large-context or long-generation tasks
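Because the per-request caps are far below the model's 200K / 100K limits, it can be worth checking a prompt before spending one of the 50 daily requests on it. The sketch below is a rough pre-flight check only: the function names are hypothetical, "8K" and "4K" are read as 8,000 and 4,000 tokens (the exact values are an assumption), and the four-characters-per-token ratio is a common rule of thumb for English text rather than a real tokenizer.

```python
MAX_INPUT_TOKENS = 8_000    # "8K input" from the list above; exact value assumed
MAX_OUTPUT_TOKENS = 4_000   # "4K output" from the list above; exact value assumed
CHARS_PER_TOKEN = 4         # rough heuristic for English text, not a tokenizer


def estimate_tokens(text):
    """Rough estimate: one token per CHARS_PER_TOKEN characters, rounded up."""
    return -(-len(text) // CHARS_PER_TOKEN)


def fits_platform_caps(prompt, max_output_tokens):
    """Return True if the prompt and requested output fit the assumed caps."""
    return (
        estimate_tokens(prompt) <= MAX_INPUT_TOKENS
        and max_output_tokens <= MAX_OUTPUT_TOKENS
    )
```

For an accurate count, replace `estimate_tokens` with a real tokenizer for the model; the heuristic can be off by a wide margin for code or non-English text.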

Features & Use Cases

Best For

Chat

Modality Support

text

GitHub Models Highlights

45+ models including GPT-4.1 and o3
Free for all GitHub accounts
Includes Llama 4, DeepSeek-R1, Mistral
Base URL: models.inference.ai.azure.com

Frequently Asked Questions

How do I get an API key for o3-mini?

Sign up at GitHub Models to get your API key. No credit card is required, only an email sign-up. Once you have the key, use the code snippets in the Quick Start section above.

Is o3-mini really free?

Yes. o3-mini is available on the GitHub Models free tier and has been free since May 10, 2026. Rate limits apply: 10 requests per minute (RPM) and 50 requests per day (RPD). Always check the provider's terms for any changes to the free tier.

What are o3-mini's rate limits?

10 requests per minute (RPM) and 50 requests per day (RPD). Context window: 200K. Max output: 100K. No credit card required.
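When a request does exceed these limits, a common way to recover is to wait and retry with exponential backoff. The sketch below illustrates the idea under stated assumptions: `RateLimited` is a hypothetical stand-in for whatever error your client raises on a rate-limit response, `call_with_backoff` is a name chosen for this example, and the 6-second base delay is simply 60 seconds divided by the 10 RPM limit.

```python
import time


class RateLimited(Exception):
    """Hypothetical stand-in for the error your client raises when rate limited."""


def call_with_backoff(call, max_attempts=4, base_delay=6.0, sleep=time.sleep):
    """Run call(); on RateLimited, wait base_delay * 2**attempt and try again."""
    for attempt in range(max_attempts):
        try:
            return call()
        except RateLimited:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the error to the caller
            sleep(base_delay * 2 ** attempt)
```

To use this with a real client, raise or translate the client's own rate-limit error into `RateLimited` inside `call` (with the OpenAI Python SDK this is likely `openai.RateLimitError`, though that is worth confirming against the SDK version you have installed). Backoff helps with the per-minute limit only; once the daily allowance is used up, retrying will keep failing until the quota resets.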

What are the best free alternatives to o3-mini?

Popular free alternatives include inclusionAI: Ring-2.6-1T, Baidu Qianfan: CoBuddy (free), and Owl Alpha. You can also browse all 147+ free models on our site.