kimi-k2-instruct — Free AI Model & API

groq/kimi-k2-instruct

chat

Get API key →

Context Window 262K

Max Output 262K

Rate Limit 30 RPM, 14,400 RPD

Cost $0.00 FREE

Free Period Since May 10, 2026

Credit Card Not required

Status Online

Overview

Kimi K2 Instruct on Groq is Moonshot AI's second-generation model, delivering 262K context and 262K output — one of the highest output ceilings available on any free endpoint. This makes it uniquely suited for long-form generation tasks: full document drafts, extended code refactors, or comprehensive analysis reports. Running on Groq's LPU hardware ensures responsive latency even with the high token throughput. The free tier offers 14,400 requests per day at 30 RPM. OpenAI SDK compatible; registration required, no credit card needed.

Model ID

kimi-k2-instruct

Base URL

https://api.groq.com/openai/v1

Specifications

Context: 262K · Output: 262K · Modality: text · OpenAI Compat: Yes

Quick Start

Integrate kimi-k2-instruct with 3 lines of code. See the config generator for Claude Code, Cursor, and more.

from openai import OpenAI

client = OpenAI(
 base_url="https://api.groq.com/openai/v1",
 api_key="YOUR_API_KEY"
)

response = client.chat.completions.create(
 model="kimi-k2-instruct",
 messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)

import OpenAI from "openai";

const openai = new OpenAI({
 baseURL: "https://api.groq.com/openai/v1",
 apiKey: "YOUR_API_KEY",
});

const completion = await openai.chat.completions.create({
 model: "kimi-k2-instruct",
 messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

curl https://api.groq.com/openai/v1/chat/completions \
 -H "Content-Type: application/json" \
 -H "Authorization: Bearer YOUR_API_KEY" \
 -d '{
 "model": "kimi-k2-instruct",
 "messages": [{"role": "user", "content": "Hello!"}]
 }'

Other Free Models from Groq

llama-3.3-70b-versatile

131K context · No card

llama-3.1-8b-instant

131K context · No card

llama-4-scout-17b-16e-instruct

131K context · No card

llama-4-maverick-17b-128e-instruct

131K context · No card

qwen3-32b

131K context · No card

Rate Limits & Constraints

Rate Limit 30 RPM, 14,400 RPD

Context Window 262K

Max Output Tokens 262K

Cost Free — since May 10, 2026

Credit Card Not required

OpenAI Compatible Yes — drop-in replacement

Groq Platform Limitations

Rate limits vary significantly by model — check per-model limits
Some models have token-per-minute caps in addition to RPM
LPU availability may cause queuing during peak usage

Features & Use Cases

Best For

Chat

Modality Support

text

Groq Highlights

Ultra-fast inference (~2,600 tok/s)
Free tier: 14,400 RPD for most models
Supports Llama 4, Qwen3, DeepSeek-R1
OpenAI-compatible

Playground — Test kimi-k2-instruct

Test kimi-k2-instruct directly in your browser. Your API key is sent directly to Groq — never stored.

Model: kimi-k2-instruct Get Key

🔒 Your key is never stored — sent directly to the model provider via our server proxy.

Ready to chat with kimi-k2-instruct.

Frequently Asked Questions

How do I get an API key for kimi-k2-instruct?

Sign up at Groq to get your API key. No credit card is required — just an email sign-up. Once you have the key, use the code snippets in the Quick Start section above.

Is kimi-k2-instruct really free?

Yes. kimi-k2-instruct is available on Groq's free tier and has been free since May 10, 2026. Rate limits apply: 30 RPM, 14,400 RPD. Always check the provider's terms for any changes to the free tier.

What are kimi-k2-instruct's rate limits?

30 RPM, 14,400 RPD Context window: 262K. Max output: 262K. No credit card required.

What are the best free alternatives to kimi-k2-instruct?

Popular free alternatives include inclusionAI: Ring-2.6-1T, Baidu Qianfan: CoBuddy (free), Owl Alpha. You can also browse all 147+ free models on our site.

kimi-k2-instruct — Free AI Model & API

Overview

Quick Start

Other Free Models from Groq

llama-3.3-70b-versatile

llama-3.1-8b-instant

llama-4-scout-17b-16e-instruct

llama-4-maverick-17b-128e-instruct

qwen3-32b

Rate Limits & Constraints

Groq Platform Limitations

Features & Use Cases

Best For

Modality Support

Groq Highlights

Playground — Test kimi-k2-instruct

Frequently Asked Questions

Similar Free Models

inclusionAI: Ring-2.6-1T

Baidu Qianfan: CoBuddy (free)

Owl Alpha