z-ai/glm-5.1 — Free AI Model & API

nvidia-nim/z-ai-glm-5-1
chat
Context Window 203K
Max Output 8K
Rate Limit Up to 40 RPM
Cost $0.00 FREE
Free Period Since Apr 7, 2026
Credit Card Not required
Status Unavailable

Overview

Z.AI GLM 5.1 is Zhipu AI's latest model, free on NVIDIA NIM with up to 40 RPM and no daily token cap. GLM 5.1 brings Zhipu AI's newest advances in bilingual (Chinese-English) performance and general reasoning. NVIDIA's OpenAI-compatible API makes it accessible from standard toolchains. Requires free NVIDIA Developer Program membership and phone verification.

Model ID
z-ai/glm-5.1
Base URL
https://integrate.api.nvidia.com/v1
Specifications
Context: 203K · Output: 8K · Modality: text · OpenAI Compat: Yes ·Released: Apr 7, 2026

Quick Start

Integrate z-ai/glm-5.1 with 3 lines of code. See the config generator for Claude Code, Cursor, and more.

from openai import OpenAI

client = OpenAI(
 base_url="https://integrate.api.nvidia.com/v1",
 api_key="YOUR_API_KEY"
)

response = client.chat.completions.create(
 model="z-ai/glm-5.1",
 messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)
import OpenAI from "openai";

const openai = new OpenAI({
 baseURL: "https://integrate.api.nvidia.com/v1",
 apiKey: "YOUR_API_KEY",
});

const completion = await openai.chat.completions.create({
 model: "z-ai/glm-5.1",
 messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);
curl https://integrate.api.nvidia.com/v1/chat/completions \
 -H "Content-Type: application/json" \
 -H "Authorization: Bearer YOUR_API_KEY" \
 -d '{
 "model": "z-ai/glm-5.1",
 "messages": [{"role": "user", "content": "Hello!"}]
 }'

Other Free Models from NVIDIA NIM

Rate Limits & Constraints

Rate Limit Up to 40 RPM
Context Window 203K
Max Output Tokens 8K
Cost Free — since Apr 7, 2026
Credit Card Not required
OpenAI Compatible Yes — drop-in replacement

NVIDIA NIM Platform Limitations

  • ~40 RPM shared across all models, not per-model
  • Some models require additional registration per model family
  • Unavailable models listed in catalog but uncallable with standard key

Features & Use Cases

Best For

Chat

Modality Support

text

NVIDIA NIM Highlights

  • 100+ open models available
  • No daily token cap
  • ~40 RPM free tier
  • No credit card required

Playground — Test z-ai/glm-5.1

Test z-ai/glm-5.1 directly in your browser. Your API key is sent directly to NVIDIA NIM — never stored.

Model: z-ai/glm-5.1 Get Key

🔒 Your key is never stored — sent directly to the model provider via our server proxy.

Ready to chat with z-ai/glm-5.1.

Frequently Asked Questions

How do I get an API key for z-ai/glm-5.1?

Sign up at NVIDIA NIM to get your API key. No credit card is required — just an email sign-up. Once you have the key, use the code snippets in the Quick Start section above.

Is z-ai/glm-5.1 really free?

Yes. z-ai/glm-5.1 is available on NVIDIA NIM's free tier and has been free since Apr 7, 2026. Rate limits apply: Up to 40 RPM. Always check the provider's terms for any changes to the free tier.

What are z-ai/glm-5.1's rate limits?

Up to 40 RPM Context window: 203K. Max output: 8K. No credit card required.

What are the best free alternatives to z-ai/glm-5.1?

Popular free alternatives include inclusionAI: Ring-2.6-1T, Baidu Qianfan: CoBuddy (free), Owl Alpha. You can also browse all 147+ free models on our site.

More questions? See our full FAQ →

Similar Free Models