Meta-Llama-3_3-70B-Instruct — Free AI Model & API

ovhcloud-ai-endpoints/meta-llama-3-3-70b-instruct
chat
Context Window: 131K
Max Output: 4K
Rate Limit: 2 RPM (anonymous)
Cost: $0.00 (free)
Free Period: Since May 10, 2026
Credit Card: Not required
Status: Online

Overview

Llama 3.3 70B Instruct on OVHcloud AI Endpoints provides free access to Meta's flagship 70B model with a 131K-token context window and an OpenAI-compatible API. OVHcloud is a major European cloud provider, which makes this endpoint particularly attractive for EU-based developers who want GDPR-compliant infrastructure. Free-tier output is capped at 4K tokens per request, which is adequate for chat and short generation but not for long-form writing. The anonymous tier needs no registration; getting an API key requires signing up.
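Because output is capped at 4K tokens, a long reply can stop mid-sentence. The following is a minimal sketch of detecting that, assuming the endpoint follows the OpenAI chat-completions response shape, where `finish_reason` is `"length"` when the token limit cut the reply short. The helper name `was_truncated` and the sample payloads are illustrative and not part of any OVHcloud API.

```python
def was_truncated(response: dict) -> bool:
    """Return True if the first choice stopped because it hit the token limit."""
    choices = response.get("choices") or []
    if not choices:
        return False
    return choices[0].get("finish_reason") == "length"


# Hand-written sample payloads in the OpenAI response shape (not real API output).
complete = {"choices": [{"message": {"content": "Hi!"}, "finish_reason": "stop"}]}
cut_off = {"choices": [{"message": {"content": "Once upon a"}, "finish_reason": "length"}]}

print(was_truncated(complete))
print(was_truncated(cut_off))
```

When the check returns True, one option is to send a follow-up request asking the model to continue, keeping in mind that each follow-up counts against the rate limit.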

Model ID
meta-llama-3-3-70b-instruct
Base URL
https://oai.endpoints.kepler.ai.cloud.ovh.net/v1
Specifications
Context: 131K · Output: 4K · Modality: text · OpenAI Compat: Yes
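The two limits interact: the requested output has to fit under the 4K cap and, on most inference servers, prompt plus output also has to fit inside the context window. A rough sketch of that arithmetic follows. It assumes "131K" means 131,072 tokens and "4K" means 4,096 tokens, which this page does not state exactly, and `output_budget` is a hypothetical helper name.

```python
CONTEXT_WINDOW = 131_072  # assumption: "131K" means 128 * 1024 tokens
MAX_OUTPUT = 4_096        # assumption: "4K" means 4 * 1024 tokens


def output_budget(prompt_tokens: int) -> int:
    """Largest max_tokens value that fits both the output cap and the context window."""
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    return max(0, min(MAX_OUTPUT, remaining))


print(output_budget(1_000))    # short prompt: the output cap is the binding limit
print(output_budget(130_000))  # near-full context: only the leftover tokens remain
```

For typical chat prompts the output cap is the binding constraint; the context window only matters once the prompt approaches its full size.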

Quick Start

Integrate Meta-Llama-3_3-70B-Instruct in a few lines of code using any OpenAI-compatible client. The snippets below cover Python, JavaScript, and cURL. See the config generator for Claude Code, Cursor, and more.

Python

from openai import OpenAI

client = OpenAI(
    base_url="https://oai.endpoints.kepler.ai.cloud.ovh.net/v1",
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="meta-llama-3-3-70b-instruct",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)

JavaScript

import OpenAI from "openai";

const openai = new OpenAI({
  baseURL: "https://oai.endpoints.kepler.ai.cloud.ovh.net/v1",
  apiKey: "YOUR_API_KEY",
});

const completion = await openai.chat.completions.create({
  model: "meta-llama-3-3-70b-instruct",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

cURL

curl https://oai.endpoints.kepler.ai.cloud.ovh.net/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "meta-llama-3-3-70b-instruct",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
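For environments where installing the `openai` package is not an option, the same call can be sketched with the Python standard library alone. This mirrors the cURL request above. The environment variable name `OVH_AI_API_KEY` and the helper `build_chat_request` are assumptions made for illustration, and the send step is left commented out because it needs network access and a valid key.

```python
import json
import os
import urllib.request

BASE_URL = "https://oai.endpoints.kepler.ai.cloud.ovh.net/v1"
MODEL = "meta-llama-3-3-70b-instruct"


def build_chat_request(prompt: str, api_key: str, max_tokens: int = 512) -> urllib.request.Request:
    """Build (but do not send) a chat-completions POST request."""
    payload = {
        "model": MODEL,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": max_tokens,
    }
    return urllib.request.Request(
        url=f"{BASE_URL}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


# OVH_AI_API_KEY is a hypothetical variable name; falls back to the placeholder.
request = build_chat_request("Hello!", os.environ.get("OVH_AI_API_KEY", "YOUR_API_KEY"))
print(request.full_url)

# To actually send it (needs network access and a valid key):
# with urllib.request.urlopen(request, timeout=30) as resp:
#     print(json.load(resp)["choices"][0]["message"]["content"])
```

Reading the key from the environment keeps it out of source control, which matters more once the placeholder is replaced with a real key.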

Rate Limits & Constraints

Rate Limit: 2 RPM (anonymous)
Context Window: 131K
Max Output Tokens: 4K
Cost: Free since May 10, 2026
Credit Card: Not required
OpenAI Compatible: Yes (drop-in replacement)
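At 2 RPM, a client that sends requests back to back will be rejected almost immediately. One way to stay under the limit is to space calls at least 30 seconds apart on the client side. The sketch below is an illustration under that assumption; `MinIntervalLimiter` is a hypothetical helper, and the server's actual accounting window may differ.

```python
import time


class MinIntervalLimiter:
    """Space calls at least 60 / rpm seconds apart (client-side only)."""

    def __init__(self, rpm: float, clock=time.monotonic, sleep=time.sleep):
        self.interval = 60.0 / rpm
        self._clock = clock
        self._sleep = sleep
        self._last = None  # time of the previous call, None before the first

    def wait(self) -> float:
        """Block until the next call is allowed; return the seconds slept."""
        now = self._clock()
        delay = 0.0
        if self._last is not None:
            delay = max(0.0, self._last + self.interval - now)
        if delay > 0:
            self._sleep(delay)
        self._last = now + delay
        return delay


limiter = MinIntervalLimiter(rpm=2)  # 2 RPM -> one request every 30 seconds
print(limiter.interval)
```

Calling `limiter.wait()` immediately before each API request is enough for a single-threaded script; a multi-process setup would need shared state, which this sketch does not cover.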

OVHcloud AI Endpoints Platform Limitations

  • Anonymous tier has very low rate limits (listed here as 2 RPM) that are not officially published
  • Some models experience cold starts (5-10 s on the first request)
  • EU-hosted only, so latency is higher outside Europe
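Cold starts and long round trips can surface as timeouts on the first request. A common mitigation is to retry with exponential backoff, sketched below. `call_with_retry` and `flaky_request` are hypothetical names, the delays are arbitrary starting points, and which exception types a real client raises depends on the HTTP library in use.

```python
import time


def call_with_retry(fn, attempts=3, base_delay=2.0, sleep=time.sleep,
                    retry_on=(TimeoutError, ConnectionError)):
    """Call fn(); on a retryable error wait base_delay * 2**n and try again."""
    for n in range(attempts):
        try:
            return fn()
        except retry_on:
            if n == attempts - 1:
                raise  # out of attempts: surface the last error
            sleep(base_delay * 2 ** n)


# Stand-in for a request whose first call times out during a cold start.
calls = {"count": 0}


def flaky_request():
    calls["count"] += 1
    if calls["count"] == 1:
        raise TimeoutError("simulated cold start")
    return "ok"


result = call_with_retry(flaky_request, sleep=lambda s: None)
print(result, calls["count"])
```

Each retry is a new request, so on a 2 RPM tier it is worth keeping `attempts` small.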

Features & Use Cases

Best For: Chat

Modality Support: Text

OVHcloud AI Endpoints Highlights

  • No registration needed for anonymous tier
  • EU-hosted (GDPR compliant)
  • Model catalog includes Qwen3-Coder, Mistral, Llama, and DeepSeek
  • OpenAI-compatible endpoint

Frequently Asked Questions

How do I get an API key for Meta-Llama-3_3-70B-Instruct?

Sign up at OVHcloud AI Endpoints to get your API key. No credit card is required — just an email sign-up. Once you have the key, use the code snippets in the Quick Start section above.

Is Meta-Llama-3_3-70B-Instruct really free?

Yes. Meta-Llama-3_3-70B-Instruct is available on the OVHcloud AI Endpoints free tier and has been free since May 10, 2026. Rate limits apply: 2 RPM on the anonymous tier. Always check the provider's terms for changes to the free tier.

What are Meta-Llama-3_3-70B-Instruct's rate limits?

The rate limit is 2 RPM on the anonymous tier. Context window: 131K tokens. Max output: 4K tokens. No credit card required.

What are the best free alternatives to Meta-Llama-3_3-70B-Instruct?

Popular free alternatives include inclusionAI: Ring-2.6-1T; Baidu Qianfan: CoBuddy (free); and Owl Alpha. You can also browse the 147+ free models on our site.
