Chat
IBM: Granite 4.1 8B
ibm-granite/granite-4.1-8b
131KContext Window
4KMax Output
Normal
Granite 4.1 8B is a dense, decoder-only 8-billion-parameter language model from IBM, part of the Granite 4.1 family. It supports a 131K-token context window and is designed for enterprise tasks including tool calling, retrieval-augmented generation (RAG), code generation with fill-in-the-middle support, text summarization, classification, and extraction. The model handles 12 languages (English, German, Spanish, French, Japanese, Portuguese, Arabic, Czech, Italian, Korean, Dutch, and Chinese) and implements OpenAI-compatible tool calling. Released under the Apache 2.0 license.
Capabilities
Text GenerationCode GenerationAnalysis & Reasoning
Technical Specs
Input Modality
Text
Output Modality
Text
Arch
—
Pricing
Pay per use, no monthly fees| Billing Type | Unit | Price |
|---|---|---|
| Text Input | — | $0.0500/M tokens |
| Text Output | — | $0.1000/M tokens |
| Cache Read | — | $0.0500/M tokens |
Quick Start
from openai import OpenAI
client = OpenAI(
base_url="https://api.uniontoken.ai/v1",
api_key="YOUR_UNIONTOKEN_API_KEY",
)
response = client.chat.completions.create(
model="ibm-granite/granite-4.1-8b",
messages=[
{"role": "user", "content": "Hello!"}
],
)
print(response.choices[0].message.content)FAQ
IBM: Granite 4.1 8B
ibm-granite/granite-4.1-8b
In< ¥0.001/1K
Out< ¥0.001/1K
Context Window131K
Max Output4K
Related Models
View All → →Ready to get started?
Get 1M free tokens on registration, no monthly fees or minimum spend
Register Now →