Chat
Qwen: Qwen3.6 Flash
qwen/qwen3.6-flash
1000KContext Window
4KMax Output
Normal
Qwen3.6 Flash is a fast, efficient language model from Alibaba's Qwen 3.6 series. It supports text, image, and video input with a 1M token context window. Tiered pricing kicks in above 256K tokens. Prompt caching is supported, with both explicit cache read and cache creation pricing.
Capabilities
Text GenerationVideo Generation
Technical Specs
Input Modality
Text
Output Modality
Text
Arch
—
Pricing
Pay per use, no monthly fees| Billing Type | Unit | Price |
|---|---|---|
| Text Input | — | $0.2500/M tokens |
| Text Output | — | $1.5000/M tokens |
| Cache Write 1h | — | $0.3125/M tokens |
| Cache Write | — | $0.3125/M tokens |
Quick Start
from openai import OpenAI
client = OpenAI(
base_url="https://api.uniontoken.ai/v1",
api_key="YOUR_UNIONTOKEN_API_KEY",
)
response = client.chat.completions.create(
model="qwen/qwen3.6-flash",
messages=[
{"role": "user", "content": "Hello!"}
],
)
print(response.choices[0].message.content)FAQ
Qwen: Qwen3.6 Flash
qwen/qwen3.6-flash
In< ¥0.001/1K
Out< ¥0.001/1K
Context Window1000K
Max Output4K
Related Models
View All → →Ready to get started?
Get 1M free tokens on registration, no monthly fees or minimum spend
Register Now →