AI Models//Qwen: Qwen3.6 Flash
Chat

Qwen: Qwen3.6 Flash

qwen/qwen3.6-flash
1000KContext Window
4KMax Output
Normal

Qwen3.6 Flash is a fast, efficient language model from Alibaba's Qwen 3.6 series. It supports text, image, and video input with a 1M token context window. Tiered pricing kicks in above 256K tokens. Prompt caching is supported, with both explicit cache read and cache creation pricing.

Capabilities

Text GenerationVideo Generation

Technical Specs

Input Modality
Text
Output Modality
Text
Arch

Pricing

Pay per use, no monthly fees
Billing TypeUnitPrice
Text Input$0.2500/M tokens
Text Output$1.5000/M tokens
Cache Write 1h$0.3125/M tokens
Cache Write$0.3125/M tokens

Quick Start

from openai import OpenAI

client = OpenAI(
    base_url="https://api.uniontoken.ai/v1",
    api_key="YOUR_UNIONTOKEN_API_KEY",
)

response = client.chat.completions.create(
    model="qwen/qwen3.6-flash",
    messages=[
        {"role": "user", "content": "Hello!"}
    ],
)

print(response.choices[0].message.content)

FAQ

Ready to get started?

Get 1M free tokens on registration, no monthly fees or minimum spend

Register Now →