Qwen: Qwen3.6 Flash

qwen/qwen3.6-flash

1000KContext Window

4KMax Output

Normal

Qwen3.6 Flash is a fast, efficient language model from Alibaba's Qwen 3.6 series. It supports text, image, and video input with a 1M token context window. Tiered pricing kicks in above 256K tokens. Prompt caching is supported, with both explicit cache read and cache creation pricing.

Capabilities

Text GenerationVideo Generation

Technical Specs

Input Modality

Text

Output Modality

Text

Arch

—

Pricing

Pay per use, no monthly fees

Billing Type	Unit	Price
Text Input	—	$0.2500/M tokens
Text Output	—	$1.5000/M tokens
Cache Write 1h	—	$0.3125/M tokens
Cache Write	—	$0.3125/M tokens

Quick Start

from openai import OpenAI

client = OpenAI(
    base_url="https://api.uniontoken.ai/v1",
    api_key="YOUR_UNIONTOKEN_API_KEY",
)

response = client.chat.completions.create(
    model="qwen/qwen3.6-flash",
    messages=[
        {"role": "user", "content": "Hello!"}
    ],
)

print(response.choices[0].message.content)