Qwen: Qwen3 30B A3B Thinking 2507

qwen/qwen3-30b-a3b-thinking-2507
131K Context Window
131K Max Output
Supported Protocols: reasoning, include_reasoning, max_tokens, temperature, top_p, top_k, min_p, frequency_penalty, presence_penalty, repetition_penalty, stop, seed, logit_bias, response_format, structured_outputs, tools, tool_choice
Normal

Qwen3-30B-A3B-Thinking-2507 is a 30B parameter Mixture-of-Experts reasoning model optimized for complex tasks requiring extended multi-step thinking. The model is designed specifically for “thinking mode,” where internal reasoning traces are separated from final answers. Compared to earlier Qwen3-30B releases, this version improves performance across logical reasoning, mathematics, science, coding, and multilingual benchmarks. It also demonstrates stronger instruction following, tool use, and alignment with human preferences. With higher reasoning efficiency and extended output budgets, it is best suited for advanced research, competitive problem solving, and agentic applications requiring structured long-context reasoning.

Capabilities

🧠 Reasoning, 🔧 Function Calling, Text Generation, Code Generation, Analysis & Reasoning

Technical Specs

Input Modality: Text
Output Modality: Text
Arch:
Default Temperature: 0.7
Default Top_P: 1
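
The supported parameters and the defaults listed above can be combined into a request payload. The following is a minimal sketch, not an official client: `build_request`, `SUPPORTED_PARAMS`, and `DEFAULTS` are hypothetical names introduced here for illustration, and the assumption that unset sampling parameters fall back to the listed defaults is taken from the Technical Specs values only.

```python
# Parameter names copied from the "Supported Protocols" line of this page.
SUPPORTED_PARAMS = frozenset({
    "reasoning", "include_reasoning", "max_tokens", "temperature", "top_p",
    "top_k", "min_p", "frequency_penalty", "presence_penalty",
    "repetition_penalty", "stop", "seed", "logit_bias", "response_format",
    "structured_outputs", "tools", "tool_choice",
})

# Defaults listed under Technical Specs.
DEFAULTS = {"temperature": 0.7, "top_p": 1}

MODEL_ID = "qwen/qwen3-30b-a3b-thinking-2507"


def build_request(messages, **params):
    """Hypothetical helper: return a chat-completions payload as a dict."""
    unknown = sorted(set(params) - SUPPORTED_PARAMS)
    if unknown:
        raise ValueError(f"unsupported parameter(s): {', '.join(unknown)}")
    payload = {"model": MODEL_ID, "messages": list(messages)}
    payload.update(DEFAULTS)
    payload.update(params)  # explicit arguments override the defaults
    return payload


request = build_request(
    [{"role": "user", "content": "Hello!"}],
    max_tokens=2048,
    seed=7,
)
print(request)
```

Rejecting unknown names locally catches typos before a request is sent. When using the `openai` Python client, fields that are not part of the standard OpenAI schema (for example `top_k` or `min_p`) would likely need to be passed through the client's `extra_body` argument rather than as keyword arguments.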

Pricing

Pay per use, no monthly fees
Billing Type    Unit Price
Text Input      $0.0800/M tokens
Text Output     $0.4000/M tokens
Cache Read      $0.0800/M tokens
Reasoning       $0.0800/M tokens
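
As a worked example of the per-million-token rates above, the sketch below estimates the cost of a single request. The function `estimate_cost` is a hypothetical helper, and it assumes that each billing category is charged independently and that the charges simply add up; the page does not state how reasoning tokens are counted relative to output tokens.

```python
from decimal import Decimal

# USD per one million tokens, copied from the pricing table above.
PRICE_PER_MILLION = {
    "input": Decimal("0.08"),
    "output": Decimal("0.40"),
    "cache_read": Decimal("0.08"),
    "reasoning": Decimal("0.08"),
}
MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens=0, output_tokens=0,
                  cache_read_tokens=0, reasoning_tokens=0):
    """Assumption: categories are billed independently and summed."""
    counts = {
        "input": input_tokens,
        "output": output_tokens,
        "cache_read": cache_read_tokens,
        "reasoning": reasoning_tokens,
    }
    return sum(
        (PRICE_PER_MILLION[kind] * Decimal(n) / MILLION
         for kind, n in counts.items()),
        Decimal(0),
    )


# 2,000 input + 500 output + 8,000 reasoning tokens.
cost = estimate_cost(input_tokens=2_000, output_tokens=500,
                     reasoning_tokens=8_000)
print(f"${cost:.6f}")  # prints $0.001000
```

`Decimal` is used instead of floats so that small per-request amounts do not pick up binary rounding noise when they are summed over many calls.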

Quick Start

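from openai import OpenAI

# Point the standard OpenAI client at the UnionToken endpoint.
client = OpenAI(
    base_url="https://api.uniontoken.ai/v1",
    api_key="YOUR_UNIONTOKEN_API_KEY",  # replace with your own key
)

# Send a single-turn chat request to the model.
response = client.chat.completions.create(
    model="qwen/qwen3-30b-a3b-thinking-2507",
    messages=[
        {"role": "user", "content": "Hello!"}
    ],
)

# Print the text of the first (and only) choice.
print(response.choices[0].message.content)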
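
The model description says reasoning traces are kept separate from final answers, but this page does not document how the trace is returned. If the raw text still carries the trace inline, a small post-processing step can split it off. The sketch below assumes the trace ends with a `</think>` tag and may or may not begin with `<think>`; both the tag convention and the helper name `split_reasoning` are assumptions made for illustration.

```python
def split_reasoning(text, close_tag="</think>", open_tag="<think>"):
    """Return (reasoning, answer) from raw model text.

    Assumption: an inline trace, if present, ends at the first `close_tag`
    and may optionally start with `open_tag`.
    """
    head, sep, tail = text.partition(close_tag)
    if not sep:
        # No closing tag found: treat the whole text as the answer.
        return "", text.strip()
    reasoning = head.strip()
    if reasoning.startswith(open_tag):
        reasoning = reasoning[len(open_tag):].strip()
    return reasoning, tail.strip()


raw = "<think>2 + 2 is 4.</think>\n\nThe answer is 4."
reasoning, answer = split_reasoning(raw)
print(reasoning)  # prints: 2 + 2 is 4.
print(answer)     # prints: The answer is 4.
```

In the Quick Start above, the helper would be applied to `response.choices[0].message.content`. If the service already returns the trace in a separate field, the function simply returns an empty reasoning string and the unchanged answer.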
