GeminiChat
Google: Gemini 2.5 Flash
google/gemini-2.5-flash
1049KContext Window
66KMax Output
Supported Protocols:reasoninginclude_reasoningstructured_outputsresponse_formatmax_tokenstemperaturetop_pseedtoolstool_choicestop
Normal
Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater accuracy and nuanced context handling. Additionally, Gemini 2.5 Flash is configurable through the "max tokens for reasoning" parameter, as described in the documentation (https://openrouter.ai/docs/use-cases/reasoning-tokens#max-tokens-for-reasoning).
Capabilities
👁 Vision🧠 Reasoning🔧 Function CallingText GenerationCode GenerationAnalysis & ReasoningReasoning
Technical Specs
Input Modality
Text、Image、Text、Audio、Video
Output Modality
Text
Arch
—
Default Temperature
0.7
Default Top_P
1
Pricing
Pay per use, no monthly fees| Billing Type | Unit | Price |
|---|---|---|
| Text Input | — | $0.3000/M tokens |
| Text Output | — | $2.5000/M tokens |
| Cache Read | — | $0.0300/M tokens |
| Cache Write 1h | — | $0.0833/M tokens |
| Cache Write | — | $0.0833/M tokens |
| Audio Cache Read | — | $0.1000/M tokens |
| Image Input | — | < $0.001/张 |
| Audio Input | — | $1.0000/分钟 |
| Web Search | — | $0.0140/次 |
Quick Start
from openai import OpenAI
client = OpenAI(
base_url="https://api.uniontoken.ai/v1",
api_key="YOUR_UNIONTOKEN_API_KEY",
)
response = client.chat.completions.create(
model="google/gemini-2.5-flash",
messages=[
{"role": "user", "content": "Hello!"}
],
)
print(response.choices[0].message.content)FAQ
Gemini
Google: Gemini 2.5 Flash
google/gemini-2.5-flash
In< ¥0.001/1K
Out< ¥0.001/1K
Context Window1049K
Max Output66K
Related Models
View All → →Ready to get started?
Get 1M free tokens on registration, no monthly fees or minimum spend
Register Now →