Gemini 2.5 Flash Lite Preview 06-17 by Google AI — Pricing, Benchmarks &amp; Real Outputs

Alternatives to Gemini 2.5 Flash Lite Preview 06-17

Gemini 2.5 Flash Lite Preview 06-17's competitors have been quietly putting in work.

Gemini 2.5 Flash Lite Preview 06-17 has opinions about your brand too.Ask 10 models what they tell people about you. Verbatim receipts.Run the mirror

Updated Jun 17, 2025

Share

Loading...

Compare Gemini 2.5 Flash Lite Preview 06-17

Alternatives to Gemini 2.5 Flash Lite Preview 06-17

Gemini 2.5 Flash Lite Preview 06-17's competitors have been quietly putting in work.

Gemini 2.5 Flash Lite Preview 06-17 has opinions about your brand too.Ask 10 models what they tell people about you. Verbatim receipts.Run the mirror

Updated Jun 17, 2025

Share

Gemini 2.5 Flash Lite Preview 06-17

Gemini:

2.5 Flash Lite Preview 06-17

Gemini 2.5 Flash Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance across common benchmarks compared to earlier Flash models. By default, "thinking" (i.e. multi-pass reasoning) is disabled to prioritize speed, but developers can enable it via the Reasoning API parameter to selectively trade off cost for intelligence.

Provider

Release Date

2025-06-17

Size

SMALL

Parameters

Not disclosed

Pricing

In: $0.10/1M

Out: $0.40/1M

Get API accessProvider and language code samples

Provider

fromimport openai  OpenAI

client = OpenAI(
"https://openrouter.ai/api/v1"    base_url=,
"$OPENROUTER_API_KEY"    api_key=,
)

response = client.chat.completions.create(
"google/gemini-2.5-flash-lite-preview-06-17"    model=,
"role""user""content""Hello!"    messages=[{: , : }],
)
print(response.choices[0].message.content)

Set OPENROUTER_API_KEY with your OpenRouter API key from openrouter.ai/keys.

Also on Google Vertex AI

Deep analysisPsychometrics, taste index, writing DNA

SubjectiveBench

Taste Index

Across 11 scored outputs

150.15xFloor

0100headroom →

Taste is judged on an uncapped scale, originality first. The space past 100 is craft today's models rarely reach.

Craft22

Originality11

Plays it safe

share of outputs that are the default answer

73%

Writing DNA

Stylometric Fingerprint

Based on 3 text responses

Tick = global average

Vocabulary Diversity61%

Unique words vs. total words. Higher = richer vocabulary.

Sentence Length10.0 words

Average words per sentence.

Hedging0.12

"Might", "perhaps", "arguably" per 100 words.

Bold Formatting1.9

**Bold** markers per 1,000 characters.

List Usage4.2

Bullet and numbered list items per 1,000 characters.

Section Structure0.05

Markdown headings per 1,000 characters.

Emoji Usage0.00

Emoji per 1,000 characters.

Transitions0.00

"However", "moreover", "furthermore" per 100 words.

Opening Habits

Consistency

81%

Across 3 responses

Sponsored

Model Responses

11 outputs from Gemini 2.5 Flash Lite Preview 06-17

Related Models

Google: Gemini 3.1 Flash Lite

Gemini 3.1 Flash Lite is Google's GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic workflows, simple data extraction, and applications where responsiveness and API cost are the primary constraints. Supports full thinking levels (minimal, low, medium, high) for fine-grained cost/performance trade-offs. Priced at half the cost of Gemini 3 Flash.

Google: Gemini 3.1 Flash Lite Preview

Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across key capabilities. Improvements span audio input/ASR, RAG snippet ranking, translation, data extraction, and code completion. Supports full thinking levels (minimal, low, medium, high) for fine-grained cost/performance trade-offs. Delivers 2.5x faster Time to First Answer Token and 45% increased output speed compared to 2.5 Flash. Priced at half the cost of Gemini 3 Flash.

Gemini 3.1 Pro Preview

Gemini 3.1 Pro Preview is Google's frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows. Building on the multimodal foundation of the Gemini 3 series, it combines high-precision reasoning across text, image, video, audio, and code with a 1M-token context window. The 3.1 update introduces measurable gains in SWE benchmarks and real-world coding environments, along with stronger autonomous task execution in structured domains such as finance and spreadsheet-based workflows. Designed for advanced development and agentic systems, it introduces a new medium thinking level to better balance cost, speed, and performance.

Gemini 2.5 Pro Preview 06-05

Gemini 2.5 Pro is Google's state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs "thinking" capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities. Pricing: $1.25/M input tokens, $10/M output tokens, $5.16/K input images.

Gemini 2.5 Flash Preview 05-20

Gemini 2.5 Flash May 20th Checkpoint is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater accuracy and nuanced context handling. Note: This model is available in two variants: thinking and non-thinking. The output pricing varies significantly depending on whether the thinking capability is active. If you select the standard variant (without the ":thinking" suffix), the model will explicitly avoid generating thinking tokens. To utilize the thinking capability and receive thinking tokens, you must choose the ":thinking" variant, which will then incur the higher thinking-output pricing. Additionally, Gemini 2.5 Flash is configurable through the "max tokens for reasoning" parameter.

Gemini 2.5 Flash Preview 05-20 (thinking)

Gemini 2.5 Flash May 20th Checkpoint is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater accuracy and nuanced context handling. Note: This model is available in two variants: thinking and non-thinking. The output pricing varies significantly depending on whether the thinking capability is active. If you select the standard variant (without the ":thinking" suffix), the model will explicitly avoid generating thinking tokens. To utilize the thinking capability and receive thinking tokens, you must choose the ":thinking" variant, which will then incur the higher thinking-output pricing. Additionally, Gemini 2.5 Flash is configurable through the "max tokens for reasoning" parameter.

Keep exploring

Gemini 2.5 Flash Lite Preview 06-17 vs MiniMax M3

Real outputs compared side by side

Best AI for Creative Writing

Find the best AI for creative writing. Ranked across comedy, fiction, satire,...

Loading...

Compare Gemini 2.5 Flash Lite Preview 06-17

Alternatives to Gemini 2.5 Flash Lite Preview 06-17

Gemini 2.5 Flash Lite Preview 06-17's competitors have been quietly putting in work.

Gemini 2.5 Flash Lite Preview 06-17 has opinions about your brand too.Ask 10 models what they tell people about you. Verbatim receipts.Run the mirror

Updated Jun 17, 2025

Share

Gemini 2.5 Flash Lite Preview 06-17

Gemini:

2.5 Flash Lite Preview 06-17

Gemini 2.5 Flash Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance across common benchmarks compared to earlier Flash models. By default, "thinking" (i.e. multi-pass reasoning) is disabled to prioritize speed, but developers can enable it via the Reasoning API parameter to selectively trade off cost for intelligence.

Provider

Release Date

2025-06-17

Size

SMALL

Parameters

Not disclosed

Pricing

In: $0.10/1M

Out: $0.40/1M

Get API accessProvider and language code samples

Provider

fromimport openai  OpenAI

client = OpenAI(
"https://openrouter.ai/api/v1"    base_url=,
"$OPENROUTER_API_KEY"    api_key=,
)

response = client.chat.completions.create(
"google/gemini-2.5-flash-lite-preview-06-17"    model=,
"role""user""content""Hello!"    messages=[{: , : }],
)
print(response.choices[0].message.content)

Set OPENROUTER_API_KEY with your OpenRouter API key from openrouter.ai/keys.

Also on Google Vertex AI

Deep analysisPsychometrics, taste index, writing DNA

SubjectiveBench

Taste Index

Across 11 scored outputs

150.15xFloor

0100headroom →

Taste is judged on an uncapped scale, originality first. The space past 100 is craft today's models rarely reach.

Craft22

Originality11

Plays it safe

share of outputs that are the default answer

73%

Writing DNA

Stylometric Fingerprint

Based on 3 text responses

Tick = global average

Vocabulary Diversity61%

Unique words vs. total words. Higher = richer vocabulary.

Sentence Length10.0 words

Average words per sentence.

Hedging0.12

"Might", "perhaps", "arguably" per 100 words.

Bold Formatting1.9

**Bold** markers per 1,000 characters.

List Usage4.2

Bullet and numbered list items per 1,000 characters.

Section Structure0.05

Markdown headings per 1,000 characters.

Emoji Usage0.00

Emoji per 1,000 characters.

Transitions0.00

"However", "moreover", "furthermore" per 100 words.

Opening Habits

Consistency

81%

Across 3 responses

Sponsored

Model Responses

11 outputs from Gemini 2.5 Flash Lite Preview 06-17

Related Models

Google: Gemini 3.1 Flash Lite

Gemini 3.1 Flash Lite is Google's GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic workflows, simple data extraction, and applications where responsiveness and API cost are the primary constraints. Supports full thinking levels (minimal, low, medium, high) for fine-grained cost/performance trade-offs. Priced at half the cost of Gemini 3 Flash.

Google: Gemini 3.1 Flash Lite Preview

Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across key capabilities. Improvements span audio input/ASR, RAG snippet ranking, translation, data extraction, and code completion. Supports full thinking levels (minimal, low, medium, high) for fine-grained cost/performance trade-offs. Delivers 2.5x faster Time to First Answer Token and 45% increased output speed compared to 2.5 Flash. Priced at half the cost of Gemini 3 Flash.

Gemini 3.1 Pro Preview

Gemini 3.1 Pro Preview is Google's frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows. Building on the multimodal foundation of the Gemini 3 series, it combines high-precision reasoning across text, image, video, audio, and code with a 1M-token context window. The 3.1 update introduces measurable gains in SWE benchmarks and real-world coding environments, along with stronger autonomous task execution in structured domains such as finance and spreadsheet-based workflows. Designed for advanced development and agentic systems, it introduces a new medium thinking level to better balance cost, speed, and performance.

Gemini 2.5 Pro Preview 06-05

Gemini 2.5 Pro is Google's state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs "thinking" capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities. Pricing: $1.25/M input tokens, $10/M output tokens, $5.16/K input images.

Gemini 2.5 Flash Preview 05-20

Gemini 2.5 Flash May 20th Checkpoint is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater accuracy and nuanced context handling. Note: This model is available in two variants: thinking and non-thinking. The output pricing varies significantly depending on whether the thinking capability is active. If you select the standard variant (without the ":thinking" suffix), the model will explicitly avoid generating thinking tokens. To utilize the thinking capability and receive thinking tokens, you must choose the ":thinking" variant, which will then incur the higher thinking-output pricing. Additionally, Gemini 2.5 Flash is configurable through the "max tokens for reasoning" parameter.

Gemini 2.5 Flash Preview 05-20 (thinking)

Gemini 2.5 Flash May 20th Checkpoint is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater accuracy and nuanced context handling. Note: This model is available in two variants: thinking and non-thinking. The output pricing varies significantly depending on whether the thinking capability is active. If you select the standard variant (without the ":thinking" suffix), the model will explicitly avoid generating thinking tokens. To utilize the thinking capability and receive thinking tokens, you must choose the ":thinking" variant, which will then incur the higher thinking-output pricing. Additionally, Gemini 2.5 Flash is configurable through the "max tokens for reasoning" parameter.

Keep exploring

Gemini 2.5 Flash Lite Preview 06-17 vs MiniMax M3

Real outputs compared side by side

Best AI for Creative Writing

Find the best AI for creative writing. Ranked across comedy, fiction, satire,...

Loading...

Compare Gemini 2.5 Flash Lite Preview 06-17

Alternatives to Gemini 2.5 Flash Lite Preview 06-17

Gemini 2.5 Flash Lite Preview 06-17's competitors have been quietly putting in work.

Gemini 2.5 Flash Lite Preview 06-17 has opinions about your brand too.Ask 10 models what they tell people about you. Verbatim receipts.Run the mirror

Gemini 2.5 Flash Lite Preview 06-17

Gemini:

2.5 Flash Lite Preview 06-17

Gemini 2.5 Flash Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance across common benchmarks compared to earlier Flash models. By default, "thinking" (i.e. multi-pass reasoning) is disabled to prioritize speed, but developers can enable it via the Reasoning API parameter to selectively trade off cost for intelligence.

Provider

Release Date

2025-06-17

Size

SMALL

Parameters

Not disclosed

Pricing

In: $0.10/1M

Out: $0.40/1M

Get API accessProvider and language code samples

Provider

fromimport openai  OpenAI

client = OpenAI(
"https://openrouter.ai/api/v1"    base_url=,
"$OPENROUTER_API_KEY"    api_key=,
)

response = client.chat.completions.create(
"google/gemini-2.5-flash-lite-preview-06-17"    model=,
"role""user""content""Hello!"    messages=[{: , : }],
)
print(response.choices[0].message.content)

Set OPENROUTER_API_KEY with your OpenRouter API key from openrouter.ai/keys.

Also on Google Vertex AI

Deep analysisPsychometrics, taste index, writing DNA

SubjectiveBench

Taste Index

Across 11 scored outputs

150.15xFloor

0100headroom →

Taste is judged on an uncapped scale, originality first. The space past 100 is craft today's models rarely reach.

Craft22

Originality11

Plays it safe

share of outputs that are the default answer

73%

Writing DNA

Stylometric Fingerprint

Based on 3 text responses

Tick = global average

Vocabulary Diversity61%

Unique words vs. total words. Higher = richer vocabulary.

Sentence Length10.0 words

Average words per sentence.

Hedging0.12

"Might", "perhaps", "arguably" per 100 words.

Bold Formatting1.9

**Bold** markers per 1,000 characters.

List Usage4.2

Bullet and numbered list items per 1,000 characters.

Section Structure0.05

Markdown headings per 1,000 characters.

Emoji Usage0.00

Emoji per 1,000 characters.

Transitions0.00

"However", "moreover", "furthermore" per 100 words.

Opening Habits

Consistency

81%

Across 3 responses

Sponsored

Model Responses

11 outputs from Gemini 2.5 Flash Lite Preview 06-17

Related Models

Google: Gemini 3.1 Flash Lite

Gemini 3.1 Flash Lite is Google's GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic workflows, simple data extraction, and applications where responsiveness and API cost are the primary constraints. Supports full thinking levels (minimal, low, medium, high) for fine-grained cost/performance trade-offs. Priced at half the cost of Gemini 3 Flash.

Google: Gemini 3.1 Flash Lite Preview

Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across key capabilities. Improvements span audio input/ASR, RAG snippet ranking, translation, data extraction, and code completion. Supports full thinking levels (minimal, low, medium, high) for fine-grained cost/performance trade-offs. Delivers 2.5x faster Time to First Answer Token and 45% increased output speed compared to 2.5 Flash. Priced at half the cost of Gemini 3 Flash.

Gemini 3.1 Pro Preview

Gemini 3.1 Pro Preview is Google's frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows. Building on the multimodal foundation of the Gemini 3 series, it combines high-precision reasoning across text, image, video, audio, and code with a 1M-token context window. The 3.1 update introduces measurable gains in SWE benchmarks and real-world coding environments, along with stronger autonomous task execution in structured domains such as finance and spreadsheet-based workflows. Designed for advanced development and agentic systems, it introduces a new medium thinking level to better balance cost, speed, and performance.

Gemini 2.5 Pro Preview 06-05

Gemini 2.5 Pro is Google's state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs "thinking" capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities. Pricing: $1.25/M input tokens, $10/M output tokens, $5.16/K input images.

Gemini 2.5 Flash Preview 05-20

Gemini 2.5 Flash May 20th Checkpoint is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater accuracy and nuanced context handling. Note: This model is available in two variants: thinking and non-thinking. The output pricing varies significantly depending on whether the thinking capability is active. If you select the standard variant (without the ":thinking" suffix), the model will explicitly avoid generating thinking tokens. To utilize the thinking capability and receive thinking tokens, you must choose the ":thinking" variant, which will then incur the higher thinking-output pricing. Additionally, Gemini 2.5 Flash is configurable through the "max tokens for reasoning" parameter.

Gemini 2.5 Flash Preview 05-20 (thinking)

Gemini 2.5 Flash May 20th Checkpoint is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater accuracy and nuanced context handling. Note: This model is available in two variants: thinking and non-thinking. The output pricing varies significantly depending on whether the thinking capability is active. If you select the standard variant (without the ":thinking" suffix), the model will explicitly avoid generating thinking tokens. To utilize the thinking capability and receive thinking tokens, you must choose the ":thinking" variant, which will then incur the higher thinking-output pricing. Additionally, Gemini 2.5 Flash is configurable through the "max tokens for reasoning" parameter.

Keep exploring

Gemini 2.5 Flash Lite Preview 06-17 vs MiniMax M3

Real outputs compared side by side

Best AI for Creative Writing

Find the best AI for creative writing. Ranked across comedy, fiction, satire,...

Gemini 2.5 Flash Lite Preview 06-17

Gemini:

2.5 Flash Lite Preview 06-17

Gemini 2.5 Flash Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance across common benchmarks compared to earlier Flash models. By default, "thinking" (i.e. multi-pass reasoning) is disabled to prioritize speed, but developers can enable it via the Reasoning API parameter to selectively trade off cost for intelligence.

Provider

Release Date

2025-06-17

Size

SMALL

Parameters

Not disclosed

Pricing

In: $0.10/1M

Out: $0.40/1M

Get API accessProvider and language code samples

Provider

fromimport openai  OpenAI

client = OpenAI(
"https://openrouter.ai/api/v1"    base_url=,
"$OPENROUTER_API_KEY"    api_key=,
)

response = client.chat.completions.create(
"google/gemini-2.5-flash-lite-preview-06-17"    model=,
"role""user""content""Hello!"    messages=[{: , : }],
)
print(response.choices[0].message.content)

Set OPENROUTER_API_KEY with your OpenRouter API key from openrouter.ai/keys.

Also on Google Vertex AI

Deep analysisPsychometrics, taste index, writing DNA

SubjectiveBench

Taste Index

Across 11 scored outputs

150.15xFloor

0100headroom →

Taste is judged on an uncapped scale, originality first. The space past 100 is craft today's models rarely reach.

Craft22

Originality11

Plays it safe

share of outputs that are the default answer

73%

Writing DNA

Stylometric Fingerprint

Based on 3 text responses

Tick = global average

Vocabulary Diversity61%

Unique words vs. total words. Higher = richer vocabulary.

Sentence Length10.0 words

Average words per sentence.

Hedging0.12

"Might", "perhaps", "arguably" per 100 words.

Bold Formatting1.9

**Bold** markers per 1,000 characters.

List Usage4.2

Bullet and numbered list items per 1,000 characters.

Section Structure0.05

Markdown headings per 1,000 characters.

Emoji Usage0.00

Emoji per 1,000 characters.

Transitions0.00

"However", "moreover", "furthermore" per 100 words.

Opening Habits

Consistency

81%

Across 3 responses

Sponsored

Model Responses

11 outputs from Gemini 2.5 Flash Lite Preview 06-17

Related Models

Google: Gemini 3.1 Flash Lite

Gemini 3.1 Flash Lite is Google's GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic workflows, simple data extraction, and applications where responsiveness and API cost are the primary constraints. Supports full thinking levels (minimal, low, medium, high) for fine-grained cost/performance trade-offs. Priced at half the cost of Gemini 3 Flash.

Google: Gemini 3.1 Flash Lite Preview

Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across key capabilities. Improvements span audio input/ASR, RAG snippet ranking, translation, data extraction, and code completion. Supports full thinking levels (minimal, low, medium, high) for fine-grained cost/performance trade-offs. Delivers 2.5x faster Time to First Answer Token and 45% increased output speed compared to 2.5 Flash. Priced at half the cost of Gemini 3 Flash.

Gemini 3.1 Pro Preview

Gemini 3.1 Pro Preview is Google's frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows. Building on the multimodal foundation of the Gemini 3 series, it combines high-precision reasoning across text, image, video, audio, and code with a 1M-token context window. The 3.1 update introduces measurable gains in SWE benchmarks and real-world coding environments, along with stronger autonomous task execution in structured domains such as finance and spreadsheet-based workflows. Designed for advanced development and agentic systems, it introduces a new medium thinking level to better balance cost, speed, and performance.

Gemini 2.5 Pro Preview 06-05

Gemini 2.5 Pro is Google's state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs "thinking" capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities. Pricing: $1.25/M input tokens, $10/M output tokens, $5.16/K input images.

Gemini 2.5 Flash Preview 05-20

Gemini 2.5 Flash May 20th Checkpoint is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater accuracy and nuanced context handling. Note: This model is available in two variants: thinking and non-thinking. The output pricing varies significantly depending on whether the thinking capability is active. If you select the standard variant (without the ":thinking" suffix), the model will explicitly avoid generating thinking tokens. To utilize the thinking capability and receive thinking tokens, you must choose the ":thinking" variant, which will then incur the higher thinking-output pricing. Additionally, Gemini 2.5 Flash is configurable through the "max tokens for reasoning" parameter.

Gemini 2.5 Flash Preview 05-20 (thinking)

Gemini 2.5 Flash May 20th Checkpoint is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater accuracy and nuanced context handling. Note: This model is available in two variants: thinking and non-thinking. The output pricing varies significantly depending on whether the thinking capability is active. If you select the standard variant (without the ":thinking" suffix), the model will explicitly avoid generating thinking tokens. To utilize the thinking capability and receive thinking tokens, you must choose the ":thinking" variant, which will then incur the higher thinking-output pricing. Additionally, Gemini 2.5 Flash is configurable through the "max tokens for reasoning" parameter.

Keep exploring

Gemini 2.5 Flash Lite Preview 06-17 vs MiniMax M3

Real outputs compared side by side