MarkTechPost@AI, September 28
Google Updates the Gemini 2.5 Flash and Flash-Lite Models

Google has released updated versions of the Gemini 2.5 Flash and Gemini 2.5 Flash-Lite preview models, available on AI Studio and Vertex AI. The updates bring improved agentic tool use and more efficient reasoning to the Flash model, and stricter instruction following, less verbose output, and stronger multimodal and translation capabilities to the Flash-Lite model. External benchmarks show Gemini 2.5 Flash-Lite making notable gains in speed and efficiency, making it one of the fastest proprietary models currently tracked. Google also introduced rolling aliases (such as gemini-flash-latest) for faster iteration, but recommends pinning fixed model strings in production for stability. The new models additionally improve long-context handling, tool calling, and cost efficiency.

✨ **Model performance gains**: Gemini 2.5 Flash is stronger at agentic tool use and multi-step reasoning, with a significantly higher SWE-Bench Verified score, indicating progress in long-horizon planning and code navigation. Gemini 2.5 Flash-Lite focuses on more precise instruction following, reduced output verbosity, and improved multimodal and translation capabilities; internal testing shows a large drop in output token counts.

🚀 **Speed and efficiency breakthrough**: In external independent benchmarks, the Gemini 2.5 Flash-Lite preview (09-2025) reached an output speed of roughly 887 tokens/s in AI Studio throughput tests, making it the fastest proprietary model in that tracker. Meanwhile, Gemini 2.5 Flash emits about 24% fewer output tokens and Flash-Lite about 50% fewer, which directly cuts output-token cost and processing time, especially in throughput-constrained serving.
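The throughput and token-reduction figures above combine into a simple back-of-envelope latency estimate. A minimal sketch, assuming decode time is roughly output tokens divided by throughput (prefill and network time are ignored; the function name is illustrative):

```python
# Reported Flash-Lite preview output speed from the benchmark thread.
THROUGHPUT_TOK_S = 887.0

def decode_seconds(output_tokens: float, throughput: float = THROUGHPUT_TOK_S) -> float:
    """Rough seconds spent streaming the response body at a given throughput."""
    return output_tokens / throughput

# A 1,000-token reply vs. the same reply at ~50% fewer output tokens.
baseline = decode_seconds(1_000)
trimmed = decode_seconds(1_000 * 0.5)
print(round(baseline, 2), round(trimmed, 2))
```

Under these assumptions, halving output tokens halves streaming time, which is why the verbosity reduction compounds with the raw throughput gain.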

💡 **Cost and context optimization**: Gemini 2.5 Flash-Lite's GA base price is $0.10 per million input tokens and $0.40 per million output tokens. It supports a context window of up to 1 million tokens, along with a configurable "thinking budget" and tool connectivity (such as search grounding and code execution), making it well suited to agentic systems that interleave reading, planning, and multiple tool calls.

🔄 **Alias management and production guidance**: Google introduced rolling aliases such as `gemini-flash-latest` and `gemini-flash-lite-latest` so users can quickly adopt the newest preview. For stability and predictability in production, however, Google recommends pinning specific model strings (such as `gemini-2.5-flash`) and watching for behavioral changes when switching to an alias. Google will give two weeks' email notice before an alias is retargeted.
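The GA prices quoted above make per-request cost a one-line calculation. A minimal sketch using the Flash-Lite base rates from the post (the helper name is hypothetical, and real bills may differ with caching, thinking tokens, or tiered pricing):

```python
# GA base prices for Gemini 2.5 Flash-Lite, per the post (USD per token).
INPUT_PRICE = 0.10 / 1_000_000   # $0.10 per 1M input tokens
OUTPUT_PRICE = 0.40 / 1_000_000  # $0.40 per 1M output tokens

def flash_lite_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated request cost in USD at the GA base rates."""
    return input_tokens * INPUT_PRICE + output_tokens * OUTPUT_PRICE

# Example: a 10k-token prompt with a 2k-token reply.
print(f"${flash_lite_cost(10_000, 2_000):.4f}")  # → $0.0018
```

A ~50% reduction in output tokens cuts only the second term, but since output tokens are 4x the price of input tokens here, verbose replies dominate cost for short prompts.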

Google released an updated version of Gemini 2.5 Flash and Gemini 2.5 Flash-Lite preview models across AI Studio and Vertex AI, plus rolling aliases—gemini-flash-latest and gemini-flash-lite-latest—that always point to the newest preview in each family. For production stability, Google advises pinning fixed strings (gemini-2.5-flash, gemini-2.5-flash-lite). Google will give a two-week email notice before retargeting a -latest alias, and notes that rate limits, features, and cost may vary across alias updates.
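In code, the pin-versus-alias decision reduces to which model string you pass to the API. A minimal sketch of that selection, using the model IDs named above (the helper function and `production` flag are illustrative, not part of Google's SDK):

```python
# Fixed strings for production; rolling aliases for fast iteration.
PINNED = {
    "flash": "gemini-2.5-flash",
    "flash-lite": "gemini-2.5-flash-lite",
}
LATEST = {
    "flash": "gemini-flash-latest",
    "flash-lite": "gemini-flash-lite-latest",
}

def model_string(family: str, production: bool = True) -> str:
    """Return the model ID to request for this deployment tier."""
    table = PINNED if production else LATEST
    return table[family]

# Production pins a fixed string; a staging job can track the alias.
print(model_string("flash"))                         # gemini-2.5-flash
print(model_string("flash-lite", production=False))  # gemini-flash-lite-latest
```

Centralizing the lookup this way means an alias retarget (announced two weeks ahead by email, per the post) only affects jobs that opted into `-latest`, while production traffic stays on the pinned, predictable string.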

https://developers.googleblog.com/en/continuing-to-bring-you-our-latest-models-with-an-improved-gemini-2-5-flash-and-flash-lite-release/

What actually changed?


Independent Stats from the community thread

Artificial Analysis (the account behind the AI benchmarking site) received pre-release access and published external measurements across intelligence and speed. Highlights from the thread and companion pages:

Cost surface and context budgets (for deployment choices)

Browser-agent angle and the o3 claim

A circulating claim says the “new Gemini Flash has o3-level accuracy, but is 2× faster and 4× cheaper on browser-agent tasks.” This is community-reported, not in Google’s official post. It likely traces to private/limited task suites (DOM navigation, action planning) with specific tool budgets and timeouts. Use it as a hypothesis for your own evals; don’t treat it as a cross-bench truth.

Practical guidance for teams

Model strings (current)

Summary

Google’s new release tightens tool-use competence (Flash) and token/latency efficiency (Flash-Lite) and introduces -latest aliases for faster iteration. External benchmarks from Artificial Analysis indicate meaningful throughput and intelligence-index gains for the Sept 2025 previews, with Flash-Lite now testing as the fastest proprietary model in their harness. Validate on your own workload—especially browser-agent stacks—before committing to the aliases in production.

The post The Latest Gemini 2.5 Flash-Lite Preview is Now the Fastest Proprietary Model (External Tests) and 50% Fewer Output Tokens appeared first on MarkTechPost.
