Prime Highlights
- Google has released Gemini 3 Flash, a fast and cost-efficient AI model, now the default in the Gemini app and Google’s AI-powered search.
- The model delivers major performance upgrades, matching or exceeding leading AI models like Gemini 3 Pro and OpenAI’s GPT-5.2 on key benchmarks.
Key Facts
- Gemini 3 Flash scored 33.7% on Humanity’s Last Exam and 81.2% on the MMMU-Pro multimodal reasoning benchmark, outperforming its predecessor, Gemini 2.5 Flash.
- The model is globally available for consumers, businesses, and developers via Vertex AI, Gemini Enterprise, and a preview API, with pricing starting at $0.50 per million input tokens.
Background
Google has released Gemini 3 Flash, a new fast and cost-efficient artificial intelligence model, as it looks to strengthen its position against rivals such as OpenAI. The model is based on Gemini 3, which was introduced last month, and is now the default option in the Gemini app and Google’s AI-powered search experience.
The Gemini 3 Flash model arrives six months after the launch of Gemini 2.5 Flash and brings major performance upgrades. On key benchmarks, the new model performs far better than its predecessor and, in some areas, matches leading models such as Gemini 3 Pro and OpenAI’s GPT-5.2.
On Humanity’s Last Exam, a test designed to measure expertise across fields, Gemini 3 Flash scored 33.7% without tool use. This is a sharp rise from the 11% score of Gemini 2.5 Flash and comes close to Gemini 3 Pro’s 37.5% and GPT-5.2’s 34.5%. On the MMMU-Pro benchmark, which measures multimodal reasoning, Gemini 3 Flash led the field with an 81.2% score.
Google is rolling out the model globally for consumers. Users can upload videos, images, sketches, or audio and ask for analysis, tips, or explanations. The model also supports visual answers, app prototyping through prompts, and a better understanding of user intent.
For businesses and developers, Gemini 3 Flash is available through Vertex AI, Gemini Enterprise, and a preview API. Companies such as JetBrains, Figma and Harvey have already adopted it. Pricing starts at $0.50 per million input tokens and $3 per million output tokens.
Google said it now processes over one trillion tokens every day on its API, showing that more people are using its AI tools as competition in the industry grows.