Gemini 3 Flash

Gemini 3 Flash combines Gemini 3 Pro's reasoning capabilities with the Flash line's levels on latency, efficiency, and cost. It not only enables everyday tasks with improved reasoning, but is designed to tackle the most complex agentic workflows.

Gemini 3 Flash uses several new features to improve performance, control, and multimodal fidelity:

For more information on using these features, see Get started with Gemini 3.

Try in Agent Platform View in Model Garden (Preview) Deploy example app

Note: To use the "Deploy example app" feature, you need a Google Cloud project with billing and Agent Platform API enabled.
Model ID gemini-3-flash-preview Supported inputs & outputs Token limits
  • Maximum input tokens: 1,048,576
  • Maximum output tokens: 65,536
  • Capabilities Consumption options See Consumption options for more information. Technical specifications Images Documents Video Audio Parameter defaults Supported regions

    Model availability

    See Deployments and endpoints for more information. Knowledge cutoff date January 2025 Versions Supported languages See Supported languages. Pricing See Pricing.