Google Gemini 3 Flash is a natively multimodal model designed for speed and efficiency across a wide range of tasks. It delivers strong performance across text, image, and code generation while remaining cost-effective.

Model details

| Item | Value | Description |
| --- | --- | --- |
| Model name | Google Gemini 3 Flash | The name of the model. |
| Model category | Premium | The category of the model: Standard or Premium. |
| API model name | google__gemini_3_flash | The name of the model that is used in the Box AI API for model overrides. The user must provide this exact name for the API to work. |
| Hosting layer | Google | The trusted organization that securely hosts the LLM. |
| Model provider | Google | The organization that provides this model. |
| Release date | December 17th, 2025 | The release date for the model. |
| Knowledge cutoff date | January 2025 | The date after which the model does not get any information updates. |
| Input context window | 1M tokens | The number of tokens supported by the input context window. |
| Maximum output tokens | 65k tokens | The number of tokens that can be generated by the model in a single request. |
| Empirical throughput | Not specified | The number of tokens the model can generate per second. |
| Open source | No | Specifies if the model's code is available for public use. |
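
The API model name listed above is the value passed in a model override. The following is a minimal sketch of how such a request body might be assembled in Python. The field layout (`mode`, `items`, `ai_agent`, `basic_text`, `model`) is an assumption based on the Box AI ask request shape and should be checked against the Box AI API reference. The helper `build_ask_payload` and the file ID are hypothetical and used only for illustration. The sketch builds the payload and sends nothing.

```python
import json

# Exact API model name from the model details; it must match character for character.
MODEL_NAME = "google__gemini_3_flash"


def build_ask_payload(file_id: str, prompt: str, model: str = MODEL_NAME) -> dict:
    """Build a request body that overrides the default model.

    The structure below is an assumed shape for a Box AI ask request;
    verify the field names against the official API reference.
    """
    return {
        "mode": "single_item_qa",
        "prompt": prompt,
        "items": [{"type": "file", "id": file_id}],
        "ai_agent": {
            "type": "ai_agent_ask",
            # The override: select the model by its API model name.
            "basic_text": {"model": model},
        },
    }


# Hypothetical file ID, for illustration only.
payload = build_ask_payload("1234567890", "Summarize this document.")
print(json.dumps(payload, indent=2))
```

Keeping the model name in a single constant avoids typos, which matters because the API expects the exact string.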

Additional documentation

For additional information, see the official Google Gemini 3 Flash documentation.