Google Gemini 3.1 Flash Lite is a multimodal model designed for high-speed, cost-efficient tasks like basic summarization, Q&A, and structured data extraction. It is a standard-tier model suited to high-volume, low-complexity use cases.

Model details

| Item | Value | Description |
| --- | --- | --- |
| Model name | Google Gemini 3.1 Flash Lite | The name of the model. |
| Model category | Standard | The category of the model: Standard or Premium. |
| API model name | google__gemini_3_1_flash_lite | The name of the model that is used in the API. You must provide this exact name for the API to work. |
| Compliance | N/A | Government compliance frameworks and authorizations applicable to this model. |
| Hosting layer | Google | The trusted organization that securely hosts the LLM. |
| Model provider | Google | The organization that provides this model. |
| Release date | March 4th, 2026 | The release date for the model. |
| Knowledge cutoff date | January 2025 | The date after which the model does not receive information updates. |
| Input context window | 1M tokens | The number of tokens supported by the input context window. |
| Maximum output tokens | 65k tokens | The number of tokens that can be generated by the model in a single request. |
| Empirical throughput | Not specified | The number of tokens the model can generate per second. |
| Open source | No | Specifies whether the model's code is available for public use. |
| Compliance | N/A | The compliance certifications applicable to this model. |
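
The table gives an exact API model name and two token limits. As a minimal sketch, the snippet below shows one way a client could keep those values in one place and check a request against them before sending it. The function `build_request` and the payload field names (`model`, `prompt`, `max_output_tokens`) are hypothetical and do not describe a real API schema. Reading "65k" as 65,000 is also an assumption; the exact limit may differ.

```python
# Values copied from the model details table.
API_MODEL_NAME = "google__gemini_3_1_flash_lite"  # must match exactly
INPUT_CONTEXT_TOKENS = 1_000_000  # "1M tokens"
MAX_OUTPUT_TOKENS = 65_000        # "65k tokens", read as 65,000 (assumption)


def build_request(prompt: str, input_tokens: int, max_output_tokens: int) -> dict:
    """Build a hypothetical request payload after checking the documented limits.

    input_tokens is the caller's own token count for the prompt; this sketch
    does not tokenize anything itself.
    """
    if input_tokens > INPUT_CONTEXT_TOKENS:
        raise ValueError(
            f"input of {input_tokens} tokens exceeds the "
            f"{INPUT_CONTEXT_TOKENS}-token context window"
        )
    if not 0 < max_output_tokens <= MAX_OUTPUT_TOKENS:
        raise ValueError(
            f"max_output_tokens must be between 1 and {MAX_OUTPUT_TOKENS}"
        )
    # Field names are illustrative only, not a real API schema.
    return {
        "model": API_MODEL_NAME,
        "prompt": prompt,
        "max_output_tokens": max_output_tokens,
    }


if __name__ == "__main__":
    request = build_request("Summarize this document.", 1_200, 1_024)
    print(request["model"])
```

Keeping the model name in a single constant avoids typos in a string that, per the table, has to be supplied exactly.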

Additional documentation

For more information, see .