Google Gemini 1.5 Flash

Google Gemini 1.5 Flash is a lightweight multimodal model designed for high-volume, low-latency tasks. This makes it efficient for large-scale use cases such as summarization, multimodal processing, and categorization.

Model details

| Item | Value | Description |
| --- | --- | --- |
| Model name | Google Gemini 1.5 Flash | The name of the model. |
| API model name | google__gemini_1_5_flash_001 | The name of the model used in the Box AI API for model overrides. You must provide this exact name for the API to work. |
| Hosting layer | Google | The trusted organization that securely hosts the LLM. |
| Model provider | Google | The organization that provides this model. |
| Release date | May 14, 2024 | The release date for the model. |
| Knowledge cutoff date | November 2023 | The date after which the model does not get any information updates. |
| Input context window | 1M tokens | The number of tokens supported by the input context window. |
| Maximum output tokens | 8k tokens | The number of tokens that can be generated by the model in a single request. |
| Empirical throughput | 176 | The number of tokens the model can generate per second. |
| Open source | No | Specifies if the model's code is available for public use. |
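
The API model name is the value passed when overriding the default model in a Box AI request. The following is a minimal sketch of how such a request body might be assembled. The helper `build_ask_payload`, the sample file ID, and the exact field layout (`ai_agent`, `basic_text`, `mode`) are assumptions made for illustration; check the Box AI API reference for the authoritative request schema.

```python
import json

# Exact API model name, copied from the model details table.
GEMINI_1_5_FLASH = "google__gemini_1_5_flash_001"


def build_ask_payload(prompt: str, file_id: str, model: str = GEMINI_1_5_FLASH) -> dict:
    """Build a request body that overrides the model.

    Hypothetical helper: the field layout below is an assumption,
    not a confirmed schema.
    """
    return {
        "mode": "single_item_qa",
        "prompt": prompt,
        "items": [{"type": "file", "id": file_id}],
        "ai_agent": {
            "type": "ai_agent_ask",
            # The override must use the exact API model name.
            "basic_text": {"model": model},
        },
    }


# Placeholder file ID, for illustration only.
payload = build_ask_payload("Summarize this document.", "1234567890")
print(json.dumps(payload, indent=2))
```

Keeping the model name in a single constant avoids typos, since the API expects the name exactly as listed.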

Additional documentation

For additional information, see the official Google Gemini 1.5 Flash documentation.