Supported AI models - Box Dev Docs

Box supports a variety of AI models, categorized along two dimensions: access level and capability tier.

Access levels

Level	Description	Configuration Required
Core models	Built into Box AI and available by default for all customers	None
Customer-enabled models	Require activation by Box admins in the Admin Console or a request to Box. Some models may be subject to additional terms or pricing.	Yes

Capability tiers

Tier	Description	Best For
Standard models	High-speed, cost-efficient performance	Basic summarization, Q&A, structured data extraction from shorter or simpler documents. High-volume, low-complexity use cases
Premium models	Advanced reasoning, larger context windows, better performance on complex content	Multi-step reasoning, large taxonomies, lengthy or unstructured documents, domain-specific content

A model can be both customer-enabled and premium, or core and standard. In other words, access level and capability tiers are independent categorizations (for example, models can be either capability tier regardless of access level). The two categorizations are complementary.

Using models

How to use the supported AI models:

Get the .
Override the AI agent configuration used in , , , endpoints.

When using the model parameter in your API calls, use the API Name visible on each model card listed below. For example, to get the AI agent configuration for a specific model, use the parameter and provide the openai__gpt_5_mini API name. Make sure you use two underscores after the provider name.

The list may change depending on the model availability.

Core models

Box AI is powered by the following AI models. These models are integrated with Box AI to facilitate various use cases while adhering to enterprise grade standards. Below, you’ll find information about each model, including its capabilities, intended applications, and applicable usage guidelines.

Models offered in Preview mode have not been fully performance-tested at scale. You may experience variability in output quality, availability, and accuracy.

OpenAI

openai__gpt_5_6_sol

Most capable GPT-5.6 model, built for complex, high-stakes professional workPremiumCompatible with Box Agent

openai__gpt_5_6_terra

Multimodal model for broad general-purpose work and most coding tasksPremiumCompatible with Box Agent

openai__gpt_5_5

Advanced model for the most complex professional workPremiumCompatible with Box Agent

openai__gpt_5_4

Multimodal model for both broad general-purpose work and most coding tasksPremiumCompatible with Box Agent

openai__gpt_5_4_mini

Faster, more cost-efficient version of GPT-5.4Standard

openai__gpt_5_2

Multimodal model for coding and agentic tasks across various industriesPremiumCompatible with Box Agent

openai__gpt_5_1

Multimodal model with enterprise-grade performance and adaptive reasoningPremium

openai__gpt_5

Multimodal model with advanced reasoning and long-context understandingPremium

openai__gpt_5_mini

Model designed for well-defined tasks and precise prompts, suitable for lightweight tasksStandard

azureopenaitext_embedding_ada_002

2nd-generation embedding model for text search, code search, and sentence similarityEmbeddingsStandardISMAPFedRAMP ModerateFedRAMP HighDoD IL2

Google

google__gemini_3_5_flash

Fast, multimodal model with advanced reasoning for demanding enterprise tasksPremiumFedRAMP ModerateFedRAMP HighDoD IL4DoD IL5Compatible with Box Agent

google__gemini_3_1_flash_lite

Gemini multimodal model for high-speed and cost-efficient tasksStandardFedRAMP High

google__gemini_2_5_pro

Multimodal model with a 1 million token context window and advanced reasoning capabilitiesPremiumISMAPFedRAMP ModerateFedRAMP HighDoD IL5

google__gemini_2_5_flash

Multimodal model offering well-round capabilites, including thinking capabilitiesStandardISMAPFedRAMP ModerateFedRAMP HighDoD IL5

Anthropic

aws__claude_sonnet_5

Powerful, versatile model built for daily use, scaled production, and complex tasksPremiumFedRAMP ModerateCompatible with Box Agent

aws__claude_4_8_opus

Premium model from Anthropic, hosted on Amazon Web ServicesPremiumFedRAMP ModerateCompatible with Box Agent

aws__claude_4_7_opus

Multimodal model for coding, enterprise agents, and professional workPremiumISMAPFedRAMP ModerateCompatible with Box Agent

aws__claude_4_6_opus

Multimodal model for complex tasks with a 1 million token context windowPremiumISMAPFedRAMP ModerateCompatible with Box Agent

aws__claude_4_6_sonnet

Multimodal model for complex tasks with a 1 million token context windowPremiumISMAPFedRAMP ModerateCompatible with Box Agent

aws__claude_4_5_opus

Premium model combining maximum intelligence with practical performancePremiumISMAPFedRAMP ModerateCompatible with Box Agent

aws__claude_4_5_sonnet

Model that excels at complex agents, coding, and autonomous multi-step workflowsPremiumISMAPFedRAMP ModerateCompatible with Box Agent

aws__claude_4_5_haiku

Fast and cost-efficient model with strong reasoning, best for low-latency applicationsStandardISMAPFedRAMP ModerateCompatible with Box Agent

ibm__llama_4_maverick

Multimodal model with a 1 million token context window and advanced reasoning capabilitiesStandard

Mistral AI

ibm__mistral_medium_2505

High-performance enterprise model for coding and advanced reasoningPreviewStandard

ibm__mistral_small_3_1_24b_instruct_2503

Fast open-source multimodal model with low latencyPreviewStandard

Customer-enabled models

Certain Box AI customers may enable additional AI models upon their request or upon the models otherwise being made available through their Admin Console. Use of these models may be subject to additional terms. By selecting a customer-enabled model, the customer acknowledges that their data may be processed by additional subprocessors of their choice.

Models offered in Beta mode are provided on an as-is basis.

OpenAI

openai__gpt_o3

Multimodal model, highly efficient in handling complex, multi-step tasksBetaPremium

Google

google__gemini_3_1_pro

Model for complex tasks with advanced reasoning and a large context windowBetaPremium

google__gemini_3_flash

Multimodal model designed for speed and efficiency across a wide range of tasksBetaStandard

Anthropic

aws__claude_fable_5

Premium model from Anthropic, hosted on Amazon Web ServicesBetaPremiumCompatible with Box Agent

Default models

The following tables list the default AI models used when you do not override the agent configuration. The model that runs depends on the agent type, the provider, and the capability tier.

Default settings shown below may not be applicable for certain customers with specific configurations or requirements.

Legacy agents

Legacy agents represent the initial generation of Box AI capabilities, designed to handle basic, single-step tasks such as simple Q&A and document summarization. The table below specifies available models for a particular use case.

Use case	Provider	Model for Standard Version	Model for Advanced or Enhanced Version
Q&A for Documents, Notes, and Hubs	OpenAI (Default)
	Claude
	Llama

Box agents

Box agents are the next generation of intelligent assistants built with advanced reasoning and orchestration, capable of planning and executing complex, multi-step workflows across your entire Box content ecosystem. The table below specifies available models for each specialized agent type.

Agent	Model for Standard Version	Model for Pro or Enhanced Version
Box Agent
Extract Agent
Security Classification Agent		N/A
Threat Analysis Agent		N/A

Retired models

The following models are retired and no longer available in Box AI.

Model name	API name
Azure OpenAI GPT-4.1	`azure__openai__gpt_4_1`
Azure OpenAI GPT-4.1 mini	`azure__openai__gpt_4_1_mini`
Azure OpenAI GPT-4o	`azure__openai__gpt_4o`
Azure OpenAI GPT-4o mini	`azure__openai__gpt_4o_mini`
Claude 4 Opus	`aws__claude_4_opus`
Claude 4 Sonnet	`aws__claude_4_sonnet`
Titan Text Lite	`aws__titan_text_lite`
Gemini 2.0 Flash	`google__gemini_2_0_flash_001`
Gemini 2.0 Flash Lite	`google__gemini_2_0_flash_lite_preview`
Gemini 3 Pro	`google__gemini_3_pro`
Llama 4 Scout	`ibm__llama_4_scout`
Llama 3.2 90B Vision	`ibm__llama_3_2_90b_vision_instruct`
Grok 3 Beta	`xai__grok_3_beta`
Grok 3 mini Beta	`xai__grok_3_mini_beta`

​Access levels

​Capability tiers

​Using models

​Core models

openai__gpt_5_6_sol

openai__gpt_5_6_terra

openai__gpt_5_5

openai__gpt_5_4

openai__gpt_5_4_mini

openai__gpt_5_2

openai__gpt_5_1

openai__gpt_5

openai__gpt_5_mini

azure__openai__text_embedding_ada_002

google__gemini_3_5_flash

google__gemini_3_1_flash_lite

google__gemini_2_5_pro

google__gemini_2_5_flash

aws__claude_sonnet_5

aws__claude_4_8_opus

aws__claude_4_7_opus

aws__claude_4_6_opus

aws__claude_4_6_sonnet

aws__claude_4_5_opus

aws__claude_4_5_sonnet

aws__claude_4_5_haiku

ibm__llama_4_maverick

ibm__mistral_medium_2505

ibm__mistral_small_3_1_24b_instruct_2503

​Customer-enabled models

openai__gpt_o3

google__gemini_3_1_pro

google__gemini_3_flash

aws__claude_fable_5

​Default models

​Legacy agents

​Box agents

​Retired models

Access levels

Capability tiers

Using models

Core models

azureopenaitext_embedding_ada_002

Customer-enabled models

Default models

Legacy agents

Box agents

Retired models