Name: MAI-Thinking-1: Microsoft's 256K Reasoning Model (2026)
Brand: Microsoft

Question 1

What is MAI-Thinking-1 and who built it?

Accepted Answer

MAI-Thinking-1 is Microsoft AI's first in-house flagship reasoning model, announced at Microsoft Build on June 2, 2026. It is a sparse Mixture of Experts model with 35 billion active parameters out of roughly 1 trillion total, built on Microsoft's internal MAI-Base-1 foundation model through what the company calls its Hill-Climbing Machine training pipeline. Unlike earlier Microsoft Copilot deployments built on OpenAI's models, MAI-Thinking-1 was trained entirely from scratch on commercially licensed data with zero distillation from any third-party model. It scores 97.0% on AIME 2025, 84.2% on GPQA Diamond, and 73.5% on SWE-bench Verified. Microsoft designed it to reduce Copilot and Foundry's dependency on OpenAI while competing with Claude Opus 4.6 on agentic coding benchmarks. It launched in private preview on Microsoft Foundry with a 256K context window.

Question 2

How much does MAI-Thinking-1 cost per 1M tokens?

Accepted Answer

Microsoft has not publicly disclosed per-token pricing for MAI-Thinking-1 as of its June 2, 2026 private preview launch on Microsoft Foundry. No input, output, or cached-input rates, batch discount, or provisioned throughput pricing have been published. For comparison, competing reasoning models in this benchmark tier such as OpenAI's o3 price around $10 per 1M input and $40 per 1M output tokens, and Google's Gemini 2.5 Pro prices around $1.25 per 1M input and $10 per 1M output tokens with a 200K thinking budget, but these are not MAI-Thinking-1's actual prices. Microsoft says pricing will be announced via the Azure AI Foundry Models pricing page ahead of the model's public preview. There is no free tier and no option to self-host, since the model is proprietary and API-only. Anyone budgeting for production use should treat any third-party cost estimate for MAI-Thinking-1 today as speculative until Microsoft publishes official rates.

Question 3

What is MAI-Thinking-1's context window and max output?

Accepted Answer

MAI-Thinking-1 ships with a 256,000-token context window, which Microsoft describes as enough to hold roughly 600 pages of text in a single request. Microsoft has not published a specific max output token limit for the model, which is unusual for a headline Build 2026 release; most 2026 frontier models disclose this figure alongside context window size. There is no publicly verified needle-in-haystack or long-context recall evaluation for MAI-Thinking-1 at launch. At 256K tokens, its context window sits below the 1M-token windows offered by Claude Opus 4.8, GPT-5.5, and Gemini 3.1 Pro, but above many mid-weight open-weights competitors. Document handling for PDFs or multi-file inputs has not been detailed in Microsoft's public materials. Developers should test actual generation length limits empirically during the Foundry private preview period.

Question 4

How does MAI-Thinking-1 compare on benchmarks vs Claude Opus 4.6?

Accepted Answer

Microsoft states that MAI-Thinking-1 is toe-to-toe with Claude Opus 4.6 on SWE-bench Pro, where MAI-Thinking-1 scores 52.8% despite having far fewer active parameters (35B vs Opus 4.6's larger footprint). MAI-Thinking-1 scores 73.5% on SWE-bench Verified, 97.0% on AIME 2025, 94.5% on AIME 2026, and 84.2% on GPQA Diamond. In blind human side-by-side evaluations across 1,276 tasks, raters preferred MAI-Thinking-1's responses over Anthropic's Sonnet 4.6, though this comparison is against Sonnet rather than Opus. These benchmark figures are Microsoft-reported at launch and have not yet been independently cross-verified by third-party leaderboards like LMArena or Artificial Analysis. A roughly 3-5 point gap on SWE-bench Pro or Verified translates to meaningfully fewer one-shot correct pull requests on real agentic coding tasks. Microsoft has not published ARC-AGI 2 or LMArena Elo scores for MAI-Thinking-1, an absence that makes independent cross-model ranking harder until those numbers appear.

Question 5

Is MAI-Thinking-1 open source or proprietary?

Accepted Answer

MAI-Thinking-1 is fully proprietary and closed-weight. There is no Hugging Face release, no downloadable weights, and no open or research-only license variant. Access is exclusively through Microsoft Foundry, currently in private preview as of June 2026, with a public preview planned via the MAI Playground. Microsoft has also committed to distributing the model through third-party inference providers including Fireworks AI, Baseten, and OpenRouter, plus AI gateway integrations like LiteLLM, Portkey, Azure AI Foundry Model Router, Helicone, and Kong AI Gateway, but none of these routes involve open weights; they are all hosted API access to the same closed model. There is no self-hosting path, no quantized GGUF release, and no VRAM requirement to publish since the model cannot be run locally. Commercial use is gated behind Microsoft Foundry's preview terms rather than a public license file.

Question 6

What modalities does MAI-Thinking-1 support?

Accepted Answer

MAI-Thinking-1 is a text-in, text-out reasoning model. Confirmed input modalities are text and tool-call payloads; confirmed output modalities are text and tool-calls. Microsoft has not announced vision, audio, or video input or output support for MAI-Thinking-1, unlike some competing frontier models that ship multimodal by default. The model supports native function calling, structured output, and a distinct developer-instruction role compatible with the Chat Completions API format. Parallel tool calls and computer-use style agent loops have not been explicitly confirmed in Microsoft's public materials. For multimodal workloads, Microsoft's separate Image 2.5 and Voice 2 models, released in the same MAI family wave, handle image and speech generation instead of MAI-Thinking-1 itself. Teams needing a single model that handles both reasoning and vision should look at Claude Opus 4.8, GPT-5.5, or Gemini 3.1 Pro instead.

Question 7

Does MAI-Thinking-1 train on user data?

Accepted Answer

Microsoft has not publicly detailed MAI-Thinking-1's default API data retention or training-on-inputs policy as of its June 2026 private preview launch. The model is served through Microsoft Foundry, which Microsoft markets as offering enterprise governance and Azure data residency controls, but specific SOC 2 Type II, ISO 27001, HIPAA-eligibility, and GDPR compliance statements have not been published for this specific model at launch, unlike some longer-established Azure OpenAI Service offerings. There is no confirmed zero-retention enterprise tier disclosed yet for MAI-Thinking-1 specifically. Given Microsoft's broader Azure compliance posture, enterprise customers evaluating this model for regulated workloads should request explicit data handling and compliance documentation directly from their Microsoft account team rather than relying on general Azure AI Foundry claims, since MAI-Thinking-1's own compliance certifications had not been separately published as of this writing.

Question 8

Who is MAI-Thinking-1 best for and who should avoid it?

Accepted Answer

MAI-Thinking-1 is best for enterprise teams already standardized on Microsoft Azure and Foundry who want a first-party reasoning model for agentic coding, math-heavy analysis, and tool-calling workflows without depending on OpenAI's models inside Copilot. It is also a strong fit for teams whose reasoning workloads lean toward math and graduate-science problems, given its 97.0% AIME 2025 and 84.2% GPQA Diamond scores. Teams should avoid MAI-Thinking-1 if they need vision or audio input today, since it is text-and-tool-calls only. Cost-sensitive teams needing firm budget numbers should also wait, since pricing remains undisclosed during private preview as of June 2026. Teams running long, messy multi-turn terminal agent automation should consider alternatives too, since MAI-Thinking-1's Terminal-Bench 2.0 score (46.0%) and MultiChallenge score (53.0%) trail top frontier agentic models like Claude Opus 4.6 and GPT-5.5 on those specific axes.

MAI-Thinking-1: Microsoft's 256K Reasoning Model (2026)

About MAI-Thinking-1

Pricing

Key Features

Pros

Cons

Benchmarks

Frequently Asked Questions