Name: GPT-5.2: 400K Context, 92.4% GPQA Diamond (2026)
Brand: OpenAI
Price: 1.75 USD
Availability: InStock

Question 1

What is GPT-5.2 and who built it?

Accepted Answer

GPT-5.2 is a multimodal large language model built by OpenAI, released on December 11, 2025 as the fourth variant in the GPT-5 product family. It uses a Mixture-of-Experts (MoE) transformer architecture, where tokens are routed to a subset of expert networks per inference call, reducing compute cost while maintaining high total model capacity. OpenAI has not published the parameter count, but inference cost patterns and MoE routing behavior suggest between 2 and 5 trillion total parameters. The model launched with three service tiers: Instant (low latency, no chain-of-thought), Thinking (extended reasoning before responding), and Pro (maximum quality). A coding-specialized companion, GPT-5.2-Codex, launched the same day with additional agent sandboxing and configurable network access. The model improved over GPT-5.1 across agentic coding (SWE-bench Verified), graduate-level science reasoning (GPQA Diamond), and advanced math (AIME 2025, perfect score in Thinking tier). GPT-5.2 was deprecated from the API on May 8, 2026 and fully retired from ChatGPT on June 12, 2026, with GPT-5.5 as the designated successor.

Question 2

How much does GPT-5.2 cost per 1M tokens?

Accepted Answer

GPT-5.2 was priced at $1.75 per 1M input tokens and $14.00 per 1M output tokens through the OpenAI API. Cached input tokens, available after a minimum 1,024-token cache match, cost $0.18 per 1M, a 90% reduction from the standard input rate. Artificial Analysis computed a blended rate of $1.87 per 1M tokens using a 7:2:1 cache-hit weighting. Summarizing a 100,000-token research paper costs approximately $0.20 in total tokens at those rates. A daily coding agent consuming 1M input and 200K output tokens costs about $4.55 per day. The $14/M output price was a frequent developer complaint for generation-heavy workloads like bulk test scaffolding or long-form code generation. GPT-5.5, the successor, reduced output token pricing as part of its release rationale, making it the cost-preferred option for output-heavy tasks.

Question 3

What is GPT-5.2's context window and max output?

Accepted Answer

GPT-5.2 features a 400,000-token context window and a 128,000-token maximum output limit. The 400K context is triple the 128,000-token window in GPT-5 and GPT-5.1, making it the largest context window in the GPT-5 family at the time of release. OpenAI reported over 99% recall accuracy on internal needle-in-haystack tests at full 400K depth. Independent evaluations found modest degradation above 350,000 tokens, with recall for early-context instructions dropping from 99% to around 88% near the ceiling. For comparison, Claude Opus 4.5 offered 200K context and Gemini 3 Pro offered a 1M-token window; GPT-5.2 sat between those two on raw size but demonstrated stronger recall at the 100K-to-300K range in independent tests. The model handles PDFs, multi-image inputs, audio transcripts, and long video content within the same context limit. For full-codebase review, multi-document legal analysis, or long transcript synthesis without chunking, the 400K window covered the majority of real enterprise workloads.

Question 4

How does GPT-5.2 compare on benchmarks vs Claude and Gemini?

Accepted Answer

GPT-5.2 led on advanced math and science reasoning at launch. On AIME 2025, GPT-5.2 Thinking achieved a perfect 100%, outperforming Gemini 3 Pro which lagged by 5 percentage points. On FrontierMath, GPT-5.2 scored 40.3%, ahead of Claude Opus 4.5 at 37.6% and Gemini 3 Pro at 31.1%. On GPQA Diamond (graduate-level science), GPT-5.2 hit 92.4%, a 4.3-point improvement over GPT-5.1 and ahead of the Claude and Gemini releases at the time. For coding on SWE-bench Verified, GPT-5.2 reached 80%, which was competitive with the top frontier models at launch. On video understanding with Video-MMMU, GPT-5.2 scored 90.5% versus Gemini 3 Pro's 87.6%. On the LMArena human-preference leaderboard (crowd-sourced votes), GPT-5.2 reached Elo 1402 in May 2026, placing third behind Claude Opus 4.6 at 1418 and Gemini 3.1 Pro at 1406, with overlapping confidence intervals across all three meaning the gap is not statistically decisive. In practice, task type and cost efficiency should drive selection over raw leaderboard rank.

Question 5

Is GPT-5.2 open source or proprietary?

Accepted Answer

GPT-5.2 is fully proprietary: the model weights are closed, it is API-only, and there is no self-hosting path. During its active period, access was through the OpenAI direct API at platform.openai.com and through Microsoft Azure OpenAI Service. GPT-5.2 did not receive its own AWS Bedrock listing before deprecation; Bedrock support for the GPT family launched later with GPT-5.4 and GPT-5.5 in mid-2026. No availability on Google Vertex AI, Together AI, or Fireworks AI is documented for GPT-5.2 specifically. OpenAI's separate open-weight releases, gpt-oss-20b and gpt-oss-120b, are entirely distinct from the GPT-5.2 family and do not share its weights or architecture. All commercial use of GPT-5.2 was governed by OpenAI's standard API commercial terms. There is no VRAM requirement, quantization path, or container image for self-hosted GPT-5.2 deployment.

Question 6

What modalities does GPT-5.2 support?

Accepted Answer

GPT-5.2 accepts text, images, audio, video, and PDFs as inputs within a single API request, and generates text and tool-call outputs. Multimodal inputs can be mixed in one request rather than requiring separate API calls for each modality type. Video understanding scored 90.5% on Video-MMMU, and chart comprehension on CharXiv with Python reached 88.7%. Audio input is supported for transcription and reasoning over spoken content. Audio output is not available from the base API; text-to-speech requires a separate TTS model call. Function calling and structured JSON output are fully supported, including parallel tool calls within a single completion response. Computer use (screen reading and UI interaction) is available in the ChatGPT product interface but is not exposed as a direct API capability in GPT-5.2. Code execution is supported via tool-call integrations rather than a native sandboxed environment in the base API.

Question 7

Does GPT-5.2 train on user data?

Accepted Answer

OpenAI does not train on API inputs by default for GPT-5.2 or any other GPT-5 family model. API inputs and outputs are retained for up to 30 days for abuse monitoring, then deleted unless flagged for review. Enterprise customers can enable a zero-data retention option via a direct agreement with OpenAI, in which case no input or output data is stored beyond the API call. OpenAI's API terms prohibit using API traffic for model training unless the user explicitly opts in, which is not the default for business or enterprise accounts. GPT-5.2 API access is covered by OpenAI's SOC 2 Type II certification. HIPAA-eligible configurations are available through a Business Associate Agreement with OpenAI. GDPR compliance applies for EU users under OpenAI's Data Processing Agreement. When accessed through Azure OpenAI Service, data handling follows Microsoft's Azure retention and residency policies, which may differ from OpenAI's direct API terms and can include EU data residency options.

Question 8

Who is GPT-5.2 best for and who should avoid it?

Accepted Answer

During its active period, GPT-5.2 was best for teams running large-context document analysis using the 400K window, agentic coding pipelines where 80% SWE-bench and 98.7% Tau2-bench tool accuracy mattered, and advanced math or science reasoning tasks where 100% AIME and 40.3% FrontierMath performance led the field. Enterprise teams on Azure with existing OpenAI SDK integrations found low migration overhead from GPT-5.1. However, GPT-5.2 is deprecated as of June 2026, making it a poor choice for any new project: the API cutoff will interrupt service, and GPT-5.5 or GPT-5.4 should be the migration target. The $14/M output price made it expensive for bulk generation workloads; teams running content pipelines or large-scale test generation were better served by GPT-5.4 once it released. Voice-first applications should avoid GPT-5.2 since it produces no audio output natively. On-device or air-gapped deployments are not possible given the closed-weights, API-only architecture.

GPT-5.2: 400K Context, 92.4% GPQA Diamond (2026)

About GPT-5.2

Pricing

Key Features

Pros

Cons

Benchmarks

Frequently Asked Questions