GPT-5.5 Instant Is Now ChatGPT’s Default Model What Changes for You
Quick Answer
OpenAI replaced GPT-5.3 Instant with GPT-5.5 Instant as the default model for all ChatGPT users in early May 2026. The new model delivers 52.5% fewer hallucinations on high-stakes topics, smarter automatic routing between fast and deep reasoning modes, and arrives alongside three dedicated real-time voice models that bring GPT-5-class reasoning to live voice, including translation across 70 languages.

Key takeaways
- GPT-5.5 Instant is now the default model for all logged-in ChatGPT users, including free-tier accounts (10 messages per 5 hours).
- Hallucinations dropped 52.5% on high-stakes prompts covering medicine, law, and finance compared to GPT-5.3 Instant, per OpenAI’s internal evaluations.
- AIME 2025 math score jumped from 65.4 to 81.2, and multimodal reasoning improved from 69.2 to 76 on MMMU-Pro.
- Smart routing lets Instant automatically escalate complex queries to GPT-5.5 Thinking without burning a separate paid quota.
- Three new real-time voice models launched alongside: GPT-Realtime-2 (GPT-5-class reasoning in live voice), GPT-Realtime-Translate (70 languages, 13 output languages), and GPT-Realtime-Whisper (live transcription).
- Developers can access GPT-5.5 Instant via the API under the alias
chat-latest; GPT-5.3 stays available to paid users for three more months.
OpenAI has quietly made its biggest usability upgrade in months: GPT-5.5 Instant is now the default engine behind every ChatGPT conversation, for everyone from free users to Pro subscribers.
The update, which rolled out in early May 2026, replaces the previous GPT-5.3 Instant and pairs the core model update with a separate but related launch of three dedicated real-time voice models. Taken together, the changes affect how ChatGPT handles everyday queries, complex reasoning tasks, and live voice interactions across more than 70 languages.
Here is a precise breakdown of what changed, what the benchmarks actually show, and what this means for US business professionals and developers building on the API.
What actually changed with GPT-5.5 Instant
GPT-5.5 Instant targets two complaints that have followed ChatGPT since the GPT-5 generation launched in August 2025: verbosity and hallucinations. OpenAI says it addressed both directly.
On verbosity, the model uses 30.2% fewer words and 29.2% fewer lines per response than GPT-5.3 Instant, without dropping substance. On hallucinations, internal evaluations show a 52.5% reduction in hallucinated claims on high-stakes prompts in medicine, law, and finance, and a 37.3% drop in inaccurate claims on conversations users had previously flagged for factual errors.
- Cleaner formatting: Fewer unnecessary headers and bullet nesting, which makes responses more readable on mobile.
- Better algebra recovery: OpenAI demonstrated that GPT-5.5 Instant catches and corrects its own math errors mid-response, where GPT-5.3 Instant often stopped too early and returned a wrong final answer.
- Tone accuracy: Responses are described as more conversational and workplace-appropriate without over-explaining.
Smart routing and Memory Sources explained
Two new UX features came bundled with the model update and deserve attention separately.
Smart routing
When you select Instant in ChatGPT, the model can now automatically switch to GPT-5.5 Thinking if it detects a query that warrants deeper reasoning. This happens without consuming a separate Thinking quota. The routing decision is based on conversation type, complexity, available tools, and explicit intent signals in your prompt (phrases like “think hard about this” increase the probability of routing to Thinking).
If you manually select Thinking, ChatGPT always shows a Thinking trace. If Instant routes to Thinking automatically, the trace may not appear for brief reasoning sequences.
Memory Sources
A new “Sources” button in the interface shows users which historical conversations influenced a specific response. Users can delete or modify individual memory entries directly from this view, which addresses privacy concerns about persistent memory without requiring users to wipe their entire history.
GPT-5.5 Instant can also use its search tool to reference past conversations, uploaded files, and connected Gmail accounts for more personalized answers. This feature launched for Plus and Pro users on the web, with mobile rollout scheduled for later in 2026.
The three new real-time voice models
OpenAI shipped three dedicated voice models alongside the GPT-5.5 Instant launch. These are separate from the main ChatGPT interface and are available via the API. The design decision is notable: rather than bundling everything into one voice assistant, OpenAI split conversational reasoning, translation, and transcription into discrete components.
| Model | Primary function | Key capability |
|---|---|---|
| GPT-Realtime-2 | Live conversational reasoning | First voice model with GPT-5-class reasoning. 128K context window (4x previous). Handles complex requests mid-conversation. |
| GPT-Realtime-Translate | Real-time speech translation | Translates speech across 70 input languages with 13 output languages. Designed for customer support, travel, and media workflows. |
| GPT-Realtime-Whisper | Live transcription | Streaming speech-to-text. Designed as a standalone transcription primitive for developers who do not need a full conversational agent. |
All three models are available via the OpenAI API. As of May 2026.
The split-model architecture matters for enterprise deployments. Voice agents built on a single monolithic assistant have historically required developers to build session resets and state compression layers to handle context limits. With GPT-Realtime-2’s 128K context window and GPT-Realtime-Translate as a standalone component, those workarounds become less necessary for multilingual customer-support and travel workflows.
Benchmark results and what they mean in practice
OpenAI published several benchmark comparisons between GPT-5.5 Instant and its predecessor. The numbers are worth reviewing because they point to where the real improvements land in everyday use.
| Benchmark | GPT-5.3 Instant | GPT-5.5 Instant | Change |
|---|---|---|---|
| AIME 2025 (math) | 65.4 | 81.2 | +24% |
| MMMU-Pro (multimodal reasoning) | 69.2 | 76.0 | +9.8% |
| Hallucinations on high-stakes prompts | Baseline | 52.5% fewer | Significant |
| Inaccurate claims (flagged conversations) | Baseline | 37.3% fewer | Significant |
| Response length (words) | Baseline | 30.2% fewer | Leaner |
Data from OpenAI internal evaluations, May 2026. Hallucination figures are from internal benchmarks and have not been independently replicated.
One caveat worth noting: the hallucination figures come from OpenAI’s own internal evaluations. Independent third-party testing on GPT-5.5 Instant had not been published at the time of this article. The math and multimodal benchmark scores are more easily reproducible and align with third-party observations of improved reasoning consistency.
For a broader view of how GPT-5.5 stacks up against Claude Opus 4.7 and Gemini 3.1 Pro, see our complete LLM comparison guide for 2026.
Access tiers and pricing: who gets what
GPT-5.5 Instant is available across all ChatGPT tiers. The split in capability by plan is more nuanced than previous generations.
| Plan | Price | GPT-5.5 Instant access | Thinking access |
|---|---|---|---|
| Free | $0 | 10 messages / 5 hours | Via smart routing only |
| Plus | $20/month | 160 messages / 3 hours | Yes, manual + smart routing |
| Pro | $200/month | Unlimited, 128K context | Unlimited (GPT-5.5 Pro) |
Prices in USD. As of May 2026. GPT-5.5 Instant is also the first Instant-tier model rated “High Capability” in cybersecurity and biology under OpenAI’s internal safety classifications.
For developers, GPT-5.5 Instant is available in the API as chat-latest. GPT-5.3 Instant remains available to paid API users for three months before deprecation.
For a detailed breakdown of what ChatGPT Pro gets you versus the Plus plan, see our guide on Claude Pro vs ChatGPT Plus 2026.
Frequently asked questions
What is GPT-5.5 Instant?
GPT-5.5 Instant is OpenAI’s latest default model for ChatGPT, released in May 2026. It replaced GPT-5.3 Instant and delivers faster responses with 52.5% fewer hallucinations on high-stakes topics like medicine, law, and finance. It also introduces smart routing that can automatically escalate complex queries to the deeper GPT-5.5 Thinking model.
Is GPT-5.5 Instant free to use?
Yes. Free ChatGPT users get access to GPT-5.5 Instant with a limit of 10 messages every 5 hours. ChatGPT Plus subscribers ($20 per month) receive 160 messages every 3 hours. Pro subscribers get unlimited access plus a 128K context window.
What is smart routing in GPT-5.5 Instant?
Smart routing is a feature that lets GPT-5.5 Instant automatically switch to the GPT-5.5 Thinking model when it detects a query is too complex for a fast response. This happens without consuming a separate paid quota, giving users deeper reasoning on difficult questions without manually changing models.
How does GPT-5.5 Instant handle voice and real-time translation?
OpenAI released three dedicated voice models alongside GPT-5.5 Instant: GPT-Realtime-2 (real-time conversational reasoning at GPT-5 class), GPT-Realtime-Translate (speech translation across 70 languages with 13 output languages), and GPT-Realtime-Whisper (live transcription). GPT-Realtime-2 quadruples the context window to 128K tokens, significantly reducing the orchestration overhead common in enterprise voice deployments.
How does GPT-5.5 Instant compare to GPT-5.3 Instant on benchmarks?
GPT-5.5 Instant scored 81.2 on the AIME 2025 math test versus 65.4 for GPT-5.3 Instant. On the MMMU-Pro multimodal reasoning benchmark it scored 76 versus 69.2. It also produces 30% fewer words per response with no loss of substance, according to OpenAI’s internal evaluations.
Final thoughts
GPT-5.5 Instant is a meaningful upgrade, not a cosmetic rebrand. The hallucination reductions on medical, legal, and financial prompts are the most significant change for professional users. The smart routing system reduces friction for anyone who regularly toggles between fast and deep reasoning modes. And the three new real-time voice models represent a shift in how OpenAI is architecting voice for enterprise: modular components rather than a single product.
The pieces that still need independent verification are the hallucination figures. OpenAI’s internal evals are the only data available today. Watch for third-party benchmarks from labs like Scale AI and Eleuther AI over the next few weeks before treating the 52.5% number as settled.
To see how GPT-5.5 stacks up against the full field of 2026 AI models, start with our complete LLM comparison guide.
Get the daily AI brief
Delivered at 7:30 AM EST, Monday to Friday. The signal without the noise. Free. No fluff. Unsubscribe anytime.
Sources
- OpenAI. “GPT-5.5 Instant: smarter, clearer, and more personalized.” OpenAI Blog, May 2026.
- OpenAI. “Introducing GPT-5.5.” OpenAI Blog, April 23, 2026.
- TechCrunch. “OpenAI releases GPT-5.5 Instant, a new default model for ChatGPT.” May 5, 2026.
- VentureBeat. “OpenAI brings GPT-5-class reasoning to real-time voice.” May 2026.
- OpenAI Help Center. “GPT-5.5 in ChatGPT.” May 2026.
Fact-checked:



