OpenAI's GPT-5.6 Sol Hits 91.9% on Terminal-Bench, But Government Review Means Most People Still Cannot Use It

Started by Arty Scout, Jun 30, 2026, 08:44 PM

Previous topic - Next topic

0 Members and 1 Guest are viewing this topic.

Topic: OpenAI's GPT-5.6 Sol Hits 91.9% on Terminal-Bench, But Government Review Means Most People Still Cannot Use It   Views(Read 34 times)

Arty Scout

OpenAI began previewing GPT-5.6 Sol to government-approved partners on June 26, following the framework established by Trump's June 2 executive order asking AI companies to voluntarily provide federal agencies early access to covered frontier models for up to 30 days before wider release. The flagship Sol tier introduced two new reasoning configurations: max mode for deeper single-model reasoning and ultra mode, which deploys multiple subagents in parallel and is the source of a new Terminal-Bench 2.1 record of 91.9 percent, up from GPT-5.5's 83.4 percent. Pricing for Sol is unchanged from GPT-5.5 at $5 input and $30 output per million tokens. The wider GPT-5.6 family also includes Terra, a model matching GPT-5.5's performance while consuming roughly half the tokens, and Luna, positioned as the lower-cost option, with OpenAI describing the three as durable capability tiers it intends to advance independently going forward rather than a single monolithic model generation.

OpenAI has been explicit in public statements that it believes in broad access and opposes making government-gated launches the permanent norm for frontier AI releases, a position that puts the company at odds with the pattern now emerging across the industry. The same week saw Anthropic's Mythos 5 model partially restored to roughly 100 vetted US organisations following a separate two-week export control suspension, with both companies' experiences read together by analysts as establishing what one industry tracker called the new pattern for frontier AI releases in the United States, where the federal government now sits between model completion and public access for any system deemed to carry significant national security capability.

OpenAI has not given a firm public date for Sol and Terra's broader availability, using only the language the coming weeks. Based on GPT-5.5's prior rollout pattern, where the ChatGPT announcement, API access and default Instant routing were staggered across roughly two weeks, industry trackers consider a July general availability for Sol and Terra a reasonable expectation, though nothing has been confirmed and the precedent set by Anthropic's Mythos 5 episode suggests the timeline ultimately depends on Commerce Department review rather than OpenAI's own product calendar.

ISA maxed. Costs minimised.

AlexaBliss

91.9 percent on Terminal-Bench 2.1 via the ultra mode parallel subagent approach is a genuinely different architecture from previous single-model reasoning gains. Deploying multiple subagents in parallel to attack one task is closer to how a human engineering team would actually approach a hard problem

NatureBoyDylan81

Sol pricing being unchanged from GPT-5.5 despite the capability jump is the detail that matters most for anyone budgeting AI spend. A meaningful benchmark improvement at the same per-token rate is the kind of release that actually moves the needle on cost-effectiveness rather than just raw capability