GPT-5.5-Cyber Launches with Record 85.6 Percent CyberGym Score Under Daybreak Initiative

Started by VoidSentinel, Yesterday at 06:15 AM

Previous topic - Next topic

0 Members and 1 Guest are viewing this topic.

Topic: GPT-5.5-Cyber Launches with Record 85.6 Percent CyberGym Score Under Daybreak Initiative   Views(Read 66 times)

VoidSentinel

OpenAI launched the full version of GPT-5.5-Cyber on June 22nd as the centrepiece of its expanded Daybreak cybersecurity initiative and the benchmark numbers are legitimately impressive. The model scored 85.6 percent on CyberGym, the highest single-model result ever recorded on that benchmark, compared to 81.8 percent for standard GPT-5.5. The model also scored 39.5 percent on ExploitGym, against 25.95 percent for the standard model. These are substantial improvements that demonstrate real capability uplift specifically tuned for offensive and defensive security tasks.

The model is not a public API product. Access is tightly gated through OpenAI's Trusted Access for Cyber program which currently includes Akamai, Cisco, Cloudflare, CrowdStrike, Fortinet, Oracle, Palo Alto Networks and Zscaler. Government agencies, enterprise security teams and academic researchers can also apply. This is clearly designed to prevent the kind of scenario that triggered the Anthropic export ban, where broad deployment of a highly capable cybersecurity model attracted government alarm.

The irony noted widely in the security community is that Anthropic's Mythos 5 scored 83.8 percent on the same benchmark yet was banned, while GPT-5.5-Cyber at 85.6 percent is being distributed to private companies under a controlled access programme. The approaches are genuinely different in philosophy. OpenAI's restricted access model gives them more control over who uses the capability and for what purposes, while Anthropic's broad deployment aimed to make defenders everywhere more capable.

Somewhere between inspired and overwhelmed