GPT-5.5 Bio Bug Bounty
OpenAI has launched a red-teaming bug bounty program specifically targeting biosafety risks in GPT-5.5, offering rewards up to $25,000. The program focuses on finding universal jailbreaks that could bypass biological safety guardrails. This represents a structured external adversarial evaluation of a frontier model's safety properties in a high-stakes domain.
Related guides (3)
Related events (8)
Anthropic expands model safety bug bounty to target universal jailbreaks in CBRN and cybersecurity domains
Anthropic is expanding its HackerOne-partnered bug bounty program to offer up to $15,000 for novel universal jailbreak attacks against a next-generation safety mitigation system not yet publicly deployed. The program specifically targets high-risk domains including CBRN (chemical, biological, radiological, nuclear) and cybersecurity, with participants given early access to test the new safeguards before release. The initiative begins as invite-only and aligns with Anthropic's commitments under the White House Voluntary AI Commitments and G7 Hiroshima Process Code of Conduct.
Introducing the OpenAI Safety Bug Bounty Program
OpenAI has launched a Safety Bug Bounty program targeting AI-specific abuse and safety risks. The program focuses on agentic vulnerabilities, prompt injection, and data exfiltration scenarios. This extends traditional security bug bounty models into AI safety territory, incentivizing external researchers to surface novel attack vectors.
OpenAI Launches Bug Bounty Program
OpenAI announced a formal bug bounty program to crowdsource security vulnerability discovery across its products and services. The initiative is framed as part of OpenAI's broader commitment to building secure and trustworthy AI systems. Researchers who find and responsibly disclose vulnerabilities will be eligible for rewards.
Anthropic launches bug bounty program to stress-test ASL-3 Constitutional Classifiers
Anthropic launched an invite-only bug bounty program in partnership with HackerOne to find universal jailbreaks in its Constitutional Classifiers system before public deployment, offering up to $25,000 per verified vulnerability. The program targets CBRN-related safety bypasses on Claude 3.7 Sonnet and is part of Anthropic's work to meet its AI Safety Level-3 (ASL-3) Deployment Standard under its Responsible Scaling Policy. A follow-up update extended the program to test Constitutional Classifiers on the new Claude Opus 4 model and began accepting reports of universal jailbreaks found on public platforms. The initiative reflects Anthropic's structured approach to pre-deployment safety validation for increasingly capable models.
GPT-5.1-Codex-Max System Card
OpenAI has published the system card for GPT-5.1-Codex-Max, a coding-focused model variant. The card details model-level safety mitigations including specialized safety training against harmful tasks and prompt injection attacks, as well as product-level controls such as agent sandboxing and configurable network access. This represents OpenAI's formal safety documentation for an agentic coding model deployment.
Medical Research with GPT-5
OpenAI published a blog post describing how GPT-5 is being used for medical research applications. The post appears to be an announcement or case study highlighting GPT-5's capabilities in a healthcare/research context. Specific details about methods, benchmarks, or outcomes are not provided in the available text.
OpenAI Launches GPT-5.5 and GPT-5.5-Cyber with Expanded Trusted Access for Cyber Program
OpenAI is expanding its Trusted Access for Cyber program with two new models: GPT-5.5 and GPT-5.5-Cyber, a specialized variant aimed at cybersecurity applications. The program provides verified defenders with access to these models to accelerate vulnerability research and protect critical infrastructure. This represents a continuation of OpenAI's strategy of releasing domain-specialized model variants with controlled access tiers for sensitive use cases.
OpenAI Expands Trusted Access for Cyber Defense Program with GPT-5.4-Cyber
OpenAI is expanding its Trusted Access for Cyber program, introducing a specialized model called GPT-5.4-Cyber to vetted cybersecurity defenders. The program aims to provide advanced AI capabilities to legitimate security professionals while strengthening safeguards against misuse. This represents a structured approach to deploying frontier AI in sensitive security contexts with access controls.


