7OpenAI Blog·1mo ago

GPT-5.5 Bio Bug Bounty

OpenAI has launched a red-teaming bug bounty program specifically targeting biosafety risks in GPT-5.5, offering rewards up to $25,000. The program focuses on finding universal jailbreaks that could bypass biological safety guardrails. This represents a structured external adversarial evaluation of a frontier model's safety properties in a high-stakes domain.

Frontier Model Releases Evaluation and Benchmarking AI Safety Research GPT-5.5 Bio Bug Bounty OpenAI GPT-5.5

Related guides (3)

OpenAI

OpenAI: The Lab That Made AI a Household Word

Read asBeginner In-depth

GPT-5.5

GPT-5.5: OpenAI's Most Capable Model — and Its Most Complicated

Read asBeginner In-depth

Frontier Model ReleasesTopic guide

Frontier Model Releases: The Race From Language to Action

Read asBeginner In-depth

Related events (8)

6Anthropic News·17d ago·source ↗

Anthropic expands model safety bug bounty to target universal jailbreaks in CBRN and cybersecurity domains

Anthropic is expanding its HackerOne-partnered bug bounty program to offer up to $15,000 for novel universal jailbreak attacks against a next-generation safety mitigation system not yet publicly deployed. The program specifically targets high-risk domains including CBRN (chemical, biological, radiological, nuclear) and cybersecurity, with participants given early access to test the new safeguards before release. The initiative begins as invite-only and aligns with Anthropic's commitments under the White House Voluntary AI Commitments and G7 Hiroshima Process Code of Conduct.

AI Safety Research Regulatory Developments HackerOne Hiroshima AI Process White House Voluntary AI Commitments +1 more

5Openai Blog·1mo ago·source ↗

Introducing the OpenAI Safety Bug Bounty Program

OpenAI has launched a Safety Bug Bounty program targeting AI-specific abuse and safety risks. The program focuses on agentic vulnerabilities, prompt injection, and data exfiltration scenarios. This extends traditional security bug bounty models into AI safety territory, incentivizing external researchers to surface novel attack vectors.

AI Safety Research Enterprise Deployment Patterns prompt injection OpenAI Safety Bug Bounty agentic vulnerabilities +3 more

3Openai Blog·1mo ago·source ↗

OpenAI Launches Bug Bounty Program

OpenAI announced a formal bug bounty program to crowdsource security vulnerability discovery across its products and services. The initiative is framed as part of OpenAI's broader commitment to building secure and trustworthy AI systems. Researchers who find and responsibly disclose vulnerabilities will be eligible for rewards.

AI Safety Research OpenAI

6Anthropic News·19d ago·source ↗

Anthropic launches bug bounty program to stress-test ASL-3 Constitutional Classifiers

Anthropic launched an invite-only bug bounty program in partnership with HackerOne to find universal jailbreaks in its Constitutional Classifiers system before public deployment, offering up to $25,000 per verified vulnerability. The program targets CBRN-related safety bypasses on Claude 3.7 Sonnet and is part of Anthropic's work to meet its AI Safety Level-3 (ASL-3) Deployment Standard under its Responsible Scaling Policy. A follow-up update extended the program to test Constitutional Classifiers on the new Claude Opus 4 model and began accepting reports of universal jailbreaks found on public platforms. The initiative reflects Anthropic's structured approach to pre-deployment safety validation for increasingly capable models.

Frontier Model Releases AI Safety Research Constitutional Classifiers Claude Opus 4.6 HackerOne +3 more

7Openai Blog·1mo ago·source ↗

GPT-5.1-Codex-Max System Card

OpenAI has published the system card for GPT-5.1-Codex-Max, a coding-focused model variant. The card details model-level safety mitigations including specialized safety training against harmful tasks and prompt injection attacks, as well as product-level controls such as agent sandboxing and configurable network access. This represents OpenAI's formal safety documentation for an agentic coding model deployment.

Frontier Model Releases AI Safety Research prompt injection GPT-5.1-Codex-Max OpenAI +2 more

5Openai Blog·1mo ago·source ↗

Medical Research with GPT-5

OpenAI published a blog post describing how GPT-5 is being used for medical research applications. The post appears to be an announcement or case study highlighting GPT-5's capabilities in a healthcare/research context. Specific details about methods, benchmarks, or outcomes are not provided in the available text.

Frontier Model Releases Enterprise Deployment Patterns OpenAI GPT-5.5

7Openai Blog·1mo ago·source ↗

OpenAI Launches GPT-5.5 and GPT-5.5-Cyber with Expanded Trusted Access for Cyber Program

OpenAI is expanding its Trusted Access for Cyber program with two new models: GPT-5.5 and GPT-5.5-Cyber, a specialized variant aimed at cybersecurity applications. The program provides verified defenders with access to these models to accelerate vulnerability research and protect critical infrastructure. This represents a continuation of OpenAI's strategy of releasing domain-specialized model variants with controlled access tiers for sensitive use cases.

Frontier Model Releases AI Safety Research GPT-5.5-Cyber Trusted Access for Cyber OpenAI +2 more

7Openai Blog·1mo ago·source ↗

OpenAI Expands Trusted Access for Cyber Defense Program with GPT-5.4-Cyber

OpenAI is expanding its Trusted Access for Cyber program, introducing a specialized model called GPT-5.4-Cyber to vetted cybersecurity defenders. The program aims to provide advanced AI capabilities to legitimate security professionals while strengthening safeguards against misuse. This represents a structured approach to deploying frontier AI in sensitive security contexts with access controls.

Frontier Model Releases AI Safety Research GPT-5.5-Cyber Trusted Access for Cyber OpenAI +2 more