3OpenAI Blog·1mo ago

OpenAI Launches Bug Bounty Program

OpenAI announced a formal bug bounty program to crowdsource the discovery of security vulnerabilities across its products and services. The company frames the initiative as part of its broader commitment to building secure and trustworthy AI systems. Researchers who find and responsibly disclose vulnerabilities are eligible for rewards.

AI Safety Research OpenAI

Related guides (2)

OpenAI

OpenAI: The Lab That Made AI a Household Word

AI Safety Research · Topic guide

AI Safety Research: From Lab Policies to Real-World Flashpoints

Related events (8)

5OpenAI Blog·1mo ago

Introducing the OpenAI Safety Bug Bounty Program

OpenAI has launched a Safety Bug Bounty program targeting AI-specific abuse and safety risks. The program focuses on agentic vulnerabilities, prompt injection, and data exfiltration scenarios. This extends traditional security bug bounty models into AI safety territory, incentivizing external researchers to surface novel attack vectors.

AI Safety Research Enterprise Deployment Patterns prompt injection OpenAI Safety Bug Bounty agentic vulnerabilities +3 more

6Anthropic News·16d ago

Anthropic expands model safety bug bounty to target universal jailbreaks in CBRN and cybersecurity domains

Anthropic is expanding its HackerOne-partnered bug bounty program to offer up to $15,000 for novel universal jailbreak attacks against a next-generation safety mitigation system not yet publicly deployed. The program specifically targets high-risk domains including CBRN (chemical, biological, radiological, nuclear) and cybersecurity, with participants given early access to test the new safeguards before release. The initiative begins as invite-only and aligns with Anthropic's commitments under the White House Voluntary AI Commitments and G7 Hiroshima Process Code of Conduct.

AI Safety Research Regulatory Developments HackerOne Hiroshima AI Process White House Voluntary AI Commitments +1 more

4OpenAI Blog·1mo ago

OpenAI Publishes Outbound Coordinated Vulnerability Disclosure Policy

OpenAI has published a formal outbound coordinated vulnerability disclosure (CVD) policy, establishing how the company will handle and disclose security vulnerabilities it discovers in third-party systems or products. This represents a structured commitment to responsible disclosure practices when OpenAI's research or operations uncover vulnerabilities outside its own infrastructure. The policy signals OpenAI's growing role as a security actor with obligations to the broader ecosystem.

AI Safety Research Coordinated Vulnerability Disclosure OpenAI

7OpenAI Blog·1mo ago

GPT-5.5 Bio Bug Bounty

OpenAI has launched a red-teaming bug bounty program specifically targeting biosafety risks in GPT-5.5, offering rewards up to $25,000. The program focuses on finding universal jailbreaks that could bypass biological safety guardrails. This represents a structured external adversarial evaluation of a frontier model's safety properties in a high-stakes domain.

Frontier Model Releases Evaluation and Benchmarking GPT-5.5 Bio Bug Bounty OpenAI GPT-5.5 +1 more

6Anthropic News·18d ago

Anthropic launches bug bounty program to stress-test ASL-3 Constitutional Classifiers

Anthropic launched an invite-only bug bounty program in partnership with HackerOne to find universal jailbreaks in its Constitutional Classifiers system before public deployment, offering up to $25,000 per verified vulnerability. The program targets CBRN-related safety bypasses on Claude 3.7 Sonnet and is part of Anthropic's work to meet its AI Safety Level-3 (ASL-3) Deployment Standard under its Responsible Scaling Policy. A follow-up update extended the program to test Constitutional Classifiers on the new Claude Opus 4 model and began accepting reports of universal jailbreaks found on public platforms. The initiative reflects Anthropic's structured approach to pre-deployment safety validation for increasingly capable models.

Frontier Model Releases AI Safety Research Constitutional Classifiers Claude Opus 4 HackerOne +3 more

4OpenAI Blog·1mo ago

OpenAI Cybersecurity Grant Program

OpenAI announced a grant program aimed at developing AI-powered cybersecurity capabilities for defenders. The initiative provides funding and support to researchers and organizations working on defensive cybersecurity applications of AI. This represents OpenAI's effort to direct AI capabilities toward security defense rather than offense.

AI Safety Research Enterprise Deployment Patterns OpenAI Cybersecurity Grant Program OpenAI

5OpenAI Blog·1mo ago

Announcing the OpenAI Safety Fellowship

OpenAI has announced a Safety Fellowship, described as a pilot program aimed at supporting independent safety and alignment research while developing the next generation of AI safety talent. The announcement is sparse on details but signals a structured investment in external safety research capacity. This follows broader industry trends of labs funding independent safety work to build the research ecosystem.

AI Safety Research Alignment and RLHF OpenAI Safety Fellowship AI alignment OpenAI

4OpenAI Blog·1mo ago

OpenAI Awards Up to $2M in Grants for AI and Mental Health Research

OpenAI is launching a grant program of up to $2 million to fund research at the intersection of AI and mental health. The program targets studies examining real-world risks, benefits, and applications of AI with the goal of improving safety and well-being. No specific grantees or research directions are named in the announcement.

AI Safety Research OpenAI