Preparing for malicious uses of AI
OpenAI co-authored a multi-institutional paper forecasting how malicious actors could misuse AI technology, produced in collaboration with FHI, CSER, CNAS, EFF, and others over nearly a year. The paper outlines potential threat vectors and proposes prevention and mitigation strategies. This represents an early coordinated effort among AI safety and policy organizations to systematically address AI misuse risks.
Related guides (3)
Related events (8)
Disrupting Malicious Uses of AI: OpenAI June 2025 Report
OpenAI published its June 2025 report on detecting and preventing malicious uses of its AI systems. The report features case studies of threat actors attempting to abuse OpenAI's models and the countermeasures deployed. This is part of OpenAI's ongoing transparency series on adversarial misuse.
Disrupting malicious uses of AI by state-affiliated threat actors
OpenAI published a report detailing its efforts to detect and disrupt state-affiliated threat actors attempting to misuse its AI systems for malicious purposes. The report identifies specific nation-state groups that were found leveraging OpenAI tools for activities such as influence operations, cyberattack research, and other adversarial tasks. OpenAI describes the countermeasures taken to terminate these actors' access and outlines broader policy implications for AI misuse by state actors.
Disrupting Malicious Uses of AI: OpenAI October 2025 Report
OpenAI published its October 2025 report on detecting and disrupting malicious uses of its AI systems. The report covers enforcement actions, policy violations, and efforts to counter real-world harms from misuse. This is part of OpenAI's ongoing transparency series documenting adversarial abuse patterns and mitigation responses.
Disrupting Malicious Uses of AI
OpenAI published a report on its efforts to detect and disrupt malicious uses of its AI systems. The post covers threat actor activity identified and terminated on OpenAI's platform, including influence operations, cyberattack assistance, and other adversarial uses. It represents OpenAI's ongoing transparency reporting on abuse cases and countermeasures.
Disrupting Malicious Uses of AI | OpenAI Threat Report February 2026
OpenAI published its latest threat report examining how malicious actors are combining AI models with websites and social platforms for harmful purposes. The report analyzes detection and defense implications of these combined attack vectors. This represents OpenAI's ongoing effort to document and counter adversarial misuse of AI systems.
Strengthening cyber resilience as AI capabilities advance
OpenAI published a post outlining its approach to cybersecurity risk as its models grow more capable, covering risk assessment frameworks, misuse mitigation, and collaboration with the security community. The piece addresses both offensive risk (AI-enabled attacks) and defensive applications. It represents OpenAI's public positioning on responsible deployment in a high-stakes domain.
Preparing for future AI risks in biology
OpenAI has published a post outlining its proactive approach to assessing and mitigating biosecurity risks from advanced AI systems capable of biological applications. The piece describes capability evaluations and safeguards designed to prevent misuse of AI in biology and medicine. This reflects OpenAI's ongoing effort to get ahead of dual-use risks before capabilities reach dangerous thresholds.
Protecting People from Harmful Manipulation
Google DeepMind has published research examining AI's potential for harmful manipulation across domains including finance and health. The work identifies manipulation risks and proposes new safety measures to address them. This represents a proactive safety research effort from a Tier 1 lab focused on misuse and adversarial deployment scenarios.


