Entity · technique

automated red teaming

techniqueactiveautomated-red-teaming-e6a1d107·1 events·first seen May 20, 2026

Aliases: automated red teaming

Co-occurring entities

prompt injection ChatGPT Atlas Reinforcement Learning OpenAI

More like this (12)

human red teaming AI-assisted red teaming red-teaming OpenAI Red Teaming Network automated AI research automated test suite automated theorem proving Automatic Domain Randomization Red-Teaming Resistance Leaderboard adversarial training Automated Reference Verification System autoresearch

Recent events (1)

6Openai Blog·May 20, 2026·source ↗

Continuously hardening ChatGPT Atlas against prompt injection

OpenAI is applying automated red teaming trained with reinforcement learning to harden ChatGPT Atlas, its browser agent, against prompt injection attacks. The approach creates a proactive discover-and-patch loop to identify novel exploits before they can be weaponized. This work is framed as part of broader efforts to secure increasingly agentic AI systems against adversarial manipulation of external content.

AI Safety Research Agent and Tool Ecosystem prompt injection ChatGPT Atlas Reinforcement Learning +3 more