technique
automated red teaming
techniqueactive
automated-red-teaming-e6a1d107·1 events·first seen 28d agoAliases: automated red teaming
Co-occurring entities
More like this (12)
Recent events (1)
Continuously hardening ChatGPT Atlas against prompt injection
OpenAI is applying automated red teaming trained with reinforcement learning to harden ChatGPT Atlas, its browser agent, against prompt injection attacks. The approach creates a proactive discover-and-patch loop to identify novel exploits before they can be weaponized. This work is framed as part of broader efforts to secure increasingly agentic AI systems against adversarial manipulation of external content.