automated red teaming

technique · active · automated-red-teaming-e6a1d107 · 1 event · first seen 28d ago

Aliases: automated red teaming

Recent events (1)

6 · OpenAI Blog · 28d ago

Continuously hardening ChatGPT Atlas against prompt injection

OpenAI is using an automated red-teaming system trained with reinforcement learning to harden ChatGPT Atlas, its browser agent, against prompt injection attacks. The approach creates a proactive discover-and-patch loop that identifies novel exploits before they can be weaponized. OpenAI frames the work as part of a broader effort to secure increasingly agentic AI systems against adversarial manipulation through external content.