Almanac
← Events
5OpenAI Blog·1mo ago

Understanding prompt injections: a frontier security challenge

OpenAI has published a blog post addressing prompt injection attacks as a key security challenge for AI systems. The post covers how these attacks work and outlines OpenAI's multi-pronged approach including research, model training improvements, and safeguard development. This signals OpenAI's formal positioning on agentic security threats as their models are increasingly deployed in tool-using and autonomous contexts.

Related guides (4)

Related events (8)

6Openai Blog·1mo ago·source ↗

Designing AI agents to resist prompt injection

OpenAI published a blog post describing how ChatGPT's agent workflows are designed to resist prompt injection and social engineering attacks. The approach focuses on constraining risky actions and protecting sensitive data within agentic pipelines. This represents OpenAI's public articulation of defensive design principles for deployed AI agents.

4Openai Blog·1mo ago·source ↗

Strengthening cyber resilience as AI capabilities advance

OpenAI published a post outlining its approach to cybersecurity risk as its models grow more capable, covering risk assessment frameworks, misuse mitigation, and collaboration with the security community. The piece addresses both offensive risk (AI-enabled attacks) and defensive applications. It represents OpenAI's public positioning on responsible deployment in a high-stakes domain.

6Openai Blog·1mo ago·source ↗

Continuously hardening ChatGPT Atlas against prompt injection

OpenAI is applying automated red teaming trained with reinforcement learning to harden ChatGPT Atlas, its browser agent, against prompt injection attacks. The approach creates a proactive discover-and-patch loop to identify novel exploits before they can be weaponized. This work is framed as part of broader efforts to secure increasingly agentic AI systems against adversarial manipulation of external content.

4Openai Blog·1mo ago·source ↗

Security on the path to AGI

OpenAI published a post outlining its approach to security as the organization advances toward AGI. The piece describes how security measures are being built directly into infrastructure and models proactively. The content is high-level and framing-oriented, with limited technical specifics visible in the excerpt.

7Openai Blog·1mo ago·source ↗

OpenAI to Acquire Promptfoo

OpenAI announced the acquisition of Promptfoo, an AI security platform focused on identifying and remediating vulnerabilities in AI systems during development. The acquisition signals OpenAI's intent to deepen its enterprise security capabilities. Promptfoo has been widely used by developers to red-team and evaluate LLM applications for safety and reliability issues.

5Openai Blog·1mo ago·source ↗

Cybersecurity in the Intelligence Age

OpenAI has published a five-part action plan aimed at strengthening cybersecurity through AI-powered defense capabilities. The plan focuses on democratizing access to AI-based cyber defense tools and protecting critical infrastructure systems. This represents OpenAI's public positioning on how AI should be applied to national and enterprise security challenges.

5Openai Blog·1mo ago·source ↗

Introducing the OpenAI Safety Bug Bounty Program

OpenAI has launched a Safety Bug Bounty program targeting AI-specific abuse and safety risks. The program focuses on agentic vulnerabilities, prompt injection, and data exfiltration scenarios. This extends traditional security bug bounty models into AI safety territory, incentivizing external researchers to surface novel attack vectors.

6Openai Blog·1mo ago·source ↗

Preparing for future AI risks in biology

OpenAI has published a post outlining its proactive approach to assessing and mitigating biosecurity risks from advanced AI systems capable of biological applications. The piece describes capability evaluations and safeguards designed to prevent misuse of AI in biology and medicine. This reflects OpenAI's ongoing effort to get ahead of dual-use risks before capabilities reach dangerous thresholds.