Entity · model

gpt-oss-safeguard

modelactivegpt-oss-safeguard-1117f024·3 events·first seen May 19, 2026

Aliases: gpt-oss-safeguard, gpt-oss-safeguard-120b, gpt-oss-safeguard-20b

Co-occurring entities

More like this (12)

gpt-oss-20b GPT-OSS GPT-OSS 120B gpt-oss usage policy GPT-next GPT-f GPT GPT-Image-2 GPT-4o GPT-5.2 GPT-Image-1.5 GPT-J 6B

Recent events (3)

7Openai Blog·May 20, 2026·source ↗

OpenAI Releases gpt-oss-safeguard-120b and gpt-oss-safeguard-20b: Open-Weight Policy-Reasoning Safety Models

OpenAI has released two open-weight reasoning models, gpt-oss-safeguard-120b and gpt-oss-safeguard-20b, post-trained from the gpt-oss base models to perform policy-conditioned content labeling. The models are designed to reason from a provided policy document and classify content accordingly, functioning as configurable safety classifiers. A technical report accompanies the release, covering capabilities and baseline safety evaluations benchmarked against the underlying gpt-oss models.

Open Weights Progress AI Safety Research GPT-OSS gpt-oss-safeguard OpenAI +1 more

7Openai Blog·May 20, 2026·source ↗

Introducing gpt-oss-safeguard

OpenAI has released gpt-oss-safeguard, a set of open-weight reasoning models designed for safety classification tasks. The models are intended to help developers implement and iterate on custom content safety policies. This represents OpenAI's entry into the open-weight safety tooling space, providing infrastructure-level moderation capabilities that can be customized and deployed independently.

Open Weights Progress AI Safety Research gpt-oss-safeguard OpenAI +2 more

5Openai Blog·May 19, 2026·source ↗

OpenAI Releases Teen Safety Policies for Developers via gpt-oss-safeguard

OpenAI has published prompt-based teen safety policies targeting developers who build on its models, specifically leveraging the gpt-oss-safeguard model to moderate age-specific risks. The release provides structured guidance and tooling for filtering or adjusting AI outputs in contexts where minors may be users. This represents an extension of OpenAI's safety infrastructure into the developer-facing layer, addressing regulatory and reputational pressure around youth-facing AI deployments.

AI Safety Research Enterprise Deployment Patterns gpt-oss-safeguard OpenAI +1 more