5OpenAI Blog·1mo ago

OpenAI Releases Teen Safety Policies for Developers via gpt-oss-safeguard

OpenAI has published prompt-based teen safety policies targeting developers who build on its models, specifically leveraging the gpt-oss-safeguard model to moderate age-specific risks. The release provides structured guidance and tooling for filtering or adjusting AI outputs in contexts where minors may be users. This represents an extension of OpenAI's safety infrastructure into the developer-facing layer, addressing regulatory and reputational pressure around youth-facing AI deployments.

AI Safety Research Enterprise Deployment Patterns Regulatory Developments gpt-oss-safeguard OpenAI

Related guides (4)

OpenAI

OpenAI: The Lab That Made AI a Household Word

Read asBeginner In-depth

AI Safety ResearchTopic guide

AI Safety Research: From Lab Policies to Real-World Flashpoints

Read asBeginner In-depth

Enterprise Deployment PatternsTopic guide

Enterprise Deployment Patterns: From LLM Demo to Production Reality

Read asIn-depth

Regulatory DevelopmentsTopic guide

AI Regulatory Developments: From Voluntary Frameworks to Government Enforcement

Read asIn-depth

Related events (8)

5Openai Blog·1mo ago·source ↗

OpenAI Updates Model Spec with Under-18 Teen Protection Principles

OpenAI is revising its Model Spec to include new Under-18 Principles that govern how ChatGPT interacts with teenage users. The update introduces stronger guardrails and age-appropriate behavioral guidance grounded in developmental science. This builds on OpenAI's broader ongoing effort to improve safety for minors using ChatGPT.

AI Safety Research Regulatory Developments Under-18 Principles ChatGPT OpenAI +2 more

5Openai Blog·1mo ago·source ↗

OpenAI Building Age Prediction and Parental Controls in ChatGPT

OpenAI is developing age prediction capabilities and parental control features within ChatGPT to deliver age-appropriate experiences for teenage users. The initiative aims to support families with new safety tools and restrict content based on inferred or verified user age. This represents a product-safety effort at the intersection of AI deployment and child protection policy.

Enterprise Deployment Patterns Regulatory Developments ChatGPT age prediction OpenAI

7Openai Blog·1mo ago·source ↗

Introducing gpt-oss-safeguard

OpenAI has released gpt-oss-safeguard, a set of open-weight reasoning models designed for safety classification tasks. The models are intended to help developers implement and iterate on custom content safety policies. This represents OpenAI's entry into the open-weight safety tooling space, providing infrastructure-level moderation capabilities that can be customized and deployed independently.

Open Weights Progress AI Safety Research gpt-oss-safeguard OpenAI +2 more

4Openai Blog·1mo ago·source ↗

OpenAI Safety Practices Update

OpenAI published a safety update reaffirming its commitment to responsible development and deployment of AGI. The post is a high-level statement from a Tier 1 lab on its safety posture. The body excerpt is brief and does not detail specific new policies, evaluations, or technical measures.

AI Safety Research AGI (Artificial General Intelligence)OpenAI

6Openai Blog·17d ago·source ↗

OpenAI publishes public policy agenda covering safety, youth protection, and global standards

OpenAI released a formal public policy agenda outlining its positions on AI safety, youth protection, workforce transition, and international standards. The document represents OpenAI's stated priorities for engaging with governments and regulators. As a tier-1 primary source from a leading frontier lab, it signals how OpenAI intends to shape AI governance discussions.

AI Safety Research Regulatory Developments OpenAI

5Openai Blog·1mo ago·source ↗

Building more helpful ChatGPT experiences for everyone

OpenAI is announcing a set of ChatGPT safety and helpfulness improvements including new parental controls for teen users, routing of sensitive conversations to reasoning models, and partnerships with external experts. The update reflects OpenAI's ongoing effort to balance accessibility with safeguards across different user demographics. Routing sensitive queries to reasoning models is a notable architectural/policy decision that may affect response quality and safety outcomes.

AI Safety Research Enterprise Deployment Patterns OpenAI Reasoning Models ChatGPT OpenAI

7Openai Blog·1mo ago·source ↗

OpenAI Releases gpt-oss-safeguard-120b and gpt-oss-safeguard-20b: Open-Weight Policy-Reasoning Safety Models

OpenAI has released two open-weight reasoning models, gpt-oss-safeguard-120b and gpt-oss-safeguard-20b, post-trained from the gpt-oss base models to perform policy-conditioned content labeling. The models are designed to reason from a provided policy document and classify content accordingly, functioning as configurable safety classifiers. A technical report accompanies the release, covering capabilities and baseline safety evaluations benchmarked against the underlying gpt-oss models.

Open Weights Progress AI Safety Research GPT-OSS gpt-oss-safeguard OpenAI +1 more

5Openai Blog·1mo ago·source ↗

An update on our safety & security practices

OpenAI published an update on its safety and security practices. The post appears to be a high-level overview of the company's current approach to model safety and security. As a Tier 1 source announcement, it likely covers internal safety processes, red-teaming, or policy commitments, though the body text is minimal.

AI Safety Research OpenAI