OpenAI Releases Teen Safety Policies for Developers via gpt-oss-safeguard
OpenAI has published prompt-based teen safety policies targeting developers who build on its models, specifically leveraging the gpt-oss-safeguard model to moderate age-specific risks. The release provides structured guidance and tooling for filtering or adjusting AI outputs in contexts where minors may be users. This represents an extension of OpenAI's safety infrastructure into the developer-facing layer, addressing regulatory and reputational pressure around youth-facing AI deployments.
Related guides (4)
Related events (8)
OpenAI Updates Model Spec with Under-18 Teen Protection Principles
OpenAI is revising its Model Spec to include new Under-18 Principles that govern how ChatGPT interacts with teenage users. The update introduces stronger guardrails and age-appropriate behavioral guidance grounded in developmental science. This builds on OpenAI's broader ongoing effort to improve safety for minors using ChatGPT.
OpenAI Building Age Prediction and Parental Controls in ChatGPT
OpenAI is developing age prediction capabilities and parental control features within ChatGPT to deliver age-appropriate experiences for teenage users. The initiative aims to support families with new safety tools and restrict content based on inferred or verified user age. This represents a product-safety effort at the intersection of AI deployment and child protection policy.
Introducing gpt-oss-safeguard
OpenAI has released gpt-oss-safeguard, a set of open-weight reasoning models designed for safety classification tasks. The models are intended to help developers implement and iterate on custom content safety policies. This represents OpenAI's entry into the open-weight safety tooling space, providing infrastructure-level moderation capabilities that can be customized and deployed independently.
OpenAI Safety Practices Update
OpenAI published a safety update reaffirming its commitment to responsible development and deployment of AGI. The post is a high-level statement from a Tier 1 lab on its safety posture. The body excerpt is brief and does not detail specific new policies, evaluations, or technical measures.
OpenAI publishes public policy agenda covering safety, youth protection, and global standards
OpenAI released a formal public policy agenda outlining its positions on AI safety, youth protection, workforce transition, and international standards. The document represents OpenAI's stated priorities for engaging with governments and regulators. As a tier-1 primary source from a leading frontier lab, it signals how OpenAI intends to shape AI governance discussions.
Building more helpful ChatGPT experiences for everyone
OpenAI is announcing a set of ChatGPT safety and helpfulness improvements including new parental controls for teen users, routing of sensitive conversations to reasoning models, and partnerships with external experts. The update reflects OpenAI's ongoing effort to balance accessibility with safeguards across different user demographics. Routing sensitive queries to reasoning models is a notable architectural/policy decision that may affect response quality and safety outcomes.
OpenAI Releases gpt-oss-safeguard-120b and gpt-oss-safeguard-20b: Open-Weight Policy-Reasoning Safety Models
OpenAI has released two open-weight reasoning models, gpt-oss-safeguard-120b and gpt-oss-safeguard-20b, post-trained from the gpt-oss base models to perform policy-conditioned content labeling. The models are designed to reason from a provided policy document and classify content accordingly, functioning as configurable safety classifiers. A technical report accompanies the release, covering capabilities and baseline safety evaluations benchmarked against the underlying gpt-oss models.
An update on our safety & security practices
OpenAI published an update on its safety and security practices. The post appears to be a high-level overview of the company's current approach to model safety and security. As a Tier 1 source announcement, it likely covers internal safety processes, red-teaming, or policy commitments, though the body text is minimal.



