6OpenAI Blog·1mo ago

Introducing OpenAI Privacy Filter

OpenAI has released an open-weight model called Privacy Filter designed to detect and redact personally identifiable information (PII) in text. The model is described as achieving state-of-the-art accuracy on PII detection tasks. This is OpenAI's first open-weight release focused specifically on data privacy and compliance use cases.

Open Weights Progress AI Safety Research Enterprise Deployment Patterns Personally Identifiable Information (PII)OpenAI OpenAI Privacy Filter

Related guides (4)

OpenAI

OpenAI: The Lab That Made AI a Household Word

Read asBeginner In-depth

AI Safety ResearchTopic guide

AI Safety Research: From Lab Policies to Real-World Flashpoints

Read asBeginner In-depth

Open Weights ProgressTopic guide

Open Weights Progress: How Freely Available AI Models Caught Up to the Frontier

Read asBeginner

Enterprise Deployment PatternsTopic guide

Enterprise Deployment Patterns: From LLM Demo to Production Reality

Read asIn-depth

Related events (8)

5Openai Blog·1mo ago·source ↗

OpenAI Releases Teen Safety Policies for Developers via gpt-oss-safeguard

OpenAI has published prompt-based teen safety policies targeting developers who build on its models, specifically leveraging the gpt-oss-safeguard model to moderate age-specific risks. The release provides structured guidance and tooling for filtering or adjusting AI outputs in contexts where minors may be users. This represents an extension of OpenAI's safety infrastructure into the developer-facing layer, addressing regulatory and reputational pressure around youth-facing AI deployments.

AI Safety Research Enterprise Deployment Patterns gpt-oss-safeguard OpenAI +1 more

8Openai Blog·1mo ago·source ↗

OpenAI Releases Most Capable Open-Weights Models

OpenAI has released what it describes as its most capable open-weights models, framing the move as a major step toward broader AI accessibility. The announcement emphasizes openness, flexibility, and global reach as core motivations. This marks a significant shift in OpenAI's historically closed model distribution strategy.

Frontier Model Releases Open Weights Progress open-weight models OpenAI +2 more

5Hugging Face Blog·1mo ago·source ↗

Experimenting with Automatic PII Detection on the Hub using Presidio

Hugging Face describes an experiment integrating Microsoft's Presidio library for automatic personally identifiable information (PII) detection across datasets hosted on the Hub. The effort aims to flag or redact sensitive data before it can be used in model training pipelines. This represents a practical infrastructure-level approach to data governance and privacy compliance for open ML datasets.

Evaluation and Benchmarking Enterprise Deployment Patterns Microsoft Presidio Hugging Face

4Openai Blog·1mo ago·source ↗

OpenAI Launches Free Moderation Endpoint for API Developers

OpenAI introduced a new Moderation endpoint as a free tool for API developers, replacing its previous content filter. The endpoint is designed to help developers detect and filter harmful or policy-violating content in their applications. This represents an incremental improvement to OpenAI's content moderation infrastructure.

AI Safety Research Enterprise Deployment Patterns OpenAI Moderation Endpoint OpenAI API OpenAI

5Openai Blog·1mo ago·source ↗

OpenAI Upgrades Moderation API with GPT-4o-Based Multimodal Model

OpenAI has released an updated Moderation API powered by a new model built on GPT-4o, extending content moderation capabilities to both text and images. The update aims to improve accuracy in detecting harmful content, giving developers better tools for building moderation systems. This represents an expansion of OpenAI's safety infrastructure into multimodal domains.

AI Safety Research Enterprise Deployment Patterns GPT-4o OpenAI Moderation API OpenAI +1 more

4Openai Blog·9d ago·source ↗

OpenAI endorses EU Code of Practice on AI content transparency

OpenAI announced support for the EU Code of Practice on AI content transparency, committing to provenance standards and tools that help users identify AI-generated content. The announcement positions OpenAI as aligned with European regulatory frameworks for trustworthy AI. This is a policy/regulatory alignment move rather than a technical release.

AI Safety Research Regulatory Developments EU Code of Practice on AI content transparency OpenAI European Union

7Openai Blog·1mo ago·source ↗

Introducing gpt-oss-safeguard

OpenAI has released gpt-oss-safeguard, a set of open-weight reasoning models designed for safety classification tasks. The models are intended to help developers implement and iterate on custom content safety policies. This represents OpenAI's entry into the open-weight safety tooling space, providing infrastructure-level moderation capabilities that can be customized and deployed independently.

Open Weights Progress AI Safety Research gpt-oss-safeguard OpenAI +2 more

7Openai Blog·1mo ago·source ↗

OpenAI Releases gpt-oss-safeguard-120b and gpt-oss-safeguard-20b: Open-Weight Policy-Reasoning Safety Models

OpenAI has released two open-weight reasoning models, gpt-oss-safeguard-120b and gpt-oss-safeguard-20b, post-trained from the gpt-oss base models to perform policy-conditioned content labeling. The models are designed to reason from a provided policy document and classify content accordingly, functioning as configurable safety classifiers. A technical report accompanies the release, covering capabilities and baseline safety evaluations benchmarked against the underlying gpt-oss models.

Open Weights Progress AI Safety Research GPT-OSS gpt-oss-safeguard OpenAI +1 more