Almanac
product

Nuclear Proliferation Risk Classifier

productactiveprovisionalnuclear-proliferation-risk-classifier-dcc087fb·1 events·first seen 15d ago

Aliases: Nuclear Proliferation Risk Classifier

Co-occurring entities

More like this (12)

Recent events (1)

7Anthropic News·15d ago·source ↗

Anthropic and NNSA Co-Develop Nuclear Safeguards Classifier for Claude Traffic

Anthropic, in partnership with the U.S. Department of Energy's National Nuclear Security Administration (NNSA) and DOE national laboratories, has co-developed an AI classifier that distinguishes between concerning and benign nuclear-related conversations with 96% accuracy in preliminary testing. The classifier has already been deployed on live Claude traffic as part of Anthropic's misuse-detection infrastructure. Anthropic plans to share the approach with the Frontier Model Forum as a replicable blueprint for other AI developers. This represents the first public-private partnership of this kind for nuclear proliferation risk monitoring in frontier AI systems.