product
Nuclear Proliferation Risk Classifier
productactiveprovisional
nuclear-proliferation-risk-classifier-dcc087fb·1 events·first seen 15d agoAliases: Nuclear Proliferation Risk Classifier
Co-occurring entities
More like this (12)
CBRN (Chemical, Biological, Radiological, Nuclear) risk categorynuclear normNational Nuclear Security AdministrationU.S. Department of Energy National Nuclear Security AdministrationPRNetNIST AI RMFRisk Direction IndexInvariant Risk MinimizationFission-AIprobing classifiersSafety Detection ClassifierAI Risk Management Framework
Recent events (1)
Anthropic and NNSA Co-Develop Nuclear Safeguards Classifier for Claude Traffic
Anthropic, in partnership with the U.S. Department of Energy's National Nuclear Security Administration (NNSA) and DOE national laboratories, has co-developed an AI classifier that distinguishes between concerning and benign nuclear-related conversations with 96% accuracy in preliminary testing. The classifier has already been deployed on live Claude traffic as part of Anthropic's misuse-detection infrastructure. Anthropic plans to share the approach with the Frontier Model Forum as a replicable blueprint for other AI developers. This represents the first public-private partnership of this kind for nuclear proliferation risk monitoring in frontier AI systems.