organization
Global Project Against Hate and Extremism
organization · active · provisional
global-project-against-hate-and-extremism-2ad0210c · 1 event · first seen 13d ago
Aliases: Global Project Against Hate and Extremism
Recent events (1)
Anthropic details red teaming methods and calls for standardized AI testing practices
Anthropic published a detailed overview of red teaming approaches used to test Claude and other AI systems, covering domain-specific expert testing, automated red teaming, multilingual/multicultural testing, and multimodal red teaming. The post documents empirical findings about when each method is appropriate, highlights partnerships with organizations like Thorn, Institute for Strategic Dialogue, and Singapore's IMDA, and closes with policy recommendations for building a standardized AI testing ecosystem. The piece is notable for its operational specificity and its explicit call for industry-wide standards to enable cross-system safety comparisons.