technique
Activation Atlases
technique · active
activation-atlases-ed46aed4 · 1 event · first seen 28d ago
Aliases: Activation Atlases
Recent events (1)
Introducing Activation Atlases
Researchers at OpenAI and Google jointly developed activation atlases, a new neural network interpretability technique for visualizing what interactions between neurons can represent. The method aims to improve understanding of how AI systems reach decisions internally. The work is positioned as a tool for identifying weaknesses and investigating failures in deployed AI systems.