Almanac
technique

Activation Atlases

technique · active · activation-atlases-ed46aed4 · 1 event · first seen 28d ago

Aliases: Activation Atlases

Recent events (1)

5 · OpenAI Blog · 28d ago

Introducing Activation Atlases

OpenAI and Google researchers jointly developed activation atlases, a neural network interpretability technique that visualizes what interactions between neurons represent. The method aims to improve understanding of how AI systems make decisions internally. The work is presented as a tool for identifying weaknesses and investigating failures in deployed AI systems.