paper

Explaining Attention with Program Synthesis

paperactiveprovisionalexplaining-attention-with-program-synthesis-22511918·1 events·first seen 2d ago

Aliases: Explaining Attention with Program Synthesis

Co-occurring entities

Llama 3.2 GPT-2 TinyStories TinyLlama-1.1B

More like this (12)

ProbSparse Attention Functional Attention symbolic attention heads Neuronal Stochastic Attention Circuit (NSAC)sparse attention Listening with Attention: Entropy-Guided Explainability for Transformer-Based Audio Models How Do Instructions Shape Speech? Cross-Attention Attribution for Style-Captioned Text-to-Speech Lie-Algebra Attention reference attention Sliding Window Attention code synthesis LLMs bidirectional attention

Recent events (1)

6arXiv · cs.LG·2d ago·source ↗

Program synthesis used to reverse-engineer transformer attention heads with executable Python surrogates

Researchers propose a pipeline that approximates transformer attention heads with executable Python programs generated by a language model, then re-ranked by held-out predictive accuracy. Applied to GPT-2, TinyLlama-1.1B, and Llama-3B, fewer than 1,000 programs reproduce attention patterns with >75% average IoU similarity on TinyStories. Replacing 25% of attention heads with programmatic surrogates incurs only a 16% average perplexity increase while preserving downstream QA performance, demonstrating a path toward symbolic transparency in neural models.

Evaluation and Benchmarking AI Safety Research Llama 3.2 GPT-2 Explaining Attention with Program Synthesis +2 more