-
Same Task, Different Circuits: Disentangling Modality-Specific Mechanisms in VLMs
modality specific circuits
-
Position-aware Automatic Circuit Discovery
position aware circuits
-
Arithmetic Without Algorithms: Language Models Solve Math With a Bag of Heuristics
arithmetic heuristics
-
Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights
neural thickets
-
Attention Residuals
attention residuals