-
Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking
circuits in fine-tunes
-
Causal Abstractions of Neural Networks
Causal abstractions
-
End-to-End Test-Time Training for Long Context
TTT-E2E
-
Interpreting Physics in Video World Models
physics in video models \+ V-JEPA background
-
Language Models use Lookbacks to Track Beliefs
lookback mechanisms