-
ML Tea: Planning and Problem-Solving with General, Scalable Neuro-Symbolic Models
ML Tea LLMs + symbolic computing
-
Automated Detection of Visual Attribute Reliance with a Self-Reflective Agent
SAIA
-
Continuous Language Model Interpolation yields Dynamic and Controllable Text Generation
continuous model interpolation with LoRA
-
Natural Emergent Misalignment from Reward Hacking in Production RL
Anthropic emergent misalignment
-
A Mathematical Framework for Transformer Circuits
Transformer Circuits Framework