-
Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains
Multiagent Fine Tunes
-
ML Tea: Planning and Problem-Solving with General, Scalable Neuro-Symbolic Models
ML Tea LLMs \+ symbolic computing
-
Automated Detection of Visual Attribute Reliance with a Self-Reflective Agent
SAIA
-
Continuous Language Model Interpolation yields Dynamic and Controllable Text Generation
continuous model interpolation with lora
-
Natural Emergent Misalignment from Reward Hacking in Production RL
Anthropic emergent misalignment