-
To Backtrack or Not to Backtrack: When Sequential Search Limits Model Reasoning
Backtracking vs Best of n
-
Training Language Models to Self-Correct via Reinforcement Learning
Aviral Kumar self-correcting LLMs
-
ColBERT: Efficient and Effective Passage Search via Contextualized Late Interaction over BERT
MIT ColBERT
-
Large Language Diffusion Models
LLaDA
-
Teaching AI to see the world more like we do
Deepmind Align Vision Representations