-
Training Language Models to Self-Correct via Reinforcement Learning
Aviral Kumar self-correcting LLMs
-
ColBERT: Efficient and Effective Passage Search via Contextualized Late Interaction over BERT
MIT ColBERT
-
Large Language Diffusion Models
LLaDA
-
Teaching AI to see the world more like we do
Deepmind Align Vision Representations
-
Self-Adapting Language Models
Zweiger et al SEAL continual learning