-
What’s in the Image? A Deep-Dive into the Vision of Vision Language Models
VLM QA mech interp
-
Generative Modeling via Drifting
Drifting
-
FLUX.1 Kontext: Flow Matching for In-Context Image Generation and Editing in Latent Space
FLUX.1 Kontext
-
MaxRL: Maximum Likelihood via Reinforcement Learning
Maximum likelihood RL
-
Circuit Mechanisms for Spatial Relation Generation in Diffusion Transformers
Circuits in DiTs