-
What are Diffusion Models?
Lilian Weng Diffusion Blog Post (Discrete)
-
Building Diffusion Model's theory from ground up
ICLR blog post explaining diffusion
-
Weight-sparse transformers have interpretable circuits
Leo Gao OpenAI sparse circuits
-
Between the Bars: Gradient-based Jailbreaks are Bugs that induce Features
Gradient-based jailbreaking
-
Breakpoint: Scalable evaluation of system-level reasoning in LLM code agents
Breakpoint