Reading Notes

Notes and summaries from reading ML/AI papers (and some blog posts). All credit to the content in the papers and blog posts goes to the original authors.

Data Debiasing with Datamodels (D3M): Improving Subgroup Robustness via Data Selection

Debiasing data using TRAK

1 min read · November 17, 2025

2025 · partial-read · distillation
Ambient Diffusion Omni: Training Good Models with Bad Data

diffusion using low quality data

1 min read · November 17, 2025

2025 · partial-read · distillation
Boomerang Distillation Enables Zero-Shot Model Size Interpolation

Boomerang Distillation

2 min read · November 16, 2025

2025 · distillation
To Backtrack or Not to Backtrack: When Sequential Search Limits Model Reasoning

Backtracking vs Best of n

2 min read · November 15, 2025

2025 · distillation
Training Language Models to Self-Correct via Reinforcement Learning

Aviral Kumar self-correcting LLMs

3 min read · November 15, 2025

2025 · distillation