Reading Notes

Notes and summaries from reading ML/AI papers (and some blog posts). All credit to the content in the papers and blog posts goes to the original authors.

Deriving Muon

Muon

3 min read · November 18, 2025

2025 · distillation
Curiosity-driven Red-teaming for Large Language Models

Curiosity Driven Red Teaming

2 min read · November 18, 2025

2025 · distillation
Guided Speculative Inference for Efficient Test-Time Alignment of LLMs

Harvard Guided speculative decoding

2 min read · November 18, 2025

2025 · distillation
Superhuman AI for Stratego Using Self-Play Reinforcement Learning and Test-Time Search

Stratego

2 min read · November 18, 2025

2025 · partial-read · distillation
Domain-Aware Scaling Laws Uncover Data Synergy

Data domain synergy in scaling laws

1 min read · November 17, 2025

2025 · partial-read · distillation