Working notes — things I'm reading, thinking about, and trying to figure out. Less polished than the long-form posts, sometimes revised in place.
Quick notes on how the temporal/height/width split shifted between Qwen2-VL and Qwen2.5-VL.
Patch tokenization is not enough — the model has to know what a row is.
A book of feedback loops, leverage points, and the long shadow of stocks.
An exploration of tokenizers and layout.