Working notes — things I'm reading, thinking about, and trying to figure out. Less polished than the long-form posts, sometimes revised in place.
Can you perform tasks over documents purely using the document image?
In which DeepSeek argues that document images can be a denser, lossless input representation.
A tiny, two-stage, DONUT-based OCR model.
A benchmark for the current frontier of retrievers: possible to verify with reasoning models, but difficult to retrieve.
TODO
Evolving rubrics for training a small, powerful deep research model.
TODO
TODO
TODO
TODO
TODO
TODO
TODO
TODO
TODO
TODO
TODO
TODO
TODO
TODO
TODO
TODO
TODO
TODO
Novel RL techniques for training a surprisingly powerful small prover.
TODO
TODO
TODO
TODO
TODO
TODO
TODO