computer_vision / briefing
For most of a decade, optical character recognition felt like a settled engineering problem. In the last twelve months, it has become the most crowded frontier in vision–language modelling.
computer_vision / briefing
The second figure in the OCR Cambrian series. How modern document VLMs encode position — and why it matters more than the parameter count.