Joe Barrow field_notes

Field Notes

Paper Notes: Kosmos-2.5

last updated 2026-05-23

Dataset: 357.4M document images, split into OCR and Markup.

Model size: 1.3B params

Tasks: OCR (image + bbox), image-to-markdown (<md> prompt), and DocVQA
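
To make the OCR task concrete, here is a minimal sketch of turning a tagged OCR string into (bbox, text) records. The tag layout `<bbox><x_..><y_..><x_..><y_..></bbox>text` is an assumption on my part and is not stated in these notes, and `OcrLine` and `parse_ocr_output` are hypothetical names used only for illustration. Output from the <md> prompt would be plain markdown and would not need this step.

```python
import re
from typing import NamedTuple

# Assumption (not from the notes): OCR output interleaves location tags with
# text, e.g. "<bbox><x_10><y_20><x_110><y_40></bbox>Hello".
BBOX_TAG = re.compile(r"<bbox><x_(\d+)><y_(\d+)><x_(\d+)><y_(\d+)></bbox>")


class OcrLine(NamedTuple):
    """One recognized text line with its box corners (hypothetical type)."""
    x0: int
    y0: int
    x1: int
    y1: int
    text: str


def parse_ocr_output(raw: str) -> list[OcrLine]:
    """Split a tagged OCR string into one record per bbox tag."""
    matches = list(BBOX_TAG.finditer(raw))
    lines = []
    for i, m in enumerate(matches):
        # The text for a box runs until the next tag, or to the end of the string.
        end = matches[i + 1].start() if i + 1 < len(matches) else len(raw)
        text = raw[m.end():end].strip()
        lines.append(OcrLine(*map(int, m.groups()), text))
    return lines
```

Calling `parse_ocr_output` on a string with two tagged lines would return two `OcrLine` records in reading order. The coordinates come back in whatever space the model emits them in, so they may still need rescaling to the original image size.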