Joe Barrow field_notes

Field Notes

Paper Notes: Kosmos-2.5

last updated 2026-05-10

Dataset: 357.4MM document images, split into OCR and Markup:

Model size: 1.3B params

Tasks: ocr (image + bbox), image to markdown (<md> prompt), and docvqa