Joe Barrow publications

Publications

My Favorite Papers

CommonForms: A Large, Diverse Dataset for Form Field Detection — WACV, 2026. A dataset and models for automatically detecting form fields from PDFs. The Python package is downloaded thousands of times a month.
[copy bibtex] [Data] [Code]

Syntopical Graphs for Computational Argumentation Tasks — ACL, 2021. Building claim-relation graphs to improve corpus understanding, inspired by Mortimer Adler’s Syntopical Reading.
[copy bibtex]

A Joint Model for Document Segmentation and Segment Labeling — ACL, 2020. Learning to segment documents (back when LSTMs were still cool).
[copy bibtex]

Bias and Fairness in Large Language Models: A Survey — Computational Linguistics, 2024. An in-depth survey on bias and fairness in NLP.
[copy bibtex]

PDFTriage: Question Answering over Long, Structured Documents — EMNLP (Industry Track), 2024. Helping LLMs to see documents like people do.
[copy bibtex]

Chain of Logic: Rule-Based Reasoning with Large Language Models — Findings of ACL, 2024. Rule-based reasoning for legal NLP.
[copy bibtex]

Other Papers

Patents

Recorded Talks

Richard Hamming believed that it’s the job of a scientist to communicate via publications, prepared talks, and impromptu talks. I do my best. If you’re interested in me speaking somewhere, reach out!

I’ve given lots of unrecorded talks and lectures on: