allenai / olmocrView on GitHub
Toolkit for linearizing PDFs for LLM datasets/training
16,947Feb 19, 2026Updated last week

Alternatives and similar repositories for olmocr

Users that are interested in olmocr are comparing it to the libraries listed below

Sorting:

Are these results useful?