getomni-ai/benchmark

OCR Benchmark

☆636

Alternatives and similar repositories for benchmark

Users that are interested in benchmark are comparing it to the libraries listed below.

opendatalab / OmniDocBench
[CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation
☆1,862 · Jun 26, 2026 · Updated last week
renatomaaliw3 / public_files
☆12 · Apr 27, 2026 · Updated 2 months ago
docling-project / docling-eval
Evaluation framework for document processing models and services.
☆76 · Updated this week
datalab-to / surya
OCR, layout analysis, reading order, table recognition in 90+ languages
☆21,010 · Jun 13, 2026 · Updated 3 weeks ago
allenai / olmocr
Toolkit for linearizing PDFs for LLM datasets/training
☆18,650 · Mar 25, 2026 · Updated 3 months ago
getomni-ai / zerox
OCR & Document Extraction using vision models
☆12,242 · May 20, 2025 · Updated last year
run-llama / llama_cloud_services
Knowledge Agents and Management in the Cloud
☆4,252 · May 18, 2026 · Updated last month
docling-project / docling
Get your documents ready for gen AI
☆62,334 · Jun 29, 2026 · Updated last week
SwaggasDeCatas / emuThreeDS
World's first Nintendo 3DS emulator for Apple devices based on Citra.
☆18 · Apr 7, 2023 · Updated 3 years ago
mzbac / mlx.voxtral
☆19 · Aug 19, 2025 · Updated 10 months ago
recally-io / go-markitdown
A CLI tool and library written in Go for converting documents to Markdown format.
☆25 · Sep 27, 2025 · Updated 9 months ago
garg-ankush / scipe
SCIPE is a powerful tool for evaluating and diagnosing LLM (Large Language Model) graphs or chains.
☆25 · Nov 5, 2024 · Updated last year
JigsawStack / jigsawstack-mcp-server
Model Context Protocol Server that allows AI models to interact with JigsawStack models!
☆23 · Jul 11, 2025 · Updated 11 months ago
Unstructured-IO / unstructured
Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…
☆15,066 · Jun 24, 2026 · Updated last week
katanaml / sparrow
Structured data extraction, instruction calling and agentic workflows with ML, LLM and Vision LLM
☆5,175 · Updated this week
SciPhi-AI / R2R
SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.
☆7,906 · Nov 7, 2025 · Updated 8 months ago
illuin-tech / colpali
The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
☆2,683 · Updated this week
morphik-org / morphik-core
The most accurate document search and store for building AI apps
☆3,624 · Updated this week
ai-forever / StackMix-OCR
☆48 · Dec 16, 2022 · Updated 3 years ago
Filimoa / open-parse
Improved file parsing for LLM’s
☆3,162 · May 17, 2026 · Updated last month
docling-project / docling-ibm-models
☆207 · Jun 4, 2026 · Updated last month
mindee / doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
☆6,169 · Updated this week
dswang2011 / DocLLM
DocLLM: A layout-aware generative language model for multimodal document understanding
☆143 · Jan 3, 2024 · Updated 2 years ago
microsoft / table-transformer
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the o…
☆2,921 · Jun 24, 2024 · Updated 2 years ago
Update-For-Integrated-Business-AI / CORU
☆19 · Jul 7, 2025 · Updated 11 months ago
neuml / txtai
💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows
☆12,683 · Jun 22, 2026 · Updated 2 weeks ago
NVIDIA / NeMo-Retriever
NeMo Retriever Library is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever Library …
☆2,942 · Updated this week
confident-ai / deepeval
The LLM Evaluation Framework
☆16,516 · Jun 26, 2026 · Updated last week
unslothai / unsloth
Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.
☆67,571 · Jun 29, 2026 · Updated last week
Lightning-AI / LitServe
A minimal Python framework for building custom AI inference servers with full control over logic, batching, and scaling.
☆3,906 · Jun 23, 2026 · Updated last week
doc-analysis / ReadingBank
ReadingBank: A Benchmark Dataset for Reading Order Detection
☆117 · Aug 26, 2024 · Updated last year
datalab-to / marker
Convert PDF to markdown + JSON quickly with high accuracy
☆37,195 · Updated this week
stanfordnlp / dspy
DSPy: The framework for programming—not prompting—language models
☆35,605 · Jun 25, 2026 · Updated last week
ariG23498 / timm-wrapper-examples
Notebooks to demonstrate TimmWrapper
☆17 · Jan 16, 2025 · Updated last year
qyhou / curated-document-layout-analysis
A curated list of resources on Document Layout Analysis
☆12 · Aug 7, 2025 · Updated 10 months ago
nttmdlab-nlp / InstructDoc
InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024)
☆162 · May 31, 2024 · Updated 2 years ago
raphael-seo / Versatile-OCR-Program
Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)
☆680 · May 13, 2026 · Updated last month
D-Star-AI / dsRAG
High-performance retrieval engine for unstructured data
☆1,588 · Nov 10, 2025 · Updated 7 months ago
Dicklesworthstone / llm_aided_ocr
Enhances Tesseract OCR output using LLMs (local or API) for error correction, smart chunking, and markdown formatting of scanned PDFs
☆2,934 · Mar 22, 2026 · Updated 3 months ago