docling-project/docling-eval

docling-project / docling-eval

Evaluation framework for document processing models and services.

☆77

Alternatives and similar repositories for docling-eval

Users who are interested in docling-eval are comparing it to the libraries listed below.

docling-project / docling-sdg
A set of tools to create synthetically-generated data from documents
☆48 · Aug 15, 2025 · Updated 11 months ago
docling-project / docling-ibm-models
☆207 · Updated this week
docling-project / docling-core
Docling core data types and transformations
☆271 · Updated this week
DS4SD / deepsearch-glm
Create fast graph language models from converted PDF documents for knowledge extraction and Q&A.
☆60 · Jan 27, 2025 · Updated last year
docling-project / docling-parse
Simple package to extract text with coordinates from programmatic PDFs
☆326 · Updated this week
docling-project / docling-graph
Transform unstructured documents into validated, rich and queryable knowledge graphs.
☆181 · Updated this week
docling-project / docling-haystack
Docling Haystack integration
☆29 · Apr 9, 2026 · Updated 3 months ago
IBM / SynthTabNet
Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files
☆154 · Sep 17, 2025 · Updated 10 months ago
docling-project / docling-serve
Running Docling as an API service
☆1,700 · Updated this week
rasyosef / splade-index
Fast search index for SPLADE sparse retrieval models implemented in Python using NumPy and Numba
☆38 · Oct 16, 2025 · Updated 9 months ago
felix-schmitt / MathNet
MathNet: A Data-Centric Approach, Dataset and Benchmark Model to Advance Mathematical Expression Recognition
☆10 · Mar 19, 2025 · Updated last year
Hofer-Julian / marimo-pixi-starter-template
marimo + pixi starter template
☆18 · Jan 31, 2025 · Updated last year
doclang-project / doclang
DocLang spec and reference toolkit
☆519 · Jul 15, 2026 · Updated last week
LARS-research / TREFE
Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022
☆13 · Nov 25, 2022 · Updated 3 years ago
ihdia / seamformer
Official repository accompanying the ICDAR 2023 paper
☆14 · Oct 3, 2023 · Updated 2 years ago
felixdittrich92 / docling-OCR-OnnxTR
OnnxTR OCR plugin for Docling
☆21 · Jun 28, 2026 · Updated 3 weeks ago
huggingface / llm-course
A course on building Large Language Models
☆19 · Mar 24, 2025 · Updated last year
huggingface / finepdfs
Codebase for FinePDFs
☆187 · Jan 9, 2026 · Updated 6 months ago
nozomio-labs / chromium-agent-nia
Chromagent: an AI agent that answers questions grounded in the Chromium codebase and documentation. Powered by Nia.
☆22 · Dec 20, 2025 · Updated 7 months ago
docling-project / docling-operator
☆16 · Apr 8, 2026 · Updated 3 months ago
georgeretsi / HTR-best-practices
Basic HTR concepts/modules to boost performance
☆41 · Nov 30, 2024 · Updated last year
kartikgill / taco-box
An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR
☆15 · Dec 4, 2021 · Updated 4 years ago
thanhnghiadk / syntactic_HME_generation
This project aims to generate syntactic handwritten mathematical expressions. The dataset is generated from the CROHME 2014 training set.
☆14 · Feb 24, 2022 · Updated 4 years ago
georgeretsi / Seq2Emb
Create handwritten word embeddings from a text recognition Seq2Seq system.
☆11 · Dec 1, 2022 · Updated 3 years ago
ThunderVVV / RCLSTR
Official PyTorch implementation of `[ACMMM 2023] Relational Contrastive Learning for Scene Text Recognition`
☆17 · Sep 22, 2023 · Updated 2 years ago
dshea89 / tesseract-retraining-pipeline
Intuitive interface for fine-tuning and retraining a Tesseract OCR language model
☆10 · Jul 4, 2025 · Updated last year
koaning / mosync
A utility for async batch jobs in marimo
☆13 · Mar 12, 2025 · Updated last year
achillean / redis-keys
Using Shodan to get a breakdown of the most common key names in public Redis servers.
☆12 · Dec 10, 2017 · Updated 8 years ago
yqingli123 / TDv2
Source code for TDv2 from the paper "TDv2: A Novel Tree-Structured Decoder for Offline Mathematical Expression Recognition".
☆12 · Jul 28, 2022 · Updated 3 years ago
JGalego / RAGmap
A simple Streamlit application to visualize document chunks and queries in embedding space 🗺️🔍
☆14 · Apr 15, 2025 · Updated last year
usnistgov / cookiecutter-nist-python
A cookiecutter template for Python projects/packages at NIST
☆15 · Updated this week
vals-ai / model-library
Simple provider-agnostic LLM gateway
☆20 · Updated this week
yufanchen96 / GraphDoc
Graph-based Document Structure Analysis
☆18 · Mar 26, 2025 · Updated last year
cue-lang / cue-py
☆17 · Updated this week
getomni-ai / benchmark
OCR Benchmark
☆640 · Oct 21, 2025 · Updated 9 months ago
KhronosGroup / glTF-MaterialX-Converter
Prototype tooling between glTF (JSON) and MaterialX (XML) file formats.
☆19 · Jul 24, 2025 · Updated last year
tehranixyz / CodeRosetta
CodeRosetta: Pushing the Boundaries of Unsupervised Code Translation for Parallel Programming
☆11 · Nov 18, 2024 · Updated last year
talkiq / llm-evaluate
☆11 · Nov 12, 2024 · Updated last year
legout / veritascribe
AI-Powered Thesis Review Tool
☆17 · Aug 8, 2025 · Updated 11 months ago