docling-project/docling-core

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/docling-project/docling-core)

docling-project / docling-core

Docling core data types and transformations

☆271

Alternatives and similar repositories for docling-core

Users that are interested in docling-core are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

docling-project / docling-ibm-models
View on GitHub
☆207Updated this week
docling-project / docling-parse
View on GitHub
Simple package to extract text with coordinates from programmatic PDFs
☆326Updated this week
docling-project / docling-sdg
View on GitHub
A set of tools to create synthetically-generated data from documents
☆48Aug 15, 2025Updated 11 months ago
docling-project / docling-serve
View on GitHub
Running Docling as an API service
☆1,700Updated this week
docling-project / docling-langchain
View on GitHub
Docling LangChain integration
☆74Nov 17, 2025Updated 8 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
docling-project / docling-mcp
View on GitHub
Making docling agentic through MCP
☆695Updated this week
docling-project / docling-eval
View on GitHub
Evaluation framework for document processing models and services.
☆77Jul 16, 2026Updated last week
DS4SD / deepsearch-examples
View on GitHub
Examples using the Deep Search functionalities
☆90Jan 29, 2025Updated last year
docling-project / docling-jobkit
View on GitHub
☆34Updated this week
DS4SD / deepsearch-glm
View on GitHub
Create fast graph language models from converted PDF documents for knowledge extraction and Q&A.
☆60Jan 27, 2025Updated last year
DS4SD / deepsearch-toolkit
View on GitHub
Interact with the Deep Search platform for new knowledge explorations and discoveries
☆228Jan 24, 2025Updated last year
docling-project / docling-haystack
View on GitHub
Docling Haystack integration
☆29Apr 9, 2026Updated 3 months ago
docling-project / docling-operator
View on GitHub
☆16Apr 8, 2026Updated 3 months ago
DS4SD / ragnardoc
View on GitHub
☆22Feb 1, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
docling-project / docling
View on GitHub
Get your documents ready for gen AI
☆63,762Updated this week
DS4SD / PatCID
View on GitHub
[Nat. Commun.] PatCID: an open-access dataset of chemical structures in patent documents
☆75Oct 27, 2025Updated 8 months ago
data-prep-kit / data-prep-kit
View on GitHub
Open source project for data preparation for GenAI applications
☆949Jul 14, 2026Updated last week
DS4SD / DocLayNet
View on GitHub
DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis
☆450Feb 1, 2023Updated 3 years ago
doclang-project / doclang
View on GitHub
DocLang spec and reference toolkit
☆519Jul 15, 2026Updated last week
kermitt2 / arxiv_harvester
View on GitHub
Poor man's simple harvester for arXiv resources
☆14Jul 14, 2023Updated 3 years ago
JustlyAI / lmss_entity_extractor
View on GitHub
Tool to apply Legal Matter Specification Standard (LMSS) to documents
☆12Aug 15, 2024Updated last year
dhdaines / playa
View on GitHub
Parallel and LAzY Analyzer for PDFs 🏖️
☆47Apr 28, 2026Updated 2 months ago
alea-institute / kl3m-data
View on GitHub
KL3M training data collection and preprocessing
☆22Apr 14, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
docling-project / docling-graph
View on GitHub
Transform unstructured documents into validated, rich and queryable knowledge graphs.
☆181Updated this week
hirmeos / entity-fishing-client-python
View on GitHub
Repository hosting the common code for the entity-fishing clients
☆10May 18, 2026Updated 2 months ago
allthemusicllc / atp-tools
View on GitHub
AllThePatents tooling
☆11Mar 23, 2024Updated 2 years ago
alycialee / beyond-scale-language-data-diversity
View on GitHub
☆13Updated this week
foundation-model-stack / fms-model-optimizer
View on GitHub
FMS Model Optimizer is a framework for developing reduced precision neural network models.
☆21Jun 24, 2026Updated last month
opendatalab / OmniDocBench
View on GitHub
[CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation
☆1,914Updated this week
pypdfium2-team / pypdfium2
View on GitHub
Python bindings to PDFium, reasonably cross-platform.
☆801Updated this week
duaibeom / chemOCR
View on GitHub
DB-based Optical Chemical Structure Recognition
☆14Sep 12, 2022Updated 3 years ago
odantasvictor / movimentacoes_processuais
View on GitHub
Robô para consulta e geração de relatório de movimentações processuais, através de consulta pública, de processos do PJE
☆40Nov 16, 2022Updated 3 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
foundation-model-stack / fms-acceleration
View on GitHub
🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.
☆14Jan 30, 2026Updated 5 months ago
softcite / softcite_kb
View on GitHub
A Knowledge Base for research software relying on large-scale text mining and curated knowledge sources
☆18May 14, 2023Updated 3 years ago
reflex-dev / reflex-llamaindex-template
View on GitHub
☆15Jun 2, 2026Updated last month
conda-forge / jaxlib-feedstock
View on GitHub
A conda-smithy repository for jaxlib.
☆17Jul 3, 2026Updated 3 weeks ago
alea-institute / FOLIO
View on GitHub
FOLIO: Federated Open Legal Information Ontology
☆41May 27, 2026Updated last month
explosion / spacy-layout
View on GitHub
📚 Process PDFs, Word documents and more with spaCy
☆909Mar 27, 2026Updated 3 months ago
deepdoctection / notebooks
View on GitHub
Repository for deepdoctection tutorial notebooks
☆54Jan 1, 2026Updated 6 months ago