mit1208 / Document-AILinks

☆18

Alternatives and similar repositories for Document-AI

Users that are interested in Document-AI are comparing it to the libraries listed below

Sorting:

dswang2011 / DocLLM
DocLLM: A layout-aware generative language model for multimodal document understanding
☆126Updated last year
sebischair / FusionSent
Repository of the ICNLSP 2024 paper "Efficient Few-shot Learning for Multi-label Classification of Scientific Documents with Many Classes…
☆17Updated 6 months ago
ppaanngggg / yolo-doclaynet
YOLO models trained by DocLayNet - power your Document Intelligent by Layout Analysis
☆120Updated 4 months ago
DS4SD / deepsearch-glm
Create fast graph language models from converted PDF documents for knowledge extraction and Q&A.
☆55Updated 5 months ago
deepdoctection / notebooks
Repository for deepdoctection tutorial notebooks
☆46Updated last month
LynnHaDo / Document-Layout-Analysis
Object Detection Model for Scanned Documents
☆94Updated 4 months ago
andreagemelli / doc2graph
Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.
☆128Updated 2 years ago
CycloneBoy / pdf_table
A Unified Toolkit for Deep Learning-Based Table Extraction
☆41Updated 8 months ago
docling-project / docling-ibm-models
☆130Updated last week
docling-project / docling-parse
Simple package to extract text with coordinates from programmatic PDFs
☆141Updated last week
henrikalbihn / gliner-as-a-service
GLiNER model in a FastAPI microservice.
☆44Updated 7 months ago
wjbmattingly / qwen2-vl-finetune-huggingface
This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.
☆73Updated this week
kyegomez / Kosmos2.5
My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"
☆72Updated 3 months ago
docling-project / docling-core
A python library to define and validate data types in Docling.
☆155Updated this week
ai8hyf / TF-ID
TF-ID: Table/Figure IDentifier for academic papers
☆238Updated last year
DS4SD / deepsearch-examples
Examples using the Deep Search functionalities
☆81Updated 5 months ago
Update-For-Integrated-Business-AI / CORU
☆11Updated last week
Unstructured-IO / unstructured-inference
☆188Updated 2 weeks ago
lfoppiano / structure-vision
Viewer for the structure extracted by Grobid on PDF documents
☆52Updated 2 months ago
moured / YOLOv10-Document-Layout-Analysis
YOLOv10 trained on DocLayNet dataset.
☆77Updated 8 months ago
marieai / marie-ai
Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…
☆70Updated this week
s-emanuilov / litepali
LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.
☆52Updated 9 months ago
TIGER-AI-Lab / StructLM
Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)
☆75Updated 9 months ago
CaseDrive / publaynet-models
Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset
☆28Updated 2 years ago
felixdittrich92 / OnnxTR
OnnxTR a docTR (Document Text Recognition) library Onnx pipeline wrapper - for seamless, high-performing & accessible OCR
☆136Updated 3 weeks ago
SCUT-DLVCLab / Document-AI-Recommendations
Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.
☆194Updated 4 months ago
microsoft / CompHRDoc
Datasets and Evaluation Scripts for CompHRDoc
☆46Updated 4 months ago
godatadriven / llm-archetype-batch-use-case
General solution to archetype LLM batch use case
☆34Updated last year
GeorgeLuImmortal / DocLLM_reimplementation
☆22Updated last year
plaggy / rag-containers
Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.
☆67Updated 6 months ago