mit1280 / Document-AI

☆18

Alternatives and similar repositories for Document-AI:

Users who are interested in Document-AI are comparing it to the libraries listed below.

dswang2011 / DocLLM
DocLLM: A layout-aware generative language model for multimodal document understanding
☆119 Updated last year
GeorgeLuImmortal / DocLLM_reimplementation
☆21 Updated 10 months ago
andreagemelli / doc2graph
Doc2Graph transforms documents into graphs and exploits a GNN to solve several tasks.
☆117 Updated last year
henrikalbihn / gliner-as-a-service
GLiNER model in a FastAPI microservice.
☆34 Updated last month
SCUT-DLVCLab / Document-AI-Recommendations
Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.
☆174 Updated last month
DS4SD / docling-ibm-models
☆65 Updated this week
s-emanuilov / litepali
LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.
☆35 Updated 3 months ago
butlerlabs / docai
DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning mod…
☆19 Updated 2 years ago
LynnHaDo / Document-Layout-Analysis
Object Detection Model for Scanned Documents
☆86 Updated last year
TIGER-AI-Lab / StructLM
Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)
☆74 Updated 3 months ago
deepdoctection / notebooks
Repository for deepdoctection tutorial notebooks
☆40 Updated last month
DS4SD / deepsearch-examples
Examples using the Deep Search functionalities
☆56 Updated this week
CycloneBoy / pdf_table
A Unified Toolkit for Deep Learning-Based Table Extraction
☆28 Updated 2 months ago
DS4SD / docling-parse
Simple package to extract text with coordinates from programmatic PDFs
☆48 Updated this week
felixdittrich92 / OnnxTR
OnnxTR: an ONNX pipeline wrapper for the docTR (Document Text Recognition) library, for seamless, high-performing & accessible OCR
☆77 Updated this week
plaggy / rag-containers
Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.
☆55 Updated 3 weeks ago
poloclub / tsr-convstem
High-Performance Transformers for Table Structure Recognition Need Early Convolutions
☆42 Updated 9 months ago
ppaanngggg / yolo-doclaynet
YOLO models trained on DocLayNet - power your Document Intelligence with Layout Analysis
☆81 Updated 2 weeks ago
wjbmattingly / qwen2-vl-finetune-huggingface
This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.
☆59 Updated 4 months ago
kyegomez / Kosmos2.5
My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"
☆70 Updated this week
DS4SD / deepsearch-glm
Create fast graph language models from converted PDF documents for knowledge extraction and Q&A.
☆33 Updated last month
davidberenstein1957 / dataset-viber
Dataset Viber is your chill repo for data collection, annotation and vibe checks.
☆44 Updated 4 months ago
stephenleo / llm-structured-output-benchmarks
Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…
☆140 Updated 3 months ago
Unstructured-IO / unstructured-inference
☆167 Updated this week
microsoft / CompHRDoc
Datasets and Evaluation Scripts for CompHRDoc
☆31 Updated 9 months ago
nlmatics / nlm-tika
☆22 Updated 7 months ago
allanj / LayoutLMv3-DocVQA
Example codebase for fine-tuning LayoutLMv3 on DocVQA
☆49 Updated 2 years ago
Knowledgator / GLiClass
Generalist and Lightweight Model for Text Classification
☆58 Updated 2 weeks ago
google-research-datasets / vrdu
We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas…
☆76 Updated last year
neuralwork / instruct-finetune-mistral
Fine-tune Mistral 7B to generate fashion style suggestions
☆33 Updated last year