mit1208 / Document-AI
☆19 Updated last year
Alternatives and similar repositories for Document-AI
Users that are interested in Document-AI are comparing it to the libraries listed below
- ☆115 Updated last week
- ☆22 Updated last year
- DocLLM: A layout-aware generative language model for multimodal document understanding ☆126 Updated last year
- A Unified Toolkit for Deep Learning-Based Table Extraction ☆35 Updated 5 months ago
- Logical structure analysis for visually structured documents ☆89 Updated 2 years ago
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions ☆43 Updated last year
- Doc2Graph transforms documents into graphs and exploits a GNN to solve several tasks. ☆121 Updated last year
- ☆32 Updated last year
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment. ☆51 Updated 7 months ago
- DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis ☆340 Updated 2 years ago
- Repository for deepdoctection tutorial notebooks ☆45 Updated 5 months ago
- Recognition of handwritten text using CRAFT text detection and TrOCR ☆26 Updated 2 years ago
- Object Detection Model for Scanned Documents ☆93 Updated 2 months ago
- `pdfstructure` detects, splits and organizes the document's text content into its natural structure as envisioned by the author. ☆104 Updated last year
- My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model" ☆73 Updated last month
- Small python package to measure OCR quality and other related metrics. ☆21 Updated last year
- Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating. ☆190 Updated 2 months ago
- Create fast graph language models from converted PDF documents for knowledge extraction and Q&A. ☆50 Updated 3 months ago
- ☆180 Updated last month
- GLiNER model in a FastAPI microservice. ☆44 Updated 5 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆76 Updated 6 months ago
- Graph-based Layout Analysis Model ☆15 Updated 7 months ago
- Datasets and Evaluation Scripts for CompHRDoc ☆38 Updated 2 months ago
- UniTable: Towards a Unified Table Foundation Model ☆467 Updated 11 months ago
- YOLOv10 trained on DocLayNet dataset. ☆72 Updated 6 months ago
- Generalist and Lightweight Model for Text Classification ☆128 Updated 2 weeks ago
- A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating… ☆181 Updated 8 months ago
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets. ☆65 Updated 7 months ago
- OnnxTR: an ONNX pipeline wrapper for the docTR (Document Text Recognition) library, for seamless, high-performing & accessible OCR ☆109 Updated this week
- Repository of the ICNLSP 2024 paper "Efficient Few-shot Learning for Multi-label Classification of Scientific Documents with Many Classes… ☆15 Updated 4 months ago