samakos / Document-AI-
☆14Updated last year
Alternatives and similar repositories for Document-AI-
Users that are interested in Document-AI- are comparing it to the libraries listed below
Sorting:
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023☆14Updated 5 months ago
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆17Updated 2 years ago
- ☆22Updated this week
- BoundaryNet - A Semi-Automatic Layout Annotation Tool☆24Updated 3 years ago
- [AAAI 2025] DocKylin: A Large Multimodal Model for Visual Document Understanding with Efficient Visual Slimming☆20Updated 5 months ago
- [ICDAR 2024] (Best Student Paper🏆) Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation☆14Updated 8 months ago
- Datasets and Evaluation Scripts for CompHRDoc☆38Updated 2 months ago
- DocReal: Robust Document Dewarping of Real-Life Images via Attention-Enhanced Control Point Prediction☆20Updated last year
- ☆18Updated last year
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆28Updated last year
- An official implementation of paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"☆78Updated last year
- Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)☆52Updated 11 months ago
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions☆43Updated last year
- Project page for the ICDAR 2023 Paper "Inv3D: a high-resolution 3D invoice dataset for template-guided single-image document unwarping".☆13Updated last year
- Dreambooth (LoRA) with well-organized code structure. Naive adaptation from 🤗Diffusers.☆13Updated 2 years ago
- CTE: Contextualized Table Extraction Dataset☆17Updated 2 years ago
- The official code for “DeepEraser: Deep Iterative Context Mining for Generic Text Eraser”, TMM, 2024.☆36Updated 8 months ago
- A tiny package supporting distributed computation of COCO metrics for PyTorch models.☆11Updated 2 years ago
- ☆11Updated 5 months ago
- Bibliometric. A Python framework designed for the analysis and evaluation of scholarly publications.☆16Updated last month
- Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition`☆17Updated last year
- Official Pytorch Implementation of Self-emerging Token Labeling☆33Updated last year
- ☆13Updated 4 months ago
- Deploy Swin Transformer using TorchServe☆27Updated 3 years ago
- Official Implementation of SCOB [ICCV 2023]☆22Updated last year
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated 3 weeks ago
- ☆38Updated 11 months ago
- Official code repository for paper: "ExPLoRA: Parameter-Efficient Extended Pre-training to Adapt Vision Transformers under Domain Shifts"☆31Updated 7 months ago
- The official code for “Geometric Representation Learning for Document Image Rectification”, ECCV, 2022.☆81Updated last month
- The official code of Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition (IJCAI2023)☆27Updated last year