clovaai / donutLinks
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
☆6,771Updated last year
Alternatives and similar repositories for donut
Users that are interested in donut are comparing it to the libraries listed below
Sorting:
- A Repo For Document AI☆3,127Updated 2 weeks ago
- Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the o…☆2,838Updated last year
- Implementation of Nougat Neural Optical Understanding for Academic Documents☆9,809Updated 11 months ago
- An easy way to extract information from documents☆1,787Updated 2 years ago
- docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.☆5,789Updated last month
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,815Updated 9 months ago
- A curated list of resources for Document Understanding (DU) topic☆1,498Updated 2 years ago
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.☆2,840Updated last week
- A Unified Toolkit for Deep Learning Based Document Image Analysis☆5,641Updated last year
- mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding☆2,349Updated 8 months ago
- Structured data extraction and instruction calling with ML, LLM and Vision LLM☆5,101Updated last week
- OCR, layout analysis, reading order, table recognition in 90+ languages☆19,159Updated 3 months ago
- OpenMMLab Text Detection, Recognition and Understanding Toolbox☆4,713Updated last year
- Improved file parsing for LLM’s☆3,151Updated last year
- An open-source framework for training large multimodal models.☆4,064Updated last year
- A Bulletproof Way to Generate Structured JSON from Language Models☆4,892Updated last year
- Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)☆687Updated last year
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆13,715Updated this week
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception☆1,948Updated 9 months ago
- UniTable: Towards a Unified Table Foundation Model☆521Updated last year
- ☆1,036Updated 6 months ago
- Developer APIs to Accelerate LLM Projects☆1,741Updated last year
- a state-of-the-art-level open visual language model | 多模态预训练模型☆6,722Updated last year
- A language for constraint-guided and efficient LLM programming.☆4,139Updated 8 months ago
- PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.☆8,918Updated this week
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.☆1,276Updated 10 months ago
- ☆391Updated 2 years ago
- A collection of libraries to optimise AI model performances☆8,354Updated last year
- Large Language Model Text Generation Inference☆10,739Updated 3 weeks ago
- ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)☆3,759Updated 3 months ago