clovaai / donutLinks
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
☆6,485Updated last year
Alternatives and similar repositories for donut
Users that are interested in donut are comparing it to the libraries listed below
Sorting:
- A Repo For Document AI☆2,919Updated 3 weeks ago
- Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the o…☆2,696Updated last year
- docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.☆5,087Updated last week
- Implementation of Nougat Neural Optical Understanding for Academic Documents☆9,583Updated 5 months ago
- An easy way to extract information from documents☆1,772Updated 2 years ago
- A curated list of resources for Document Understanding (DU) topic☆1,451Updated 2 years ago
- OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset☆7,518Updated 2 years ago
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…☆6,083Updated last month
- LLM(😽)☆1,686Updated 6 months ago
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,759Updated 4 months ago
- A Unified Toolkit for Deep Learning Based Document Image Analysis☆5,438Updated last year
- A Python library to extract tabular data from PDFs☆3,383Updated last week
- The RedPajama-Data repository contains code for preparing large datasets for training large language models.☆4,791Updated 8 months ago
- ☆1,002Updated last month
- This repository contains the code and implementation details of the CascadeTabNet paper "CascadeTabNet: An approach for end to end table …☆1,541Updated 3 years ago
- Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)☆656Updated last year
- Improved file parsing for LLM’s☆3,037Updated 9 months ago
- [ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings☆1,995Updated 7 months ago
- OCR, layout analysis, reading order, table recognition in 90+ languages☆18,192Updated last week
- Transforms PDF, Documents and Images into Enriched Structured Data☆5,994Updated last year
- CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.☆5,121Updated 6 months ago
- img2table is a table identification and extraction Python Library for PDF and images, based on OpenCV image processing☆778Updated last week
- [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters☆5,890Updated last year
- OpenMMLab Text Detection, Recognition and Understanding Toolbox☆4,602Updated 8 months ago
- Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"☆1,704Updated last year
- ☆1,709Updated 10 months ago
- Official implementation of Character Region Awareness for Text Detection (CRAFT)☆3,288Updated last year
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath☆9,450Updated 2 months ago
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆12,368Updated this week
- ☆2,164Updated 11 months ago