clovaai / donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
☆6,630 Updated last year
Alternatives and similar repositories for donut
Users who are interested in donut are comparing it to the libraries listed below.
- A Repo For Document AI ☆2,992 Updated this week
- Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the o… ☆2,769 Updated last year
- docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning. ☆5,581 Updated this week
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team… ☆1,793 Updated 6 months ago
- mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding ☆2,257 Updated 5 months ago
- An easy way to extract information from documents ☆1,780 Updated 2 years ago
- Implementation of Nougat Neural Optical Understanding for Academic Documents ☆9,696 Updated 8 months ago
- A curated list of resources for Document Understanding (DU) topic ☆1,472 Updated 2 years ago
- Improved file parsing for LLM’s ☆3,123 Updated 11 months ago
- OpenMMLab Text Detection, Recognition and Understanding Toolbox ☆4,669 Updated 11 months ago
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections. ☆2,768 Updated 8 months ago
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond. ☆23,870 Updated last year
- A collection of libraries to optimise AI model performances ☆8,368 Updated last year
- Running large language models on a single GPU for throughput-oriented scenarios. ☆9,373 Updated last year
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities ☆21,798 Updated 4 months ago
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆18,813 Updated 2 weeks ago
- [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters ☆5,910 Updated last year
- Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XM… ☆4,847 Updated last month
- ☆1,028 Updated 3 months ago
- This repository contains the code and implementation details of the CascadeTabNet paper "CascadeTabNet: An approach for end to end table … ☆1,549 Updated 4 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs ☆10,734 Updated last year
- Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated) ☆3,975 Updated last year
- Instruct-tune LLaMA on consumer hardware ☆18,977 Updated last year
- RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)… ☆14,088 Updated last week
- ImageBind One Embedding Space to Bind Them All ☆8,847 Updated last month
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad… ☆6,082 Updated 4 months ago
- ☆666 Updated 5 months ago
- Efficient few-shot learning with Sentence Transformers ☆2,592 Updated 3 months ago
- ☆983 Updated last year
- General technology for enabling AI capabilities w/ LLMs and MLLMs ☆4,160 Updated 4 months ago