clovaai / donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
☆6,217Updated 10 months ago
Alternatives and similar repositories for donut
Users that are interested in donut are comparing it to the libraries listed below
Sorting:
- A Repo For Document AI☆2,818Updated this week
- Implementation of Nougat Neural Optical Understanding for Academic Documents☆9,435Updated 2 months ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆10,423Updated 11 months ago
- docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.☆4,645Updated last week
- Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the o…☆2,601Updated 10 months ago
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,708Updated last month
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…☆6,053Updated 8 months ago
- Containers for machine learning☆8,579Updated this week
- A Bulletproof Way to Generate Structured JSON from Language Models☆4,715Updated last year
- An open-source framework for training large multimodal models.☆3,909Updated 8 months ago
- ImageBind One Embedding Space to Bind Them All☆8,632Updated 9 months ago
- 💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows☆10,876Updated last week
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath☆9,393Updated 9 months ago
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.☆11,237Updated last week
- StableLM: Stability AI Language Models☆15,832Updated last year
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)☆12,066Updated last week
- A curated list of resources for Document Understanding (DU) topic☆1,406Updated last year
- An easy way to extract information from documents☆1,753Updated 2 years ago
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.☆4,593Updated last year
- OpenMMLab Text Detection, Recognition and Understanding Toolbox☆4,522Updated 5 months ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆21,193Updated 2 months ago
- [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters☆5,869Updated last year
- Instruct-tune LLaMA on consumer hardware☆18,900Updated 9 months ago
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.☆18,320Updated this week
- A collection of libraries to optimise AI model performances☆8,373Updated 9 months ago
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.☆2,633Updated 2 months ago
- LLM as a Chatbot Service☆3,319Updated last year
- the AI-native open-source embedding database☆19,694Updated this week
- tiktoken is a fast BPE tokeniser for use with OpenAI's models.☆14,413Updated last month
- A Unified Toolkit for Deep Learning Based Document Image Analysis☆5,235Updated 8 months ago