clovaai / donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
☆6,175Updated 9 months ago
Alternatives and similar repositories for donut:
Users that are interested in donut are comparing it to the libraries listed below
- A Repo For Document AI☆2,792Updated last week
- docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.☆4,564Updated this week
- Running large language models on a single GPU for throughput-oriented scenarios.☆9,301Updated 5 months ago
- Implementation of Nougat Neural Optical Understanding for Academic Documents☆9,403Updated last month
- OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset☆7,476Updated last year
- Numbers every LLM developer should know☆4,204Updated last year
- A collection of libraries to optimise AI model performances☆8,369Updated 8 months ago
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)☆11,986Updated this week
- QLoRA: Efficient Finetuning of Quantized LLMs☆10,394Updated 10 months ago
- Simple UI for LLM Model Finetuning☆2,062Updated last year
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…☆6,048Updated 7 months ago
- 📋 A list of open LLMs available for commercial use.☆11,904Updated 2 months ago
- Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the o…☆2,583Updated 9 months ago
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.☆11,153Updated this week
- The RedPajama-Data repository contains code for preparing large datasets for training large language models.☆4,703Updated 4 months ago
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆9,582Updated 7 months ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆21,069Updated last month
- 💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows☆10,720Updated this week
- Home of StarCoder: fine-tuning & inference!☆7,409Updated last year
- A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain☆3,475Updated last year
- Build, customize and control you own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-sour…☆2,642Updated 6 months ago
- [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters☆5,857Updated last year
- A Bulletproof Way to Generate Structured JSON from Language Models☆4,695Updated last year
- Adding guardrails to large language models.☆4,808Updated last week
- Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)☆3,910Updated 10 months ago
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,688Updated last week
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath☆9,370Updated 8 months ago
- Instruct-tune LLaMA on consumer hardware☆18,896Updated 8 months ago
- a state-of-the-art-level open visual language model | 多模态预训练模型☆6,476Updated 10 months ago
- Structured Text Generation☆11,369Updated this week