facebookresearch / nougatLinks
Implementation of Nougat Neural Optical Understanding for Academic Documents
☆9,583Updated 6 months ago
Alternatives and similar repositories for nougat
Users that are interested in nougat are comparing it to the libraries listed below
Sorting:
- A Repo For Document AI☆2,927Updated this week
- Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the o…☆2,706Updated last year
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath☆9,450Updated 2 months ago
- Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022☆6,490Updated last year
- OCR, layout analysis, reading order, table recognition in 90+ languages☆18,337Updated this week
- Math OCR model that outputs LaTeX and markdown☆1,072Updated 6 months ago
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks☆7,025Updated last year
- An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them…☆2,536Updated 3 weeks ago
- Multi-tool for semantic search☆2,641Updated 11 months ago
- Improved file parsing for LLM’s☆3,044Updated 9 months ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆10,617Updated last year
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.☆3,931Updated 7 months ago
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆12,368Updated last week
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.☆4,623Updated last year
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,640Updated 3 months ago
- Build, customize and control you own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-sour…☆2,663Updated 10 months ago
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.☆2,734Updated 5 months ago
- mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding☆2,238Updated 2 months ago
- A Bulletproof Way to Generate Structured JSON from Language Models☆4,797Updated last year
- Structured Outputs☆12,384Updated this week
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆9,859Updated last year
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…☆6,083Updated last month
- A guidance language for controlling large language models.☆20,587Updated this week
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,759Updated 4 months ago
- prompt2model - Generate Deployable Models from Natural Language Instructions☆2,011Updated 7 months ago
- Code for Paper: “Low-Resource” Text Classification: A Parameter-Free Classification Method with Compressors☆1,773Updated 2 years ago
- High accuracy RAG for answering questions from scientific documents with citations☆7,639Updated this week
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆21,632Updated last month
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆9,763Updated 11 months ago
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆8,375Updated 7 months ago