facebookresearch / nougatLinks
Implementation of Nougat Neural Optical Understanding for Academic Documents
☆9,788Updated 10 months ago
Alternatives and similar repositories for nougat
Users that are interested in nougat are comparing it to the libraries listed below
Sorting:
- Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022☆6,749Updated last year
- OCR, layout analysis, reading order, table recognition in 90+ languages☆19,112Updated 2 months ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆8,055Updated 11 months ago
- Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the o…☆2,831Updated last year
- A Repo For Document AI☆3,121Updated this week
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,807Updated 9 months ago
- High-speed Large Language Model Serving for Local Deployment☆8,572Updated 5 months ago
- Convert PDF to markdown + JSON quickly with high accuracy☆30,905Updated last week
- Improved file parsing for LLM’s☆3,150Updated last year
- Large World Model -- Modeling Text and Video with Millions Context☆7,394Updated last year
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆10,261Updated last year
- DSPy: The framework for programming—not prompting—language models☆31,545Updated this week
- mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding☆2,264Updated 7 months ago
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.☆2,819Updated this week
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception☆1,923Updated 9 months ago
- Math OCR model that outputs LaTeX and markdown☆1,104Updated 11 months ago
- High accuracy RAG for answering questions from scientific documents with citations☆7,995Updated this week
- tiny vision language model☆9,218Updated 2 months ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆8,867Updated last year
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆9,114Updated last year
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks☆7,169Updated last year
- Converts text input or URL into knowledge graph and displays☆3,540Updated 2 years ago
- PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.☆8,826Updated this week
- Inference Llama 2 in one file of pure C☆19,106Updated last year
- A Bulletproof Way to Generate Structured JSON from Language Models☆4,876Updated last year
- OpenChat: Advancing Open-source Language Models with Imperfect Data☆5,468Updated last year
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆13,588Updated last week
- Examples in the MLX framework☆8,134Updated last month
- ☆4,112Updated last year
- A language for constraint-guided and efficient LLM programming.☆4,126Updated 7 months ago