facebookresearch / nougatLinks
Implementation of Nougat Neural Optical Understanding for Academic Documents
☆9,717Updated 8 months ago
Alternatives and similar repositories for nougat
Users that are interested in nougat are comparing it to the libraries listed below
Sorting:
- Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022☆6,661Updated last year
- OCR, layout analysis, reading order, table recognition in 90+ languages☆18,888Updated 3 weeks ago
- Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the o…☆2,777Updated last year
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆8,923Updated 10 months ago
- Convert PDF to markdown + JSON quickly with high accuracy☆29,799Updated last week
- An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them…☆2,638Updated 3 months ago
- Math OCR model that outputs LaTeX and markdown☆1,093Updated 9 months ago
- Inference Llama 2 in one file of pure C☆18,937Updated last year
- A Repo For Document AI☆3,068Updated this week
- QLoRA: Efficient Finetuning of Quantized LLMs☆10,744Updated last year
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆8,004Updated 9 months ago
- ☆4,105Updated last year
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath☆9,456Updated 5 months ago
- A machine learning software for extracting information from scholarly documents☆4,431Updated this week
- PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.☆8,439Updated last week
- Python bindings for llama.cpp☆9,735Updated 3 months ago
- ImageBind One Embedding Space to Bind Them All☆8,859Updated last month
- Code and documentation to train Stanford's Alpaca models, and generate the data.☆30,218Updated last year
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,761Updated 6 months ago
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,796Updated 7 months ago
- High accuracy RAG for answering questions from scientific documents with citations☆7,835Updated this week
- Improved file parsing for LLM’s☆3,135Updated last year
- The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi…☆52,493Updated last year
- All things prompt engineering☆5,702Updated last year
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.☆4,640Updated last year
- High-speed Large Language Model Serving for Local Deployment☆8,388Updated 3 months ago
- A real world full-stack application using LlamaIndex☆2,574Updated 8 months ago
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆6,152Updated 2 months ago
- a state-of-the-art-level open visual language model | 多模态预训练模型☆6,689Updated last year
- docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.☆5,623Updated last week