facebookresearch / nougatLinks
Implementation of Nougat Neural Optical Understanding for Academic Documents
☆9,736Updated 9 months ago
Alternatives and similar repositories for nougat
Users that are interested in nougat are comparing it to the libraries listed below
Sorting:
- Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the o…☆2,794Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆10,778Updated last year
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks☆7,142Updated last year
- Convert PDF to markdown + JSON quickly with high accuracy☆30,183Updated 2 weeks ago
- Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022☆6,694Updated last year
- Fast and memory-efficient exact attention☆20,904Updated this week
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆21,866Updated 5 months ago
- Math OCR model that outputs LaTeX and markdown☆1,101Updated 10 months ago
- Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).☆7,123Updated last month
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…☆6,087Updated 5 months ago
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆24,108Updated last year
- Train transformer language models with reinforcement learning.☆16,552Updated this week
- A Repo For Document AI☆3,099Updated this week
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆50,560Updated 3 weeks ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆8,026Updated 9 months ago
- a state-of-the-art-level open visual language model | 多模态预训练模型☆6,705Updated last year
- An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them…☆2,658Updated 4 months ago
- OCR, layout analysis, reading order, table recognition in 90+ languages☆18,959Updated last month
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.☆20,215Updated this week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆8,816Updated last year
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆13,304Updated 2 weeks ago
- Inference Llama 2 in one file of pure C☆18,995Updated last year
- LAVIS - A One-stop Library for Language-Vision Intelligence☆11,061Updated last year
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆10,195Updated last year
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆9,843Updated last year
- Foundational Models for State-of-the-Art Speech and Text Translation☆11,714Updated last year
- Go ahead and axolotl questions☆10,911Updated this week
- [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters☆5,924Updated last year
- pix2tex: Using a ViT to convert images of equations into LaTeX code.☆16,000Updated 10 months ago
- Latest Advances on Multimodal Large Language Models☆16,847Updated this week