facebookresearch / nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
☆9,546Updated 5 months ago
Alternatives and similar repositories for nougat
Users who are interested in nougat are comparing it to the libraries listed below.
- Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022☆6,458Updated last year
- An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them…☆2,515Updated last week
- Universal LLM Deployment Engine with ML Compilation☆21,039Updated this week
- Math OCR model that outputs LaTeX and markdown☆1,066Updated 6 months ago
- Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the o…☆2,682Updated last year
- A Repo For Document AI☆2,899Updated last week
- Convert PDF to markdown + JSON quickly with high accuracy☆26,856Updated this week
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆12,035Updated last week
- pix2tex: Using a ViT to convert images of equations into LaTeX code.☆15,085Updated 6 months ago
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆38,929Updated last month
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆23,180Updated 11 months ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆10,583Updated last year
- A series of large language models trained from scratch by developers @01-ai☆7,834Updated 8 months ago
- Improved file parsing for LLMs☆3,034Updated 8 months ago
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks☆6,949Updated last year
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.☆19,184Updated this week
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆8,236Updated 6 months ago
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,749Updated 3 months ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆21,582Updated 3 weeks ago
- Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)☆3,951Updated last year
- LlamaIndex is the leading framework for building LLM-powered agents over your data.☆43,322Updated last week
- A state-of-the-art-level open visual language model | multimodal pre-trained model☆6,626Updated last year
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆8,667Updated last year
- GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)☆7,685Updated 2 years ago
- OCR, layout analysis, reading order, table recognition in 90+ languages☆17,882Updated this week
- Inference code for Llama models☆58,577Updated 6 months ago
- Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"☆12,443Updated 7 months ago
- LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath☆9,436Updated last month
- Train transformer language models with reinforcement learning.☆14,736Updated this week
- ☆8,644Updated 9 months ago