facebookresearch / nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

☆9,546

Alternatives and similar repositories for nougat

Users who are interested in nougat are comparing it to the libraries listed below.

clovaai / donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
☆6,458 Updated last year
breezedeus / Pix2Text
An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them…
☆2,515 Updated last week
mlc-ai / mlc-llm
Universal LLM Deployment Engine with ML Compilation
☆21,039 Updated this week
VikParuchuri / texify
Math OCR model that outputs LaTeX and markdown
☆1,066 Updated 6 months ago
microsoft / table-transformer
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the o…
☆2,682 Updated last year
deepdoctection / deepdoctection
A Repo For Document AI
☆2,899 Updated last week
datalab-to / marker
Convert PDF to markdown + JSON quickly with high accuracy
☆26,856 Updated this week
Unstructured-IO / unstructured
Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…
☆12,035 Updated last week
lukas-blecher / LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code.
☆15,085 Updated 6 months ago
lm-sys / FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
☆38,929 Updated last month
haotian-liu / LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
☆23,180 Updated 11 months ago
artidoro / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆10,583 Updated last year
01-ai / Yi
A series of large language models trained from scratch by developers @01-ai
☆7,834 Updated 8 months ago
Filimoa / open-parse
Improved file parsing for LLMs
☆3,034 Updated 8 months ago
mit-han-lab / streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
☆6,949 Updated last year
huggingface / peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
☆19,184 Updated this week
opendatalab / PDF-Extract-Kit
A Comprehensive Toolkit for High-Quality PDF Content Extraction
☆8,236 Updated 6 months ago
AlibabaResearch / AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…
☆1,749 Updated 3 months ago
microsoft / unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
☆21,582 Updated 3 weeks ago
amazon-science / mm-cot
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)
☆3,951 Updated last year
run-llama / llama_index
LlamaIndex is the leading framework for building LLM-powered agents over your data.
☆43,322 Updated last week
zai-org / CogVLM
A state-of-the-art-level open visual language model | Multimodal pre-trained model
☆6,626 Updated last year
jzhang38 / TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
☆8,667 Updated last year
zai-org / GLM-130B
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
☆7,685 Updated 2 years ago
datalab-to / surya
OCR, layout analysis, reading order, table recognition in 90+ languages
☆17,882 Updated this week
meta-llama / llama
Inference code for Llama models
☆58,577 Updated 6 months ago
microsoft / LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
☆12,443 Updated 7 months ago
nlpxucan / WizardLM
LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath
☆9,436 Updated last month
huggingface / trl
Train transformer language models with reinforcement learning.
☆14,736 Updated this week
apple / ml-ferret
☆8,644 Updated 9 months ago