facebookresearch / nougat
Implementation of Nougat: Neural Optical Understanding for Academic Documents
☆9,626 · Updated 6 months ago
Alternatives and similar repositories for nougat
Users who are interested in nougat are comparing it to the repositories listed below.
- QLoRA: Efficient Finetuning of Quantized LLMs ☆10,648 · Updated last year
- Inference Llama 2 in one file of pure C ☆18,722 · Updated last year
- Convert PDF to markdown + JSON quickly with high accuracy ☆28,444 · Updated this week
- Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022 ☆6,524 · Updated last year
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond. ☆23,495 · Updated last year
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆18,509 · Updated this week
- pix2tex: Using a ViT to convert images of equations into LaTeX code. ☆15,212 · Updated 7 months ago
- Foundational Models for State-of-the-Art Speech and Text Translation ☆11,654 · Updated 9 months ago
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad… ☆6,079 · Updated 2 months ago
- ☆4,634 · Updated 2 months ago
- Tensor library for machine learning ☆13,134 · Updated this week
- Interact with your documents using the power of GPT, 100% privately, no data leaks ☆56,559 · Updated 9 months ago
- The RedPajama-Data repository contains code for preparing large datasets for training large language models. ☆4,805 · Updated 9 months ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. ☆40,001 · Updated this week
- Universal LLM Deployment Engine with ML Compilation ☆21,259 · Updated last week
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading ☆9,787 · Updated last year
- A Unified Toolkit for Deep Learning Based Document Image Analysis ☆5,470 · Updated last year
- A Repo For Document AI ☆2,951 · Updated 2 weeks ago
- A series of large language models trained from scratch by developers @01-ai ☆7,845 · Updated 9 months ago
- a state-of-the-art-level open visual language model | multimodal pretrained model ☆6,658 · Updated last year
- An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them… ☆2,553 · Updated last month
- PDF GPT allows you to chat with the contents of your PDF file by using GPT capabilities. The most effective open source solution to turn … ☆7,145 · Updated 6 months ago
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls) ☆12,396 · Updated 2 weeks ago
- tiktoken is a fast BPE tokeniser for use with OpenAI's models. ☆15,825 · Updated last week
- An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All. ☆8,460 · Updated last month
- Math OCR model that outputs LaTeX and markdown ☆1,075 · Updated 7 months ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model ☆7,826 · Updated 7 months ago
- Multi-tool for semantic search ☆2,648 · Updated last year
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. ☆12,726 · Updated this week
- Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/) ☆25,738 · Updated last year