facebookresearch / nougat
Implementation of Nougat: Neural Optical Understanding for Academic Documents
☆9,500 Updated 4 months ago
Alternatives and similar repositories for nougat
Users who are interested in nougat are comparing it to the libraries listed below.
- QLoRA: Efficient Finetuning of Quantized LLMs ☆10,504 Updated last year
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning. ☆18,861 Updated this week
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond. ☆22,883 Updated 10 months ago
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks ☆6,915 Updated 11 months ago
- tiktoken is a fast BPE tokeniser for use with OpenAI's models. ☆14,935 Updated 3 months ago
- RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)… ☆13,744 Updated last week
- Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models" ☆12,119 Updated 6 months ago
- Convert PDF to markdown + JSON quickly with high accuracy ☆26,105 Updated last week
- OpenLLaMA, a permissively licensed open source reproduction of Meta AI's LLaMA 7B trained on the RedPajama dataset ☆7,501 Updated last year
- A guidance language for controlling large language models. ☆20,372 Updated this week
- Large Language Model Text Generation Inference ☆10,249 Updated this week
- Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022 ☆6,354 Updated 11 months ago
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls) ☆12,194 Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆17,676 Updated last week
- Inference Llama 2 in one file of pure C ☆18,491 Updated 10 months ago
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean… ☆11,670 Updated this week
- LLMs built upon Evol Instruct: WizardLM, WizardCoder, WizardMath ☆9,421 Updated 3 weeks ago
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. ☆38,770 Updated 3 weeks ago
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad… ☆6,073 Updated 9 months ago
- Code and documentation to train Stanford's Alpaca models, and generate the data. ☆30,043 Updated 11 months ago
- Train transformer language models with reinforcement learning. ☆14,281 Updated last week
- Tensor library for machine learning ☆12,712 Updated this week
- GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023) ☆7,682 Updated last year
- Accessible large language models via k-bit quantization for PyTorch. ☆7,150 Updated last week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. ☆8,589 Updated last year
- LlamaIndex is the leading framework for building LLM-powered agents over your data. ☆42,633 Updated this week
- An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All. ☆8,435 Updated last month
- A series of large language models trained from scratch by developers @01-ai ☆7,829 Updated 7 months ago
- Universal LLM Deployment Engine with ML Compilation ☆20,849 Updated this week
- The official GitHub page for the survey paper "A Survey of Large Language Models". ☆11,617 Updated 3 months ago