Effort to open-source NLLB checkpoints.
☆476May 29, 2024Updated last year
Alternatives and similar repositories for Open-NLLB
Users that are interested in Open-NLLB are comparing it to the libraries listed below
Sorting:
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆297Updated this week
- Convert all of libgen to high quality markdown☆255Dec 13, 2023Updated 2 years ago
- Common tasks in a single model☆35Jan 10, 2024Updated 2 years ago
- Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥☆1,684Jan 14, 2025Updated last year
- ☆13Aug 23, 2024Updated last year
- AITuber Server☆153Nov 14, 2024Updated last year
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆197May 6, 2024Updated last year
- 10ms, sd-turbo, 512x512, batch size 1, txt2img on consumer hardware☆19Dec 8, 2023Updated 2 years ago
- Official Repo for the Paper: CHATANYTHING: FACETIME CHAT WITH LLM-ENHANCED PERSONAS☆381Nov 26, 2023Updated 2 years ago
- State-of-the-art LLM-based translation models.☆579Apr 9, 2025Updated 10 months ago
- Foundational Models for State-of-the-Art Speech and Text Translation☆11,762Nov 14, 2024Updated last year
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆88Oct 20, 2023Updated 2 years ago
- ☆718Mar 6, 2024Updated 2 years ago
- Fast inference engine for Transformer models☆4,342Feb 4, 2026Updated last month
- This repository contains multi-modal speech data for African languages that can be used to train ASR and NLP models☆17Aug 31, 2022Updated 3 years ago
- NTREX -- News Test References for MT Evaluation☆88Jun 5, 2024Updated last year
- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆188Nov 19, 2025Updated 3 months ago
- This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench…☆600Nov 17, 2023Updated 2 years ago
- An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents☆5,876Sep 26, 2024Updated last year
- Data extraction with LLM on CPU☆271Mar 26, 2024Updated last year
- Merge Transformers language models by use of gradient parameters.☆214Aug 8, 2024Updated last year
- An Open Source text-to-speech system built by inverting Whisper.☆4,567Dec 14, 2025Updated 2 months ago
- Repository for analysis and experiments in the BigCode project.☆128Mar 20, 2024Updated last year
- Salesforce open-source LLMs with 8k sequence length.☆725Jan 31, 2025Updated last year
- ☆51Oct 17, 2023Updated 2 years ago
- Fine tune SDXL on YouTube videos☆182Aug 20, 2024Updated last year
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models☆6,187Aug 10, 2024Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆10,843Jun 10, 2024Updated last year
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆24,500Aug 12, 2024Updated last year
- Robust recipes to align language models with human and AI preferences☆5,510Sep 8, 2025Updated 5 months ago
- The RedPajama-Data repository contains code for preparing large datasets for training large language models.☆4,923Dec 7, 2024Updated last year
- ☆64Apr 9, 2024Updated last year
- Video Search and Streaming Agent 🕵️♂️☆503Jan 31, 2024Updated 2 years ago
- OpenChat: Advancing Open-source Language Models with Imperfect Data☆5,475Sep 13, 2024Updated last year
- Official implementation of "Separate Anything You Describe"☆1,876Nov 26, 2024Updated last year
- ☆51Jul 25, 2024Updated last year
- Repo for the Belebele dataset, a massively multilingual reading comprehension dataset.☆340Dec 18, 2024Updated last year
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks☆7,196Jul 11, 2024Updated last year
- OpenAI API and Whisper based Video Translation☆75Dec 9, 2024Updated last year