gordicaleksa / Open-NLLB
Effort to open-source NLLB checkpoints.
☆452 Updated last year
Alternatives and similar repositories for Open-NLLB
Users who are interested in Open-NLLB are comparing it to the libraries listed below.
- Repo for the Belebele dataset, a massively multilingual reading comprehension dataset. ☆332 Updated 7 months ago
- ☆527 Updated last year
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions ☆819 Updated 2 years ago
- ☆447 Updated last year
- Inference code for Mistral and Mixtral hacked up into original Llama implementation ☆371 Updated last year
- This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench… ☆591 Updated last year
- ☆175 Updated last year
- FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text) ☆240 Updated last year
- ☆158 Updated 2 years ago
- ☆264 Updated last year
- Gemma 2 optimized for your local machine. ☆376 Updated 11 months ago
- ☆1,133 Updated 5 months ago
- Tune any FALCON in 4-bit ☆467 Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free ☆232 Updated 8 months ago
- A bagel, with everything. ☆322 Updated last year
- Fine-tune mistral-7B on 3090s, a100s, h100s ☆715 Updated last year
- Run inference on MPT-30B using CPU ☆575 Updated 2 years ago
- Convenience scripts to finetune (chat-)LLaMa3 and other models for any language ☆310 Updated last year
- Place where folks can contribute to 🤗 community events ☆424 Updated last year
- [ICLR-2025-SLLM Spotlight 🔥] MobiLlama: Small Language Model tailored for edge devices ☆651 Updated 2 months ago
- Simple, hackable and fast implementation for training/finetuning medium-sized LLaMA-based models ☆176 Updated last week
- ☆205 Updated last year
- Salesforce open-source LLMs with 8k sequence length. ☆719 Updated 5 months ago
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining ☆701 Updated last year
- ☆864 Updated last year
- [NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture. ☆765 Updated 8 months ago
- OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA ☆302 Updated 2 years ago
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model". ☆914 Updated 8 months ago
- Ungreedy subword tokenizer and vocabulary trainer for Python, Go & Javascript ☆591 Updated last year
- TTS with The Massively Multilingual Speech (MMS) project ☆233 Updated last year