fe1ixxu / ALMALinks

State-of-the-art LLM-based translation models.

☆548

Alternatives and similar repositories for ALMA

Users that are interested in ALMA are comparing it to the libraries listed below

Sorting:

ZNLP / BigTranslate
BigTranslate: Augmenting Large Language Models with Multilingual Translation Capability over 100 Languages
☆227Updated last year
hsing-wang / Awesome-LLM-MT
☆243Updated last year
Unbabel / COMET
A Neural Framework for MT Evaluation
☆642Updated this week
openlanguagedata / flores
The FLORES+ Machine Translation Benchmark
☆106Updated 8 months ago
rayliuca / T-Ragx
Enhancing Translation with RAG-Powered Large Language Models
☆81Updated 4 months ago
facebookresearch / stopes
A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…
☆282Updated 6 months ago
mzbac / llama2-fine-tune
Scripts for fine-tuning Llama2 via SFT and DPO.
☆203Updated last year
eole-nlp / eole
Open language modeling toolkit based on PyTorch
☆138Updated 3 weeks ago
wxjiao / ParroT
The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1…
☆177Updated 7 months ago
FreedomIntelligence / MultilingualSIFT
MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning
☆94Updated last year
naver / nllb-pruning
Library for pruning experts per language pair in NLLB-200
☆33Updated 2 years ago
Guitaricet / relora
Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates
☆458Updated last year
jondurbin / bagel
A bagel, with everything.
☆323Updated last year
salesforce / DialogStudio
DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI
☆513Updated 6 months ago
huggingface / cosmopedia
☆529Updated 8 months ago
facebookresearch / belebele
Repo for the Belebele dataset, a massively multilingual reading comprehension dataset.
☆335Updated 7 months ago
ChenghaoMou / text-dedup
All-in-one text de-duplication
☆706Updated 2 weeks ago
linhduongtuan / BLOOM-LORA
Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigs…
☆184Updated 2 years ago
TencentARC / LLaMA-Pro
[ACL 2024] Progressive LLaMA with Block Expansion.
☆508Updated last year
ymoslem / Adaptive-MT-LLM-Fine-tuning
Fine-tuning Open-Source LLMs for Adaptive Machine Translation
☆85Updated 3 weeks ago
vipulraheja / coedit
Official implementation of the paper "CoEdIT: Text Editing by Task-Specific Instruction Tuning" (EMNLP 2023)
☆129Updated 10 months ago
nlp-uoregon / mlmm-evaluation
Multilingual Large Language Models Evaluation Benchmark
☆128Updated 11 months ago
SeanLee97 / AnglE
Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard
☆550Updated 4 months ago
princeton-nlp / LLM-Shearing
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
☆626Updated last year
cisnlp / Glot500
Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023
☆103Updated last year
cisnlp / GlotLID
💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023
☆147Updated 2 months ago
neelsjain / NEFTune
Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning
☆397Updated last year
bigscience-workshop / xmtf
Crosslingual Generalization through Multitask Finetuning
☆537Updated 10 months ago
datamllab / LongLM
[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
☆660Updated last year
FranxYao / Long-Context-Data-Engineering
Implementation of paper Data Engineering for Scaling Language Models to 128K Context
☆468Updated last year