ZNLP / BigTranslateLinks

BigTranslate: Augmenting Large Language Models with Multilingual Translation Capability over 100 Languages

☆227

Alternatives and similar repositories for BigTranslate

Users that are interested in BigTranslate are comparing it to the libraries listed below

Sorting:

wxjiao / ParroT
The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1…
☆177Updated 7 months ago
fe1ixxu / ALMA
State-of-the-art LLM-based translation models.
☆547Updated 3 months ago
FreedomIntelligence / MultilingualSIFT
MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning
☆94Updated last year
linhduongtuan / BLOOM-LORA
Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigs…
☆184Updated 2 years ago
yangkevin2 / doc-story-generation
☆158Updated last year
zsc / llama_infer
Inference script for Meta's LLaMA models using Hugging Face wrapper
☆110Updated 2 years ago
pluiez / NLLB-inference
☆57Updated 3 years ago
openlanguagedata / flores
The FLORES+ Machine Translation Benchmark
☆106Updated 8 months ago
mbzuai-nlp / bactrian-x
A Multilingual Replicable Instruction-Following Model
☆94Updated 2 years ago
vipulraheja / coedit
Official implementation of the paper "CoEdIT: Text Editing by Task-Specific Instruction Tuning" (EMNLP 2023)
☆129Updated 10 months ago
hsing-wang / Awesome-LLM-MT
☆242Updated last year
shjwudp / c4-dataset-script
Inspired by google c4, here is a series of colossal clean data cleaning scripts focused on CommonCrawl data processing. Including Chinese…
☆129Updated 2 years ago
akoksal / LongForm
Reverse Instructions to generate instruction tuning data with corpus examples
☆214Updated last year
LAION-AI / Open-Instruction-Generalist
Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks
☆208Updated last year
gmftbyGMFTBY / Copyisallyouneed
[ICLR 2023] Codebase for Copy-Generator model, including an implementation of kNN-LM
☆186Updated 6 months ago
cisnlp / Glot500
Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023
☆103Updated last year
bigscience-workshop / xmtf
Crosslingual Generalization through Multitask Finetuning
☆537Updated 10 months ago
raunak-agarwal / instruction-datasets
All available datasets for Instruction Tuning of Large Language Models
☆255Updated last year
RWKV-Wiki / MultilingualShareGPT
MultilingualShareGPT, the free multi-language corpus for LLM training
☆72Updated 2 years ago
gpt4life / alpagasus
Unofficial implementation of AlpaGasus
☆92Updated last year
mzbac / llama2-fine-tune
Scripts for fine-tuning Llama2 via SFT and DPO.
☆201Updated last year
orhonovich / unnatural-instructions
☆180Updated 2 years ago
GAIR-NLP / Entropy-ABF
Official implementation for 'Extending LLMs’ Context Window with 100 Samples'
☆78Updated last year
yxuansu / OpenAlpaca
OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA
☆302Updated 2 years ago
facebookresearch / stopes
A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…
☆282Updated 6 months ago
salesforce / DialogStudio
DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI
☆513Updated 6 months ago
facebookresearch / tart
Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.
☆163Updated last year
radi-cho / botbots
A dataset featuring diverse dialogues between two ChatGPT (gpt-3.5-turbo) instances with system messages written by GPT-4. Covering vario…
☆166Updated 2 years ago
yangkevin2 / emnlp22-re3-story-generation
☆254Updated 2 years ago
GeneZC / MiniMA
Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models"
☆101Updated last year