FreedomIntelligence / MultilingualSIFTLinks

MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning

☆94

Alternatives and similar repositories for MultilingualSIFT

Users that are interested in MultilingualSIFT are comparing it to the libraries listed below

Sorting:

gpt4life / alpagasus
Unofficial implementation of AlpaGasus
☆92Updated last year
mbzuai-nlp / bactrian-x
A Multilingual Replicable Instruction-Following Model
☆94Updated 2 years ago
nlp-uoregon / mlmm-evaluation
Multilingual Large Language Models Evaluation Benchmark
☆128Updated 11 months ago
sail-sg / sailcraft
🚢 Data Toolkit for Sailor Language Models
☆94Updated 5 months ago
TIGER-AI-Lab / MAmmoTH2
Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]
☆146Updated 9 months ago
kaistAI / LangBridge
[ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision
☆92Updated 9 months ago
fe1ixxu / CPO_SIMPO
This repository contains the joint use of CPO and SimPO method for better reference-free preference learning methods.
☆55Updated 11 months ago
nlp-uoregon / Okapi
Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback
☆97Updated last year
GAIR-NLP / Entropy-ABF
Official implementation for 'Extending LLMs’ Context Window with 100 Samples'
☆78Updated last year
hills-code / open-instruct
☆17Updated last year
LAION-AI / Open-Instruction-Generalist
Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks
☆208Updated last year
cisnlp / Glot500
Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023
☆103Updated last year
kaistAI / CoT-Collection
[EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning
☆245Updated last year
orhonovich / unnatural-instructions
☆180Updated 2 years ago
thu-coai / PICL
Code for ACL2023 paper: Pre-Training to Learn in Context
☆107Updated last year
akoksal / LongForm
Reverse Instructions to generate instruction tuning data with corpus examples
☆214Updated last year
jakespringer / echo-embeddings
☆152Updated last year
facebookresearch / tart
Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.
☆163Updated last year
qhjqhj00 / WebBrain
☆68Updated 2 years ago
GeneZC / MiniMA
Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models"
☆101Updated last year
NormXU / Consistent-DynamicNTKRoPE
An Experiment on Dynamic NTK Scaling RoPE
☆64Updated last year
dwzhu-pku / LongEmbed
LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)
☆139Updated 8 months ago
seonghyeonye / Flipped-Learning
[ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners
☆116Updated last month
Digitous / LLM-SLERP-Merge
Spherical Merge Pytorch/HF format Language Models with minimal feature loss.
☆135Updated last year
gmftbyGMFTBY / Copyisallyouneed
[ICLR 2023] Codebase for Copy-Generator model, including an implementation of kNN-LM
☆186Updated 6 months ago
kamalkraj / e5-mistral-7b-instruct
Finetune mistral-7b-instruct for sentence embeddings
☆85Updated last year
dwzhu-pku / PoSE
Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)
☆205Updated last year
salesforce / factualNLG
Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond"
☆59Updated 6 months ago
yxuansu / Contrastive_Search_Is_What_You_Need
[TMLR'23] Contrastive Search Is What You Need For Neural Text Generation
☆119Updated 2 years ago
TIGER-AI-Lab / LongICLBench
Code and Data for "Long-context LLMs Struggle with Long In-context Learning" [TMLR2025]
☆105Updated 5 months ago