FreedomIntelligence / MultilingualSIFT
MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning
☆94 Updated last year
Alternatives and similar repositories for MultilingualSIFT
Users who are interested in MultilingualSIFT are comparing it to the libraries listed below:
- Unofficial implementation of AlpaGasus ☆92 Updated last year
- A Multilingual Replicable Instruction-Following Model ☆94 Updated 2 years ago
- Multilingual Large Language Models Evaluation Benchmark ☆128 Updated 11 months ago
- 🚢 Data Toolkit for Sailor Language Models ☆94 Updated 5 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆146 Updated 9 months ago
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision ☆92 Updated 9 months ago
- This repository contains the joint use of CPO and SimPO method for better reference-free preference learning methods. ☆55 Updated 11 months ago
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback ☆97 Updated last year
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples' ☆78 Updated last year
- ☆17 Updated last year
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks ☆208 Updated last year
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023 ☆103 Updated last year
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning ☆245 Updated last year
- ☆180 Updated 2 years ago
- Code for ACL2023 paper: Pre-Training to Learn in Context ☆107 Updated last year
- Reverse Instructions to generate instruction tuning data with corpus examples ☆214 Updated last year
- ☆152 Updated last year
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al. ☆163 Updated last year
- ☆68 Updated 2 years ago
- Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models" ☆101 Updated last year
- An Experiment on Dynamic NTK Scaling RoPE ☆64 Updated last year
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024) ☆139 Updated 8 months ago
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners ☆116 Updated last month
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss. ☆135 Updated last year
- [ICLR 2023] Codebase for Copy-Generator model, including an implementation of kNN-LM ☆186 Updated 6 months ago
- Finetune mistral-7b-instruct for sentence embeddings ☆85 Updated last year
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024) ☆205 Updated last year
- Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond" ☆59 Updated 6 months ago
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation ☆119 Updated 2 years ago
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning" [TMLR2025] ☆105 Updated 5 months ago