FreedomIntelligence / MultilingualSIFTLinks
MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning
☆93Updated last year
Alternatives and similar repositories for MultilingualSIFT
Users that are interested in MultilingualSIFT are comparing it to the libraries listed below
Sorting:
- A Multilingual Replicable Instruction-Following Model☆94Updated 2 years ago
- Unofficial implementation of AlpaGasus☆92Updated last year
- Multilingual Large Language Models Evaluation Benchmark☆126Updated 10 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆145Updated 8 months ago
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆244Updated last year
- Reverse Instructions to generate instruction tuning data with corpus examples☆214Updated last year
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆90Updated 8 months ago
- This repository contains the joint use of CPO and SimPO method for better reference-free preference learning methods.☆53Updated 11 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆79Updated last year
- Finetune mistral-7b-instruct for sentence embeddings☆85Updated last year
- [ICLR 2023] Codebase for Copy-Generator model, including an implementation of kNN-LM☆186Updated 5 months ago
- ☆17Updated last year
- Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.☆180Updated 2 years ago
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks☆209Updated last year
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.☆163Updated last year
- 🚢 Data Toolkit for Sailor Language Models☆93Updated 4 months ago
- The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1…☆176Updated 6 months ago
- Code for ACL2023 paper: Pre-Training to Learn in Context☆107Updated 11 months ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆103Updated last year
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners☆116Updated 2 weeks ago
- DSIR large-scale data selection framework for language model training☆252Updated last year
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆138Updated 8 months ago
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback☆97Updated last year
- Benchmarking LLMs with Challenging Tasks from Real Users☆228Updated 8 months ago
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023☆70Updated last year
- ☆180Updated 2 years ago
- Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond"☆59Updated 5 months ago
- All available datasets for Instruction Tuning of Large Language Models☆254Updated last year
- ☆181Updated last week
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆162Updated 3 weeks ago