Cohere-Labs-Community / language-confusionLinks
Repository for the "Understanding and Mitigating Language Confusion in LLMs" paper
☆28Updated last year
Alternatives and similar repositories for language-confusion
Users that are interested in language-confusion are comparing it to the libraries listed below
Sorting:
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners☆116Updated 2 months ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆44Updated last year
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages☆49Updated 3 weeks ago
- ☆75Updated last year
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆93Updated 10 months ago
- ☆22Updated 3 weeks ago
- ☆127Updated 11 months ago
- SILO Language Models code repository☆81Updated last year
- ☆39Updated last year
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆32Updated last year
- Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators"☆12Updated 5 months ago
- [𝐄𝐌𝐍𝐋𝐏 𝐅𝐢𝐧𝐝𝐢𝐧𝐠𝐬 𝟐𝟎𝟐𝟒 & 𝐀𝐂𝐋 𝟐𝟎𝟐𝟒 𝐍𝐋𝐑𝐒𝐄 𝐎𝐫𝐚𝐥] 𝘌𝘯𝘩𝘢𝘯𝘤𝘪𝘯𝘨 𝘔𝘢𝘵𝘩𝘦𝘮𝘢𝘵𝘪𝘤𝘢𝘭 𝘙𝘦𝘢𝘴𝘰𝘯𝘪𝘯…☆52Updated last year
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization☆29Updated 11 months ago
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆81Updated last year
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation☆64Updated 4 months ago
- ☆46Updated 5 months ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆99Updated 2 years ago
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆46Updated last year
- ☆97Updated 10 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆75Updated last year
- Code for Zero-Shot Tokenizer Transfer☆135Updated 7 months ago
- Language models scale reliably with over-training and on downstream tasks☆98Updated last year
- Long Context Extension and Generalization in LLMs☆59Updated 11 months ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆88Updated last year
- Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"☆77Updated 2 years ago
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models☆62Updated last year
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated last year
- The official code of TACL 2021, "Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies".☆77Updated 2 years ago
- ☆27Updated last year