for-ai / language-confusion
Repository for the "Understanding and Mitigating Language Confusion in LLMs" paper
☆19Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for language-confusion
- ☆38Updated 7 months ago
- ☆46Updated this week
- ☆71Updated 6 months ago
- Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond"☆59Updated 7 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆41Updated 9 months ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆43Updated 3 months ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆41Updated 10 months ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆58Updated 3 months ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆44Updated last year
- ☆73Updated last year
- ☆44Updated last year
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆39Updated 10 months ago
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners☆111Updated 2 months ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆78Updated 3 months ago
- Apps built using Inspired Cognition's Critique.☆58Updated last year
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆56Updated 5 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆122Updated 8 months ago
- ☆29Updated 9 months ago
- ☆31Updated last year
- ☆97Updated 2 years ago
- ☆112Updated last month
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"☆28Updated 2 years ago
- SILO Language Models code repository☆80Updated 8 months ago
- Evaluation pipeline for the BabyLM Challenge 2023.☆72Updated last year
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆56Updated last year
- A unified benchmark for math reasoning☆87Updated last year
- ☆55Updated last year
- Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"☆73Updated last year
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages☆37Updated last month
- 🌾 Universal, customizable and deployable fine-grained evaluation for text generation.☆22Updated last year