IBM / ensemble-instructLinks
codebase release for EMNLP2023 paper publication
β19Updated 3 months ago
Alternatives and similar repositories for ensemble-instruct
Users that are interested in ensemble-instruct are comparing it to the libraries listed below
Sorting:
- π¦ Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data β¦β212Updated this week
- Datasets collection and preprocessings framework for NLP extreme multitask learningβ189Updated 6 months ago
- Advanced Reasoning Benchmark Dataset for LLMsβ47Updated 2 years ago
- Code for Zero-Shot Tokenizer Transferβ142Updated 11 months ago
- Small and Efficient Mathematical Reasoning LLMsβ73Updated last year
- β43Updated last year
- Large-language Model Evaluation framework with Elo Leaderboard and A-B testingβ52Updated last year
- Open Implementations of LLM Analysesβ107Updated last year
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."β66Updated 2 years ago
- A package dedicated for running benchmark agreement testingβ18Updated 3 months ago
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"β107Updated 2 years ago
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learnersβ116Updated 6 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answersβ137Updated last year
- This project studies the performance and robustness of language models and task-adaptation methods.β155Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 laβ¦β49Updated 2 years ago
- β17Updated 9 months ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"β87Updated last year
- Aioli: A unified optimization framework for language model data mixingβ32Updated 11 months ago
- Resources related to EACL 2023 paper "SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domainβ¦β52Updated 2 years ago
- β129Updated last year
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Modelβ45Updated 3 months ago
- Codebase accompanying the Summary of a Haystack paper.β80Updated last year
- Evaluating LLMs with CommonGen-Liteβ93Updated last year
- SILO Language Models code repositoryβ83Updated last year
- β59Updated last year
- Scalable Meta-Evaluation of LLMs as Evaluatorsβ43Updated last year
- β44Updated last year
- Mixing Language Models with Self-Verification and Meta-Verificationβ111Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignmentβ62Updated last year
- β38Updated last year