IBM / ensemble-instruct
Codebase release for EMNLP 2023 paper publication
☆19 Updated 4 months ago
Alternatives and similar repositories for ensemble-instruct
Users who are interested in ensemble-instruct are comparing it to the libraries listed below.
- Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data … ☆212 Updated last week
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions." ☆66 Updated 2 years ago
- ☆43 Updated last year
- This project studies the performance and robustness of language models and task-adaptation methods. ☆155 Updated last year
- ☆129 Updated last year
- Advanced Reasoning Benchmark Dataset for LLMs ☆47 Updated 2 years ago
- Aioli: A unified optimization framework for language model data mixing ☆32 Updated last year
- Functional Benchmarks and the Reasoning Gap ☆89 Updated last year
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering" ☆87 Updated last year
- Dataset collection and preprocessing framework for NLP extreme multitask learning ☆191 Updated 6 months ago
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models" ☆107 Updated 2 years ago
- [Data + code] ExpertQA: Expert-Curated Questions and Attributed Answers ☆136 Updated last year
- ☆59 Updated last year
- Small and Efficient Mathematical Reasoning LLMs ☆73 Updated 2 years ago
- Minimal PyTorch implementation of BM25 (with sparse tensors) ☆104 Updated 3 months ago
- Evaluating LLMs with CommonGen-Lite ☆93 Updated last year
- ☆38 Updated last year
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners ☆116 Updated 7 months ago
- Codebase accompanying the Summary of a Haystack paper. ☆80 Updated last year
- Retrieval Augmented Generation Generalized Evaluation Dataset ☆60 Updated 6 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832). ☆82 Updated last year
- Scripts for generating synthetic finetuning data for reducing sycophancy. ☆119 Updated 2 years ago
- Code accompanying "How I learned to start worrying about prompt formatting". ☆113 Updated 7 months ago
- ☆78 Updated 2 years ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆61 Updated last year
- ☆77 Updated last year
- Code repository for the c-BTM paper ☆108 Updated 2 years ago
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`. ☆156 Updated 2 years ago
- Code for "Democratizing Reasoning Ability: Tailored Learning from Large Language Model", EMNLP 2023 ☆36 Updated 2 years ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la… ☆49 Updated 2 years ago