IBM / ensemble-instruct
Codebase release for the EMNLP 2023 paper publication
☆19 Updated 3 months ago
Alternatives and similar repositories for ensemble-instruct
Users who are interested in ensemble-instruct compare it to the libraries listed below.
- 🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data … ☆212 Updated last week
- Codebase accompanying the Summary of a Haystack paper. ☆79 Updated last year
- Datasets collection and preprocessings framework for NLP extreme multitask learning ☆189 Updated 5 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832). ☆80 Updated last year
- This project studies the performance and robustness of language models and task-adaptation methods. ☆155 Updated last year
- ☆43 Updated last year
- Advanced Reasoning Benchmark Dataset for LLMs ☆47 Updated 2 years ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆60 Updated last year
- Small and Efficient Mathematical Reasoning LLMs ☆72 Updated last year
- [Data + code] ExpertQA: Expert-Curated Questions and Attributed Answers ☆135 Updated last year
- ☆20 Updated 8 months ago
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models" ☆107 Updated 2 years ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆78 Updated last year
- ☆129 Updated last year
- ☆17 Updated 8 months ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions." ☆66 Updated 2 years ago
- A package dedicated for running benchmark agreement testing ☆18 Updated 3 months ago
- Functional Benchmarks and the Reasoning Gap ☆90 Updated last year
- Aioli: A unified optimization framework for language model data mixing ☆31 Updated 11 months ago
- Code for Zero-Shot Tokenizer Transfer ☆142 Updated 11 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search ☆102 Updated last year
- ☆55 Updated last year
- Open Implementations of LLM Analyses ☆108 Updated last year
- Plug-and-play Search Interfaces with Pyserini and Hugging Face ☆32 Updated 2 years ago
- Code for NeurIPS LLM Efficiency Challenge ☆59 Updated last year
- Code for "Democratizing Reasoning Ability: Tailored Learning from Large Language Model", EMNLP 2023 ☆36 Updated 2 years ago
- Code accompanying "How I learned to start worrying about prompt formatting". ☆112 Updated 6 months ago
- ☆38 Updated last year
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he… ☆31 Updated 2 years ago
- ☆58 Updated last year