IBM / ensemble-instruct
Codebase release for EMNLP 2023 paper publication
☆19 Updated 2 months ago
Alternatives and similar repositories for ensemble-instruct
Users who are interested in ensemble-instruct are comparing it to the libraries listed below.
- 🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data … ☆211 Updated last week
- ☆43 Updated last year
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models" ☆107 Updated 2 years ago
- Small and Efficient Mathematical Reasoning LLMs ☆72 Updated last year
- Aioli: A unified optimization framework for language model data mixing ☆28 Updated 10 months ago
- Resources related to EACL 2023 paper "SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domain… ☆52 Updated 2 years ago
- Advanced Reasoning Benchmark Dataset for LLMs ☆46 Updated 2 years ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions." ☆66 Updated 2 years ago
- Codebase accompanying the Summary of a Haystack paper. ☆79 Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers ☆135 Updated last year
- Code accompanying "How I learned to start worrying about prompt formatting". ☆110 Updated 5 months ago
- ☆23 Updated 2 years ago
- ☆58 Updated last year
- ☆44 Updated last year
- Datasets collection and preprocessings framework for NLP extreme multitask learning ☆188 Updated 4 months ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering" ☆86 Updated last year
- The official code repo for "Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations". ☆83 Updated last year
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning ☆65 Updated last year
- ☆39 Updated last year
- ☆129 Updated last year
- Open Implementations of LLM Analyses ☆107 Updated last year
- A package dedicated for running benchmark agreement testing ☆18 Updated 2 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆60 Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆50 Updated last year
- This project studies the performance and robustness of language models and task-adaptation methods. ☆154 Updated last year
- Code for "Democratizing Reasoning Ability: Tailored Learning from Large Language Model", EMNLP 2023 ☆36 Updated last year
- ☆17 Updated 7 months ago
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents ☆24 Updated 3 years ago
- Functional Benchmarks and the Reasoning Gap ☆89 Updated last year
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions ☆71 Updated 2 years ago