IBM / ensemble-instructLinks
codebase release for EMNLP2023 paper publication
β19Updated 4 months ago
Alternatives and similar repositories for ensemble-instruct
Users that are interested in ensemble-instruct are comparing it to the libraries listed below
Sorting:
- π¦ Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data β¦β208Updated this week
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"β107Updated last year
- Small and Efficient Mathematical Reasoning LLMsβ71Updated last year
- Advanced Reasoning Benchmark Dataset for LLMsβ47Updated last year
- Codebase accompanying the Summary of a Haystack paper.β79Updated 11 months ago
- A package dedicated for running benchmark agreement testingβ18Updated 4 months ago
- Aioli: A unified optimization framework for language model data mixingβ27Updated 8 months ago
- Datasets collection and preprocessings framework for NLP extreme multitask learningβ186Updated 2 months ago
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agentsβ24Updated 3 years ago
- β57Updated 11 months ago
- Evaluating LLMs with CommonGen-Liteβ91Updated last year
- Functional Benchmarks and the Reasoning Gapβ88Updated 11 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answersβ133Updated last year
- β23Updated 2 years ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignmentβ60Updated last year
- Code for Zero-Shot Tokenizer Transferβ137Updated 8 months ago
- β43Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)β75Updated 10 months ago
- This project studies the performance and robustness of language models and task-adaptation methods.β151Updated last year
- β54Updated 10 months ago
- β127Updated 11 months ago
- Open Implementations of LLM Analysesβ106Updated 11 months ago
- Retrieval Augmented Generation Generalized Evaluation Datasetβ55Updated 2 months ago
- Lightweight demos for finetuning LLMs. Powered by π€ transformers and open-source datasets.β78Updated 10 months ago
- Code accompanying "How I learned to start worrying about prompt formatting".β111Updated 3 months ago
- Resources related to EACL 2023 paper "SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domainβ¦β52Updated 2 years ago
- Using open source LLMs to build synthetic datasets for direct preference optimizationβ65Updated last year
- The official code repo for "Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations".β84Updated last year
- β48Updated last year
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."β64Updated 2 years ago