IBM / ensemble-instructLinks
codebase release for EMNLP2023 paper publication
β19Updated last month
Alternatives and similar repositories for ensemble-instruct
Users that are interested in ensemble-instruct are comparing it to the libraries listed below
Sorting:
- π¦ Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data β¦β211Updated last week
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"β107Updated 2 years ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answersβ135Updated last year
- Advanced Reasoning Benchmark Dataset for LLMsβ46Updated last year
- Open Implementations of LLM Analysesβ107Updated last year
- β39Updated last year
- Codebase accompanying the Summary of a Haystack paper.β79Updated last year
- Aioli: A unified optimization framework for language model data mixingβ28Updated 9 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignmentβ60Updated last year
- Scalable Meta-Evaluation of LLMs as Evaluatorsβ42Updated last year
- β128Updated last year
- This project studies the performance and robustness of language models and task-adaptation methods.β154Updated last year
- Code for Zero-Shot Tokenizer Transferβ138Updated 9 months ago
- β58Updated last year
- Evaluating LLMs with CommonGen-Liteβ91Updated last year
- Small and Efficient Mathematical Reasoning LLMsβ72Updated last year
- Lightweight demos for finetuning LLMs. Powered by π€ transformers and open-source datasets.β78Updated last year
- Code accompanying "How I learned to start worrying about prompt formatting".β112Updated 4 months ago
- Datasets collection and preprocessings framework for NLP extreme multitask learningβ188Updated 3 months ago
- State-of-the-art paired encoder and decoder models (17M-1B params)β52Updated 2 months ago
- β43Updated last year
- A package dedicated for running benchmark agreement testingβ18Updated last month
- [NeurIPS 2023] PyTorch code for Can Language Models Teach? Teacher Explanations Improve Student Performance via Theory of Mindβ66Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 laβ¦β49Updated last year
- Minimum Bayes Risk Decoding for Hugging Face Transformersβ60Updated last year
- Code for "Democratizing Reasoning Ability: Tailored Learning from Large Language Model", EMNLP 2023β36Updated last year
- Retrieval Augmented Generation Generalized Evaluation Datasetβ57Updated 3 months ago
- β65Updated 2 years ago
- Embedding Recycling for Language modelsβ38Updated 2 years ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"β86Updated last year