felipemaiapolo / prompteval
Efficient multi-prompt evaluation of LLMs
☆17 Updated last month
Alternatives and similar repositories for prompteval:
Users who are interested in prompteval are comparing it to the libraries listed below.
- In-BoXBART: Get Instructions into Biomedical Multi-task Learning ☆14 Updated 2 years ago
- Adding new tasks to T0 without catastrophic forgetting ☆32 Updated 2 years ago
- ☆14 Updated 10 months ago
- Official Repository for Dataset Inference for LLMs ☆27 Updated 5 months ago
- The Codebase for Causal Distillation for Language Models (NAACL '22) ☆25 Updated 2 years ago
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award … ☆36 Updated 2 months ago
- [NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Lea… ☆72 Updated 11 months ago
- This repository contains some of the code used in the paper "Training Language Models with Language Feedback at Scale" ☆27 Updated last year
- [ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf ☆22 Updated last year
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible. ☆38 Updated 9 months ago
- Code for Benchmarking Language Model Agents for Data-Driven Science ☆22 Updated 2 months ago
- ☆26 Updated 6 months ago
- NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models ☆10 Updated last year
- Embedding Recycling for Language models ☆38 Updated last year
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024) ☆41 Updated 3 weeks ago
- We enable LLM with personalization capability ☆10 Updated last year
- ☆67 Updated 3 months ago
- This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles". ☆18 Updated 3 months ago
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding. ☆30 Updated last month
- [ACL 2024 Findings] This is the code for our paper "Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation wi… ☆36 Updated 6 months ago
- Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models" ☆53 Updated last year
- ☆37 Updated 6 months ago
- Tree prompting: easy-to-use scikit-learn interface for improved prompting. ☆35 Updated last year
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models ☆43 Updated last year
- ☆21 Updated 2 months ago
- ☆35 Updated last year
- Code/data for MARG (multi-agent review generation) ☆36 Updated 2 months ago
- [ACL 2024] Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning ☆34 Updated 5 months ago
- Data Valuation on In-Context Examples (ACL23) ☆23 Updated last week
- [ICML 2023] Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning ☆39 Updated last year