felipemaiapolo / prompteval
Efficient multi-prompt evaluation of LLMs
☆14Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for prompteval
- Code for Language-Interfaced FineTuning for Non-Language Machine Learning Tasks.☆120Updated this week
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning☆21Updated 11 months ago
- Using Explanations as a Tool for Advanced LLMs☆50Updated 2 months ago
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)☆79Updated last year
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; arXiv preprint arXiv:2403.…☆36Updated 4 months ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆78Updated 8 months ago
- The Efficiency Spectrum of LLM☆52Updated 11 months ago
- Official implementation of Privacy Implications of Retrieval-Based Language Models (EMNLP 2023). https://arxiv.org/abs/2305.14888☆36Updated 5 months ago
- A Closer Look into Mixture-of-Experts in Large Language Models☆39Updated 3 months ago
- Google Research☆45Updated 2 years ago
- Codebase for Instruction Following without Instruction Tuning☆30Updated last month
- ☆50Updated last year
- Official Repository for Dataset Inference for LLMs☆23Updated 3 months ago
- Learning adapter weights from task descriptions☆15Updated last year
- ☆21Updated last week
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆48Updated 7 months ago
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"☆109Updated last year
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆39Updated 4 months ago
- ☆14Updated 8 months ago
- [ACL 2024] Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning☆30Updated 3 months ago
- Lightweight tool to identify Data Contamination in LLMs evaluation☆40Updated 8 months ago
- Data and code for the Corr2Cause paper (ICLR 2024)☆86Updated 6 months ago
- [ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf☆22Updated 11 months ago
- Implementation of Influence Function approximations for differently sized ML models, using PyTorch☆15Updated last year
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models☆41Updated last year
- AutoPEFT: Automatic Configuration Search for Parameter-Efficient Fine-Tuning (Zhou et al.; TACL)☆42Updated 7 months ago
- InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆51Updated 3 weeks ago
- AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers☆42Updated 2 years ago
- ☆40Updated 2 years ago
- Evaluation of neuro-symbolic engines☆33Updated 3 months ago