Efficient multi-prompt evaluation of LLMs
☆31 · Dec 6, 2024 · Updated last year
Alternatives and similar repositories for prompteval
Users who are interested in prompteval are comparing it to the libraries listed below.
- ☆12 · Feb 15, 2021 · Updated 5 years ago
- The collection of related papers and resources for the paper Time Series Analysis for Education: Methods, Applications, and Future Direct… ☆18 · Apr 12, 2025 · Updated 11 months ago
- End-to-End Ontology Learning with Large Language Models, NeurIPS 2024. ☆51 · Nov 6, 2024 · Updated last year
- ☆12 · Nov 2, 2021 · Updated 4 years ago
- simulate linkstate algorithm for routing ☆10 · Nov 6, 2023 · Updated 2 years ago
- Evaluating LLMs with fewer examples ☆172 · Apr 12, 2024 · Updated last year
- Repository for the paper: Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning ☆18 · Feb 21, 2025 · Updated last year
- 🗜️Codebase of the ACIP algorithm 🗜️ ☆17 · Feb 11, 2026 · Updated last month
- ☆11 · May 18, 2025 · Updated 10 months ago
- ☆10 · Nov 15, 2023 · Updated 2 years ago
- Conceptual Construct Representations ☆11 · Feb 23, 2023 · Updated 3 years ago
- Codebase for character-centric story understanding ☆14 · Jan 20, 2022 · Updated 4 years ago
- ScienceMeter: Tracking Scientific Knowledge Updates in Language Models ☆17 · Jun 28, 2025 · Updated 9 months ago
- This repository contains an implementation of the simple yet powerful state machine agentic algorithm. ☆22 · Sep 29, 2025 · Updated 6 months ago
- Rahnema Final Project - Network anomaly detection ☆11 · Jul 22, 2021 · Updated 4 years ago
- Use shecan in bash with ease ☆15 · Feb 8, 2019 · Updated 7 years ago
- Starter repo for regl explorations ☆10 · May 26, 2017 · Updated 8 years ago
- compiler project for compiler course (spring 99) in sbu university ☆13 · Nov 21, 2023 · Updated 2 years ago
- ☆11 · Mar 12, 2021 · Updated 5 years ago
- ☆16 · Jul 11, 2023 · Updated 2 years ago
- This is a list of Persian foods ☆13 · Oct 1, 2020 · Updated 5 years ago
- Resolving Knowledge Conflicts in Large Language Models, COLM 2024 ☆18 · Oct 7, 2025 · Updated 5 months ago
- Helm chart for tile38 ☆15 · Feb 25, 2026 · Updated last month
- https://openreview.net/forum?id=OC1o4_OI6Jw ☆13 · May 27, 2022 · Updated 3 years ago
- CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation ☆14 · Aug 19, 2025 · Updated 7 months ago
- An original implementation of the paper "CREPE: Open-Domain Question Answering with False Presuppositions" ☆16 · Nov 5, 2024 · Updated last year
- Privateer is a plugin-based framework for security & compliance evaluations. ☆19 · Mar 21, 2026 · Updated last week
- Generating graph structures from OWL ontologies ☆12 · Nov 21, 2017 · Updated 8 years ago
- Sharif-AI-Challenge2021 Client ☆11 · Aug 20, 2021 · Updated 4 years ago
- PatientSim: A Persona-Driven Simulator for Realistic Doctor-Patient Interactions (NeurIPS 2025 D&B track, Spotlight) ☆25 · Feb 11, 2026 · Updated last month
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents ☆24 · May 24, 2022 · Updated 3 years ago
- [NAACL 2024] Topics, Authors, and Institutions in Large Language Model Research: Trends from 17K arXiv Papers https://arxiv.org/abs/2307.… ☆17 · Jan 27, 2024 · Updated 2 years ago
- This is the source code for "Dream On". An indie game planned to be released in Fall 2021. ☆10 · Aug 19, 2021 · Updated 4 years ago
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent." ☆108 · Nov 11, 2024 · Updated last year
- learn most important part of docker fast and easy ☆16 · May 5, 2020 · Updated 5 years ago
- Clinical NLP concept extraction of ADEs in the 2018 n2c2 Adverse Drug Events and Medication Extraction (Track 2). Includes data preproce… ☆16 · Nov 21, 2020 · Updated 5 years ago
- The CSCS ReFrame test suite ☆16 · Mar 18, 2026 · Updated last week
- Quantifying Uncertainty in Deep Spatiotemporal Forecasting ☆12 · Mar 20, 2021 · Updated 5 years ago
- Knowledge-Guided Adaptation of Pathology Foundation Models Improves Cross-domain Generalization and Demographic Fairness ☆17 · Oct 14, 2025 · Updated 5 months ago