Efficient multi-prompt evaluation of LLMs
☆28Dec 6, 2024Updated last year
Alternatives and similar repositories for prompteval
Users that are interested in prompteval are comparing it to the libraries listed below
Sorting:
- The collection of related papers and resources for the paper Time Series Analysis for Education: Methods, Applications, and Future Direct…☆18Apr 12, 2025Updated 10 months ago
- Evaluate uncertainty, calibration, accuracy, and fairness of LLMs on real-world survey data!☆26Dec 14, 2025Updated 2 months ago
- The official GitHub page for paper "NegativePrompt: Leveraging Psychology for Large Language Models Enhancement via Negative Emotional St…☆25May 10, 2024Updated last year
- Official implementation of "BERTs are Generative In-Context Learners"☆32Mar 14, 2025Updated 11 months ago
- Create and deploy virtual-experiments - co-processing computational workflows☆10Jan 28, 2026Updated last month
- ☆31Jul 14, 2023Updated 2 years ago
- PARADIS, a lightweight and flexible weather forecast model that tries to Keep It Simple.☆26Feb 4, 2026Updated last month
- Beyond Vibe Coding. Code, Planning, Documentation and Product Management agents.☆70Feb 20, 2026Updated 2 weeks ago
- ext_mpi_collectives☆11Apr 1, 2025Updated 11 months ago
- Deep Transfer Learning codes using Google TensorFlow☆13Apr 4, 2016Updated 9 years ago
- This code repository is the source code of the paper "Deep Long-Range Spatiotemporal Dependency Synthetic Minority Oversampling Technique…☆12Nov 21, 2025Updated 3 months ago
- Memory Topology for GPUs☆18Updated this week
- The Forward-Forward Algorithm for Drug Discovery☆33Dec 30, 2022Updated 3 years ago
- Official PyTorch code for UAI 2023 paper "Concurrent Misclassification and Out-of-Distribution Detection for Semantic Segmentation via En…☆12Nov 10, 2023Updated 2 years ago
- https://icml.cc/virtual/2023/poster/24354☆10Aug 15, 2023Updated 2 years ago
- Conceptual Construct Representations☆11Feb 23, 2023Updated 3 years ago
- A longitudinal dataset for academic literature, including papers, metadata, and citation graphs, Also available on 🤗 HuggingFace and Kag…☆16Sep 6, 2025Updated 6 months ago
- AI agent skill for building modern, composable, and accessible React UI components following the components.build specification☆42Jan 28, 2026Updated last month
- Argonne Leadership Computing Facility OpenCL tutorial☆10Aug 22, 2025Updated 6 months ago
- Code for paper "Beyond Closure Models: Learning Chaotic Systems via Physics-Informed Neural Operators".☆14Dec 24, 2025Updated 2 months ago
- GPU based 2D elastic FWI☆12Mar 6, 2018Updated 8 years ago
- Reading comprehension based question-answering model for news articles.☆11Jun 22, 2022Updated 3 years ago
- Single-Source Domain Generalization for Bearing Fault Diagnosis Using Feature-Augmented Adaptive Neuro-Fuzzy Inference System☆11Apr 13, 2024Updated last year
- Performance Counter Reader☆11Sep 14, 2022Updated 3 years ago
- PatientSim: A Persona-Driven Simulator for Realistic Doctor-Patient Interactions (NeurIPS 2025 D&B track, Spotlight)☆24Feb 11, 2026Updated 3 weeks ago
- How to build an ACP compliant agent that uses MCP as well!☆11May 6, 2025Updated 10 months ago
- Twisted Convolutional Networks (TCNs)☆10Dec 9, 2025Updated 3 months ago
- ☆10Oct 2, 2024Updated last year
- Knowledge-Guided Adaptation of Pathology Foundation Models Improves Cross-domain Generalization and Demographic Fairness☆17Oct 14, 2025Updated 4 months ago
- Enhancing the convergence speed by 2x and improving the training success of Physics-Informed Neural Networks (PINNs).☆13Oct 14, 2024Updated last year
- ☆11Mar 12, 2021Updated 4 years ago
- Contest solution for 数境创新大赛-新能源储能系统电池容量预测☆13Mar 16, 2024Updated last year
- MDLText☆12Jul 13, 2017Updated 8 years ago
- VAE+GAN☆10Apr 18, 2018Updated 7 years ago
- Continuum Dynamics Evaluation and Test Suite☆15Aug 29, 2017Updated 8 years ago
- ☆15Feb 9, 2026Updated last month
- Code of our paper "Method-Level Bug Severity Prediction using Source Code Metrics and LLMs" which is accepted to ISSRE 2023.☆10Nov 12, 2023Updated 2 years ago
- Python routines for parallel analysis of large MITgcm simulations☆12Jun 23, 2016Updated 9 years ago
- Dependencies Upgrade with multi-agents (CrewAI & Langgraph)☆11Sep 9, 2024Updated last year