automl / promptolutionLinks
A unified, modular Framework for Prompt Optimization
☆108Updated 2 weeks ago
Alternatives and similar repositories for promptolution
Users that are interested in promptolution are comparing it to the libraries listed below
Sorting:
- Top papers related to LLM-based agent evaluation☆89Updated 3 months ago
- Codebase accompanying the Summary of a Haystack paper.☆80Updated last year
- LLM Attributor: Attribute LLM's Generated Text to Training Data☆72Updated 4 months ago
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker☆126Updated 3 months ago
- a curated list of the role of small models in the LLM era☆111Updated last year
- PAL: Predictive Analysis & Laws of Large Language Models☆38Updated last year
- ☆33Updated last year
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.☆47Updated 10 months ago
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆112Updated last year
- This repository serves as a comprehensive knowledge hub, curating cutting-edge research papers and developments across 25+ specialized do…☆92Updated last month
- Scrape papers from OpenReview using OpenReview API☆61Updated 11 months ago
- ☆94Updated last year
- ☆91Updated last month
- MIRIAD is a million-scale Medical Instruction and Retrieval Datatset☆142Updated 2 months ago
- Interactive coding assistant for data scientists and machine learning developers, empowered by large language models.☆99Updated last year
- Codes and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Ref…☆72Updated 11 months ago
- A toolkit for quantitative evaluation of data attribution methods.☆55Updated 6 months ago
- End-to-End Ontology Learning with Large Language Models, NeurIPS 2024.☆47Updated last year
- Attribute (or cite) statements generated by LLMs back to in-context information.☆319Updated last year
- PodGPT: An audio-augmented large language model for research and education☆58Updated last month
- This is the official repository for HypoGeniC (Hypothesis Generation in Context) and HypoRefine, which are automated, data-driven tools t…☆102Updated 2 months ago
- This repository contains ScholarQABench data and evaluation pipeline.☆94Updated 5 months ago
- Official Code Repository for the paper "Distilling LLM Agent into Small Models with Retrieval and Code Tools"☆197Updated 3 months ago
- TARGET is a benchmark for evaluating Table Retrieval for Generative Tasks such as Fact Verification and Text-to-SQL☆28Updated 6 months ago
- A package dedicated for running benchmark agreement testing☆17Updated 4 months ago
- Efficient multi-prompt evaluation of LLMs☆28Updated last year
- A lightweight, reproducible toolkit for LLM-based query reformulation.☆29Updated last month
- EvalAssist is an open-source project that simplifies using large language models as evaluators (LLM-as-a-Judge) of the output of other la…☆94Updated 2 months ago
- ☆52Updated last year
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.☆156Updated 2 years ago