microsoft / EvoPrompt
Automatic Prompt Optimization
☆25Updated 8 months ago
Alternatives and similar repositories for EvoPrompt:
Users that are interested in EvoPrompt are comparing it to the libraries listed below
- LLM reads a paper and produce a working prototype☆48Updated last month
- ☆47Updated 2 months ago
- Codebase accompanying the Summary of a Haystack paper.☆74Updated 4 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 6 months ago
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀☆54Updated 3 months ago
- ☆48Updated 2 months ago
- Writing Blog Posts with Generative Feedback Loops!☆47Updated 10 months ago
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆17Updated this week
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆68Updated last month
- Example implementation of Iteration of Tought - Gives a star if you like the project☆37Updated last month
- ☆35Updated last week
- Evaluating LLMs with CommonGen-Lite☆88Updated 10 months ago
- ☆44Updated 4 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆53Updated 5 months ago
- ☆14Updated 4 months ago
- Uses a Gradio interface to stream coding related responses from local and cloud based large language models. Pulls context from GitHub Re…☆19Updated 4 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆79Updated 3 months ago
- ☆60Updated 3 months ago
- ☆57Updated last year
- Track the progress of LLM context utilisation☆53Updated 6 months ago
- Streamlit app for recommending eval functions using prompt diffs☆27Updated last year
- ☆24Updated 3 months ago
- ☆45Updated 9 months ago
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!☆51Updated this week
- RAG example using DSPy, Gradio, FastAPI☆72Updated 9 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated 2 months ago
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆14Updated 10 months ago
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆20Updated last month
- Official homepage for "Self-Harmonized Chain of Thought"☆89Updated last week