microsoft / EvoPrompt
Automatic Prompt Optimization
☆28Updated 10 months ago
Alternatives and similar repositories for EvoPrompt:
Users that are interested in EvoPrompt are comparing it to the libraries listed below
- ☆50Updated 4 months ago
- Example implementation of Iteration of Tought - Gives a star if you like the project☆39Updated 3 months ago
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Updated last year
- ☆41Updated 3 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆74Updated 3 weeks ago
- LLM reads a paper and produce a working prototype☆51Updated 3 weeks ago
- Uses a Gradio interface to stream coding related responses from local and cloud based large language models. Pulls context from GitHub Re…☆21Updated 3 weeks ago
- ☆38Updated 2 months ago
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆19Updated this week
- Measuring RAG solutions throughput and latency☆16Updated 8 months ago
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆17Updated last week
- ☆24Updated 6 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆18Updated this week
- Writing Blog Posts with Generative Feedback Loops!☆47Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎☆39Updated last year
- ☆48Updated 5 months ago
- ☆20Updated last year
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆49Updated this week
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 8 months ago
- Official Code Release for "Training a Generally Curious Agent"☆20Updated last week
- Large Language Model (LLM) powered evaluator for Retrieval Augmented Generation (RAG) pipelines.☆25Updated 11 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆59Updated last year
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆53Updated 4 months ago
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".☆66Updated 9 months ago
- ☆45Updated 6 months ago
- Small, simple agent task environments for training and evaluation☆18Updated 5 months ago
- CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments☆49Updated last month
- ☆21Updated 5 months ago
- Everything for the Paper: 'Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing'☆16Updated last year
- Very minimal (and stateless) agent framework☆41Updated 2 months ago