JRC1995 / ZeroPromptSearch
Implementation of an LLM prompting pipeline combined with wrappers for auto-decomposing reasoning steps and for searching the reasoning-step space (e.g., by beam search or MCTS), guided by self-evaluation.
☆15 Updated last year
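As a rough illustration of the idea in the description (searching over chains of reasoning steps, guided by a score), here is a minimal beam-search sketch. It is not taken from the repository: `beam_search`, `propose`, `score`, and `is_done` are hypothetical names, and the toy functions stand in for what would presumably be LLM calls (step generation and self-evaluation) in a real pipeline.

```python
from typing import Callable, List

Chain = List[str]  # a partial chain of reasoning steps


def beam_search(
    propose: Callable[[Chain], List[str]],  # hypothetical: candidate next steps
    score: Callable[[Chain], float],        # hypothetical: self-evaluation score
    is_done: Callable[[Chain], bool],       # hypothetical: chain is complete
    beam_width: int = 2,
    max_steps: int = 5,
) -> Chain:
    """Keep the `beam_width` best partial chains at each depth."""
    beams: List[Chain] = [[]]
    for _ in range(max_steps):
        candidates: List[Chain] = []
        for chain in beams:
            if is_done(chain):
                # Finished chains are carried forward unchanged.
                candidates.append(chain)
                continue
            for step in propose(chain):
                candidates.append(chain + [step])
        if not candidates:
            break
        # Rank by score; Python's sort is stable, so ties keep insertion order.
        beams = sorted(candidates, key=score, reverse=True)[:beam_width]
        if all(is_done(chain) for chain in beams):
            break
    return beams[0]


# Toy stand-ins for the LLM-backed functions (assumptions for illustration only).
TARGET = ["a", "b", "c"]


def toy_propose(chain: Chain) -> List[str]:
    return ["a", "b", "c"]


def toy_score(chain: Chain) -> float:
    # Count positions where the chain agrees with the target sequence.
    return float(sum(1 for got, want in zip(chain, TARGET) if got == want))


def toy_is_done(chain: Chain) -> bool:
    return len(chain) >= len(TARGET)


best = beam_search(toy_propose, toy_score, toy_is_done, beam_width=2)
print(best)
```

With an LLM in the loop, `propose` would sample several candidate next steps and `score` would ask the model to rate a partial chain; MCTS would replace the fixed-width pruning with tree statistics.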
Alternatives and similar repositories for ZeroPromptSearch
Users who are interested in ZeroPromptSearch are comparing it to the repositories listed below.
- The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers" ☆19 Updated last year
- Code repo for MathAgent ☆17 Updated last year
- [AAAI 2025] Augmenting Math Word Problems via Iterative Question Composing (https://arxiv.org/abs/2401.09003) ☆20 Updated 9 months ago
- [NAACL 2024] TabSQLify: Enhancing Reasoning Capabilities of LLMs Through Table Decomposition ☆16 Updated 5 months ago
- This is the implementation for the paper "LARGE LANGUAGE MODEL CASCADES WITH MIXTURE OF THOUGHT REPRESENTATIONS FOR COST-EFFICIENT REA… ☆26 Updated last year
- Codes for the EMNLP 2023 Findings paper "Self-Polish: Enhance Reasoning in Large Language Models via Problem Refining" by Zhiheng Xi, Sen… ☆30 Updated 2 years ago
- Official repository for ODQA experiments from Decomposed Prompting: A Modular Approach for Solving Complex Tasks, ICLR23 ☆11 Updated 2 years ago
- Scalable Meta-Evaluation of LLMs as Evaluators ☆42 Updated last year
- ☆31 Updated last year
- Code for paper Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding ☆81 Updated last year
- An implementation for MetGen: A Module-Based Entailment Tree Generation Framework for Answer Explanation. ☆13 Updated 3 years ago
- Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting" ☆77 Updated last year
- [EMNLP'24 (Main)] DRPO (Dynamic Rewarding with Prompt Optimization) is a tuning-free approach for self-alignment. DRPO leverages a search-… ☆24 Updated 10 months ago
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collaborative Search Agent with Large Language Models ☆28 Updated last year
- Contrastive Chain-of-Thought Prompting ☆68 Updated last year
- Code for Benchmarking Language Model Agents for Data-Driven Science ☆31 Updated 11 months ago
- [NAACL 2024] Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models ☆86 Updated last year
- StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback ☆68 Updated last year
- 🌟 SwarmAgent: A framework for simulating social group dynamics using multi-agent collaboration, aiding insights into collective behavior… ☆12 Updated last year
- An approach to perform RAG while taking into account the dynamic evolution of the data. Helpful to detect emerging trends in the data ☆29 Updated last year
- ☆18 Updated last year
- [NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M… ☆29 Updated last year
- This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses". ☆30 Updated last year
- DataSciBench: An LLM Agent Benchmark for Data Science ☆33 Updated 3 weeks ago
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models ☆97 Updated last year
- Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner ☆28 Updated last year
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models… ☆36 Updated last year
- Evaluation on Logical Reasoning and Abstract Reasoning Challenges ☆28 Updated 5 months ago
- Minimal implementation of the "Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models" paper (arXiv:2401.01335) ☆29 Updated last year
- This is the code for reproducing the TABBIE baseline in our paper: "Retrieval-Based Transformer for Table Augmentation" ☆12 Updated last week