stan-anony / Zero-shot-EoT-Prompting

Zero-Shot Chain-of-Thought Reasoning Guided by Evolutionary Algorithms in Large Language Models

☆13

Alternatives and similar repositories for Zero-shot-EoT-Prompting:

Users that are interested in Zero-shot-EoT-Prompting are comparing it to the libraries listed below

jwhj / OREO
☆109Updated 3 months ago
microsoft / competeai
[ICML 2024 Oral] A framework for society simulation that supports complex simulation, for example: multi-scene.
☆71Updated 9 months ago
NingMiao / SelfCheck
Code for the paper <SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning>
☆49Updated last year
microsoft / SmartPlay
SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. …
☆136Updated last year
siyuyuan / evoagent
Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"
☆96Updated 6 months ago
Yu-Fangxu / FoR
[ICML 2025] Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples
☆85Updated last month
jxhuang0508 / Awesome-LLM-Reasoning-OpenAI-o1
Awesome LLM papers, news and projects about learning to reason with LLM, OpenAI o1, reasonning techniques, chain-of-thought (COT), Large …
☆26Updated 6 months ago
ucl-dark / llm_debate
Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"
☆104Updated last year
rxlqn / awesome-llm-self-reflection
augmented LLM with self reflection
☆120Updated last year
FoundationAgents / AFlow
🔥🔥🔥 ICLR 2025 Oral. Automating Agentic Workflow Generation.
☆59Updated 3 weeks ago
PRIME-RL / ImplicitPRM
Repo of paper "Free Process Rewards without Process Labels"
☆145Updated last month
ZJLAB-AMMI / LLM4Teach
Python code to implement LLM4Teach, a policy distillation approach for teaching reinforcement learning agents with Large Language Model
☆35Updated last year
InfiAgent / InfiAgent
InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks (ICML 2024)
☆123Updated 4 months ago
abdulhaim / LMRL-Gym
☆91Updated 10 months ago
sanjibanc / agent_prm
☆30Updated 2 months ago
SALT-NLP / DyLAN
Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization
☆142Updated 11 months ago
WeiXiongUST / Building-Math-Agents-with-Multi-Turn-Iterative-Preference-Learning
This is an official implementation of the paper ``Building Math Agents with Multi-Turn Iterative Preference Learning'' with multi-turn DP…
☆25Updated 5 months ago
Ber666 / RAP
Reasoning with Language Model is Planning with World Model
☆164Updated last year
mingyin1 / Agents_Failure_Attribution
☆24Updated this week
facebookresearch / sweet_rl
Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks
☆186Updated 3 weeks ago
thunlp / Optima
Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System"
☆56Updated 5 months ago
WeiminXiong / MPO
MPO: Boosting LLM Agents with Meta Plan Optimization
☆50Updated 2 months ago
xf-zhao / LoT
Official implementation of LoT paper: "Enhancing Zero-Shot Chain-of-Thought Reasoning in Large Language Models through Logic"
☆23Updated last year
OSU-NLP-Group / WebDreamer
"Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents"
☆70Updated 3 weeks ago
YifeiZhou02 / ArCHer
Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"
☆167Updated 2 weeks ago
hkust-nlp / AgentBoard
An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]
☆311Updated 11 months ago
MingLiiii / Layer_Gradient
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective
☆63Updated 2 months ago
THU-KEG / Agentic-Reward-Modeling
Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems
☆90Updated 2 months ago
Yifan-Song793 / ETO
Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)
☆138Updated 6 months ago
allenai / ScienceWorld
ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.
☆258Updated 6 months ago