madaan/self-refine

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/madaan/self-refine)

madaan / self-refine

LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.

☆814

Alternatives and similar repositories for self-refine

Users that are interested in self-refine are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

reasoning-machines / prompt-lib
View on GitHub
A set of utilities for running few-shot prompting experiments on large-language models
☆124Oct 25, 2023Updated 2 years ago
noahshinn / reflexion
View on GitHub
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
☆3,210Jan 14, 2025Updated last year
teacherpeterpan / self-correction-llm-papers
View on GitHub
This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.
☆573Oct 28, 2024Updated last year
princeton-nlp / tree-of-thought-llm
View on GitHub
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
☆6,031Jan 16, 2025Updated last year
Timothyxxx / Chain-of-ThoughtsPapers
View on GitHub
A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".
☆2,105Oct 5, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
TIGER-AI-Lab / Program-of-Thoughts
View on GitHub
Data and Code for Program of Thoughts [TMLR 2023]
☆317May 15, 2024Updated 2 years ago
FranxYao / chain-of-thought-hub
View on GitHub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
☆2,776Aug 4, 2024Updated last year
debjitpaul / refiner
View on GitHub
About The corresponding code from our paper " REFINER: Reasoning Feedback on Intermediate Representations" (EACL 2024). Do not hesitate t…
☆76Jan 27, 2026Updated 5 months ago
agi-templar / Stable-Alignment
View on GitHub
Multi-agent Social Simulation + Efficient, Effective, and Stable alternative of RLHF. Code for the paper "Training Socially Aligned Langu…
☆356Jun 18, 2023Updated 3 years ago
jeffhj / LM-reasoning
View on GitHub
This repository contains a collection of papers and resources on Reasoning in Large Language Models.
☆572Nov 13, 2023Updated 2 years ago
keirp / automatic_prompt_engineer
View on GitHub
☆1,362Apr 29, 2024Updated 2 years ago
zjunlp / Prompt4ReasoningPapers
View on GitHub
[ACL 2023] Reasoning with Language Model Prompting: A Survey
☆1,009May 21, 2025Updated last year
reasoning-machines / pal
View on GitHub
PaL: Program-Aided Language Models (ICML 2023)
☆525Jun 30, 2023Updated 3 years ago
yizhongw / self-instruct
View on GitHub
Aligning pretrained language models with instruction data generated by themselves.
☆4,606Mar 27, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
tatsu-lab / alpaca_eval
View on GitHub
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
☆2,007Aug 9, 2025Updated 11 months ago
chuanyang-Zheng / Progressive-Hint
View on GitHub
This is the official implementation of "Progressive-Hint Prompting Improves Reasoning in Large Language Models"
☆208Oct 11, 2023Updated 2 years ago
openai / prm800k
View on GitHub
800,000 step-level correctness labels on LLM solutions to MATH problems
☆2,151Jun 1, 2023Updated 3 years ago
FranxYao / GPT-Bargaining
View on GitHub
Code for Arxiv 2023: Improving Language Model Negociation with Self-Play and In-Context Learning from AI Feedback
☆207May 24, 2023Updated 3 years ago
WENGSYX / Self-Verification
View on GitHub
We have released the code and demo program required for LLM with self-verification
☆61Oct 18, 2023Updated 2 years ago
WHGTyen / BIG-Bench-Mistake
View on GitHub
A dataset of LLM-generated chain-of-thought steps annotated with mistake location.
☆89Aug 10, 2024Updated last year
JeremyAlain / imitation_learning_from_language_feedback
View on GitHub
This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"
☆26Mar 30, 2023Updated 3 years ago
maitrix-org / llm-reasoners
View on GitHub
A library for advanced large language model reasoning
☆2,341Jun 10, 2025Updated last year
ysymyth / ReAct
View on GitHub
[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models
☆4,074Feb 6, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
google-research / FLAN
View on GitHub
☆1,565Jul 2, 2026Updated 3 weeks ago
SinclairCoder / Instruction-Tuning-Papers
View on GitHub
Reading list of Instruction-tuning. A trend starts from Natrural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).
☆769Jul 20, 2023Updated 3 years ago
GAIR-NLP / auto-j
View on GitHub
Generative Judge for Evaluating Alignment
☆251Jan 18, 2024Updated 2 years ago
THU-KEG / EvaluationPapers4ChatGPT
View on GitHub
Resource, Evaluation and Detection Papers for ChatGPT
☆456Mar 21, 2024Updated 2 years ago
microsoft / LMOps
View on GitHub
General technology for enabling AI capabilities w/ LLMs and MLLMs
☆4,444Updated this week
AkariAsai / self-rag
View on GitHub
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai,…
☆2,410May 25, 2024Updated 2 years ago
veronica320 / Faithful-COT
View on GitHub
Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".
☆169May 7, 2024Updated 2 years ago
lucidrains / toolformer-pytorch
View on GitHub
Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI
☆2,062Jul 22, 2024Updated 2 years ago
ruixiangcui / AGIEval
View on GitHub
☆774Jun 13, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
OpenBioLink / ThoughtSource
View on GitHub
A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research …
☆1,014Dec 16, 2024Updated last year
lupantech / chameleon-llm
View on GitHub
Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".
☆1,140Dec 23, 2023Updated 2 years ago
Instruction-Tuning-with-GPT-4 / GPT-4-LLM
View on GitHub
Instruction Tuning with GPT-4
☆4,332Jun 11, 2023Updated 3 years ago
OSU-NLP-Group / Mind2Web
View on GitHub
[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist w…
☆1,015Nov 5, 2025Updated 8 months ago
allenai / lumos
View on GitHub
Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"
☆478Mar 19, 2024Updated 2 years ago
ShengranHu / Thought-Cloning
View on GitHub
[NeurIPS '23 Spotlight] Thought Cloning: Learning to Think while Acting by Imitating Human Thinking
☆268Jun 28, 2024Updated 2 years ago
tatsu-lab / alpaca_farm
View on GitHub
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
☆845Jul 1, 2024Updated 2 years ago