matthewrenze/self-reflection

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/matthewrenze/self-reflection)

matthewrenze / self-reflection

Self-Reflection in LLM Agents: Effects on Problem-Solving Performance

☆98

Alternatives and similar repositories for self-reflection

Users that are interested in self-reflection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

matthewrenze / jhu-concise-cot
View on GitHub
The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models
☆25Nov 25, 2024Updated last year
ChengpengLi1003 / CoRT
View on GitHub
☆72Oct 23, 2025Updated 9 months ago
jmnian / WRAG
View on GitHub
Code for paper "W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering"
☆16Oct 2, 2025Updated 9 months ago
rxlqn / awesome-llm-self-reflection
View on GitHub
augmented LLM with self reflection
☆144Nov 21, 2023Updated 2 years ago
shengliu66 / FractionalReason
View on GitHub
Official github repo for "Fractional Reasoning via Latent Steering Vectors Improves Inference Time Compute"
☆17Jun 30, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
cambridgeltl / topviewrs
View on GitHub
TopViewRS: Vision-Language Models as Top-View Spatial Reasoners (EMNLP 2024 Oral)
☆15Jun 14, 2025Updated last year
miralab-ai / autoreason
View on GitHub
☆45Dec 14, 2024Updated last year
alibaba / SimCSE-with-CARDS
View on GitHub
Source code for SIGIR 2022 paper.
☆16Apr 25, 2022Updated 4 years ago
Junjie-Ye / ToolEyes
View on GitHub
[COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios
☆74May 13, 2025Updated last year
Tufalabs / textbook-to-rl
View on GitHub
☆29Aug 27, 2025Updated 11 months ago
IlyasMoutawwakil / py-txi
View on GitHub
A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.
☆32Sep 19, 2025Updated 10 months ago
illinois-impact / klap
View on GitHub
A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches
☆15Jun 21, 2019Updated 7 years ago
madaan / self-refine
View on GitHub
LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.
☆815Oct 4, 2024Updated last year
archiki / UTGenDebug
View on GitHub
Code for our paper "Learning to Generate Unit Tests for Automated Debugging"
☆18Mar 7, 2025Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
rladmstn1714 / CLIcK
View on GitHub
CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean
☆48Dec 23, 2024Updated last year
cambridgeltl / multi3woz
View on GitHub
The official repository for Multi3WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapte…
☆17Jan 15, 2024Updated 2 years ago
zhengkid / Parallel_Thinking_via_MoT
View on GitHub
Official Code for "Learning to Reason via Mixture-of-Thought for Logical Reasoning"
☆29Nov 20, 2025Updated 8 months ago
mahtabbigverdi / Aurora
View on GitHub
☆12Dec 4, 2024Updated last year
gjq100 / Graph-Counselor
View on GitHub
☆32Jun 5, 2025Updated last year
smilegate-ai / OPELA
View on GitHub
☆29Nov 23, 2022Updated 3 years ago
princeton-nlp / continual-factoid-memorization
View on GitHub
Continual Memorization of Factoids in Large Language Models
☆12Nov 20, 2024Updated last year
xxyQwQ / CoMAS
View on GitHub
Implementation for the paper "CoMAS: Co-Evolving Multi-Agent Systems via Interaction Rewards".
☆53Jan 26, 2026Updated 6 months ago
corca-ai / evaluating-gpt-4o-on-CLIcK
View on GitHub
Evaluate gpt-4o on CLIcK (Korean NLP Dataset)
☆20May 18, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
sb-jang / kodialogbench
View on GitHub
Code and data for "KoDialogBench: Evaluating Conversational Understanding of Language Models with Korean Dialogue Benchmark" (LREC-COLING…
☆18Apr 15, 2025Updated last year
noahshinn / reflexion
View on GitHub
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
☆3,214Jan 14, 2025Updated last year
nalzok / test-time-label-shift
View on GitHub
Test-Time Label-Shift Adaptation
☆14May 24, 2023Updated 3 years ago
duykhuongnguyen / MAT-Steer
View on GitHub
☆21Aug 19, 2025Updated 11 months ago
ttw1018 / MoPE-DST
View on GitHub
The code for "MoPE: Mixture of Prefix Experts for Zero-Shot Dialogue State Tracking"
☆19Jan 25, 2025Updated last year
hrwise-nlp / AppBench
View on GitHub
This is for EMNLP 2024 Paper: AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction
☆16Nov 4, 2024Updated last year
JasonForJoy / Model-Editing-Hurt
View on GitHub
EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue
☆37May 26, 2025Updated last year
WujiangXu / EPO
View on GitHub
The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"
☆40Jul 13, 2026Updated 2 weeks ago
yanqiangmiffy / tree2retriever
View on GitHub
Recursive Abstractive Processing for Tree-Organized Retrieval
☆10May 30, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
keeeeenw / TinyLlama
View on GitHub
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
☆14Mar 30, 2024Updated 2 years ago
guosyjlu / DS-Agent
View on GitHub
Official implementation of "DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning" in ICML'24
☆239Dec 3, 2024Updated last year
Extrality / nvidia-dind
View on GitHub
docker:dind with NVIDIA GPU support via NVIDIA container toolkit
☆14Jul 1, 2026Updated 3 weeks ago
siddheshih / culture-awareness-llms
View on GitHub
☆20Nov 7, 2024Updated last year
trestad / mitigating-reversal-curse
View on GitHub
Code for paper 'Are We Falling in a Middle-Intelligence Trap? An Analysis and Mitigation of the Reversal Curse'
☆14Aug 2, 2024Updated last year
dtch1997 / steering-bench
View on GitHub
Official codebase for "Analyzing the Generalization and Reliability of Steering Vectors"
☆22Dec 14, 2024Updated last year
AI9Stars / AStar-Thought
View on GitHub
[NeurIPS 2025] A*-Thought: Efficient Reasoning via Bidirectional Compression for Low-Resource Settings
☆16Jun 12, 2026Updated last month