dongxiangjue/Awesome-LLM-Self-Improvement

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/dongxiangjue/Awesome-LLM-Self-Improvement)

dongxiangjue / Awesome-LLM-Self-Improvement

A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large Language Model Inference-Time Self-Improvement.

☆108

Alternatives and similar repositories for Awesome-LLM-Self-Improvement

Users that are interested in Awesome-LLM-Self-Improvement are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Aurora-slz / MM-Verify
View on GitHub
☆19Oct 28, 2025Updated 8 months ago
ryokamoi / llm-self-correction-papers
View on GitHub
List of papers on Self-Correction of LLMs.
☆82May 19, 2026Updated 2 months ago
NuoJohnChen / JudgeLRM
View on GitHub
JudgeLRM: Large Reasoning Models as a Judge
☆42May 6, 2026Updated 2 months ago
Qianyue-Wang / Generating-Long-form-Story-Using-Dynamic-Hierarchical-Outlining-with-Memory-Enhancement
View on GitHub
☆19Oct 12, 2024Updated last year
VITA-Group / o1-planning
View on GitHub
[NeurIPS'24 LanGame workshop] On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability
☆42Apr 10, 2026Updated 3 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
fansunqi / AKeyS
View on GitHub
Agentic Keyframe Search for Video Question Answering
☆18Jun 30, 2026Updated 3 weeks ago
RZFan525 / Awesome-ScalingLaws
View on GitHub
A curated list of awesome resources dedicated to Scaling Laws for LLMs
☆84Apr 10, 2023Updated 3 years ago
ylsung / vl-merging
View on GitHub
PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"
☆37Oct 11, 2023Updated 2 years ago
jdf-prog / LLM-Gen
View on GitHub
A simple generate script utils using fastchat conv template for generation of Large Language Models
☆21Jun 21, 2023Updated 3 years ago
THUDM / Self-Contrast
View on GitHub
Extensive Self-Contrast Enables Feedback-Free Language Model Alignment
☆20Apr 2, 2024Updated 2 years ago
SihengLi99 / SEALONG
View on GitHub
Large Language Models Can Self-Improve in Long-context Reasoning
☆72Nov 24, 2024Updated last year
lqtrung1998 / mwp_ReFT
View on GitHub
☆554Jan 2, 2025Updated last year
tianyi-lab / R2-T2
View on GitHub
[ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"
☆19Mar 10, 2025Updated last year
Shwai-He / PAD-Net
View on GitHub
Source code of ACL 2023 Main Conference Paper "PAD-Net: An Efficient Framework for Dynamic Networks".
☆14Feb 28, 2026Updated 4 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
cambridgeltl / zepo
View on GitHub
Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments (Zhou et al., EMNLP 2024)
☆14Oct 3, 2024Updated last year
cs-holder / Reasoning-Self-Evolution-Survey
View on GitHub
☆54Mar 6, 2025Updated last year
THU-KEG / AdaptThink
View on GitHub
☆186Dec 5, 2025Updated 7 months ago
schmmd / ollie
View on GitHub
Ollie is a open information extractor that uses dependency parses.
☆12Sep 27, 2013Updated 12 years ago
MME-Benchmarks / MME-Unify
View on GitHub
✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models
☆42Apr 10, 2025Updated last year
QingyangZhang / Label-Free-RLVR
View on GitHub
☆311Jul 6, 2025Updated last year
LivingFutureLab / DeltaBench
View on GitHub
☆46Mar 4, 2025Updated last year
shirley-wu / daco
View on GitHub
[NeurIPS 2024 D&B Track] DACO: Towards Application-Driven and Comprehensive Data Analysis via Code Generation
☆14Mar 5, 2025Updated last year
which47 / LLMCL
View on GitHub
Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning
☆38Nov 17, 2024Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
XinbangZhang / DATA-NAS
View on GitHub
Codes for DATA: Differentiable ArchiTecture Approximation.
☆11Jul 22, 2021Updated 5 years ago
pzhren / DW-ViT
View on GitHub
☆31Mar 14, 2022Updated 4 years ago
thu-coai / SPaR
View on GitHub
☆47Jun 11, 2025Updated last year
jinzheio / FeynmanDiagram
View on GitHub
C-program for Feynman diagram generation
☆10Jun 10, 2013Updated 13 years ago
xiwenc1 / DRA-GRPO
View on GitHub
Official code for the paper: DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models
☆24Jan 6, 2026Updated 6 months ago
yzhangchuck / awesome-llm-reasoning-long2short-papers
View on GitHub
☆17Apr 11, 2025Updated last year
openpsi-project / ReaLHF
View on GitHub
Super-Efficient RLHF Training of LLMs with Parameter Reallocation
☆335Apr 24, 2025Updated last year
MAmmoTH-VL / MAmmoTH-VL
View on GitHub
(ACL 2025) MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale
☆50Jun 4, 2025Updated last year
Kwai-Klear / CE-GPPO
View on GitHub
CE-GPPO: Controlling Entropy via Gradient-Preserving Clipping Policy Optimization in Reinforcement Learning
☆16Jan 23, 2026Updated 6 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
beichenzbc / BoostStep
View on GitHub
official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"
☆37Jan 21, 2025Updated last year
zhuyr97 / SDCM
View on GitHub
☆10Feb 13, 2023Updated 3 years ago
hughbzhang / o1_inference_scaling_laws
View on GitHub
Replicating O1 inference-time scaling laws
☆94Dec 1, 2024Updated last year
MingLiiii / Gradient_Unified
View on GitHub
How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients
☆20Jun 17, 2025Updated last year
baixianghuang / survey-authorship
View on GitHub
Paper list for the paper "Authorship Attribution in the Era of Large Language Models: Problems, Methodologies, and Challenges (SIGKDD Exp…
☆19May 25, 2026Updated 2 months ago
google-deepmind / nonstationary_mbml
View on GitHub
Memory-Based Meta-Learning on Non-Stationary Distributions
☆18Mar 14, 2024Updated 2 years ago
YujunZhou / EVOL-RL
View on GitHub
Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).
☆51Mar 31, 2026Updated 3 months ago