A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large Language Model Inference-Time Self-Improvement.
☆105Dec 24, 2024Updated last year
Alternatives and similar repositories for Awesome-LLM-Self-Improvement
Users that are interested in Awesome-LLM-Self-Improvement are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆18Oct 28, 2025Updated 6 months ago
- Agentic Keyframe Search for Video Question Answering☆18Apr 7, 2025Updated last year
- List of papers on Self-Correction of LLMs.☆80Dec 28, 2024Updated last year
- Code of "Regularized Best-of-N Sampling with Minimum Bayes Risk Objective for Language Model Alignment" (2025).☆14Apr 4, 2025Updated last year
- [NeurIPS'24 LanGame workshop] On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability☆42Apr 10, 2026Updated 3 weeks ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆184Dec 5, 2025Updated 5 months ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆84Apr 10, 2023Updated 3 years ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Mar 10, 2025Updated last year
- M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning☆47Jul 17, 2025Updated 9 months ago
- Extensive Self-Contrast Enables Feedback-Free Language Model Alignment☆20Apr 2, 2024Updated 2 years ago
- JudgeLRM: Large Reasoning Models as a Judge☆41Apr 7, 2026Updated 3 weeks ago
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"☆37Oct 11, 2023Updated 2 years ago
- ☆552Jan 2, 2025Updated last year
- Large Language Models Can Self-Improve in Long-context Reasoning☆73Nov 24, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ACL 2025 Main] (🏆 Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Proba…☆17Aug 15, 2025Updated 8 months ago
- Source code of ACL 2023 Main Conference Paper "PAD-Net: An Efficient Framework for Dynamic Networks".☆12Feb 28, 2026Updated 2 months ago
- ☆54Mar 6, 2025Updated last year
- Official implementation of NeurIPS'24 paper "Reinforcement Learning Policy as Macro Regulator Rather than Macro Placer".☆20Aug 13, 2025Updated 8 months ago
- The repository is created for AR2L algorithm used for solving online 3D-BPP.☆15Sep 22, 2025Updated 7 months ago
- Paper list for Efficient Reasoning.☆879Apr 21, 2026Updated 2 weeks ago
- ☆17Apr 11, 2025Updated last year
- Official implementation of NeurIPS'24 Spotlight paper "Monte Carlo Tree Search based Space Transfer for Black-box Optimization".☆13Nov 28, 2024Updated last year
- ☆46Mar 4, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models☆42Apr 10, 2025Updated last year
- Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning☆36Nov 17, 2024Updated last year
- [CVPR2026] VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice☆83Feb 27, 2026Updated 2 months ago
- Ollie is a open information extractor that uses dependency parses.☆12Sep 27, 2013Updated 12 years ago
- ☆30Dec 26, 2025Updated 4 months ago
- ☆46Jun 11, 2025Updated 10 months ago
- ☆31Mar 14, 2022Updated 4 years ago
- (ACL 2025) MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale☆49Jun 4, 2025Updated 11 months ago
- [ACL2025 Oral & Award] Evaluate Image/Video Generation like Humans - Fast, Explainable, Flexible☆124Aug 10, 2025Updated 8 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆111Dec 10, 2025Updated 4 months ago
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆20Nov 4, 2025Updated 6 months ago
- ☆17Aug 1, 2025Updated 9 months ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆37Jan 21, 2025Updated last year
- Time-R1: Framework and resources for endowing LLMs with comprehensive temporal reasoning (understanding, prediction, creative generation)…☆73Jun 11, 2025Updated 10 months ago
- Paper List of Inference/Test Time Scaling/Computing☆375Apr 27, 2026Updated last week
- Paper list for the paper "Authorship Attribution in the Era of Large Language Models: Problems, Methodologies, and Challenges (SIGKDD Exp…☆19Apr 5, 2026Updated last month