☆56Mar 6, 2025Updated last year
Alternatives and similar repositories for Reasoning-Self-Evolution-Survey
Users that are interested in Reasoning-Self-Evolution-Survey are comparing it to the libraries listed below
Sorting:
- ☆52Feb 12, 2025Updated last year
- The official implement of "Grounded Chain-of-Thought for Multimodal Large Language Models"☆21Jul 21, 2025Updated 7 months ago
- Code for NAACL 2025 paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge"☆16Updated this week
- ☆49Apr 11, 2025Updated 10 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆55Nov 29, 2024Updated last year
- [ICLR 2026] RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling☆34Feb 25, 2026Updated last week
- The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"☆19Mar 11, 2024Updated last year
- Papers of Implicit Reasoning in LLMs.☆22Mar 13, 2025Updated 11 months ago
- Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models☆24Sep 26, 2024Updated last year
- ☆20May 28, 2025Updated 9 months ago
- ☆20Nov 3, 2024Updated last year
- Latest Advances on Long Chain-of-Thought Reasoning☆615Jul 18, 2025Updated 7 months ago
- Awesome paper for multi-modal llm with grounding ability☆19Oct 11, 2025Updated 4 months ago
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering☆212Apr 22, 2025Updated 10 months ago
- [NeurIPS 2025 D&B Track] Evaluation Code Repo for Paper "PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts"☆41May 22, 2025Updated 9 months ago
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆27Mar 30, 2023Updated 2 years ago
- ☆49Aug 14, 2025Updated 6 months ago
- ☆20Jan 16, 2024Updated 2 years ago
- ☆26Mar 4, 2025Updated last year
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Feb 23, 2024Updated 2 years ago
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"☆28May 28, 2024Updated last year
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆112Sep 28, 2024Updated last year
- MathFusion: Enhancing Mathematical Problem-solving of LLM through Instruction Fusion (ACL 2025)☆35Jul 16, 2025Updated 7 months ago
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆65Feb 13, 2023Updated 3 years ago
- [NeurIPS 2025 Spotlight] LLM post-training suite — featuring ReasonFlux, ReasonFlux-PRM, and ReasonFlux-Coder.☆521Sep 27, 2025Updated 5 months ago
- ☆552Jan 2, 2025Updated last year
- Latest Advances on System-2 Reasoning☆1,333Jun 8, 2025Updated 8 months ago
- This repository holds code and other relevant files for the NeurIPS 2022 tutorial: Foundational Robustness of Foundation Models.☆72Jan 13, 2023Updated 3 years ago
- Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)☆67Mar 27, 2025Updated 11 months ago
- ☆81Mar 11, 2025Updated 11 months ago
- Official Implementation of APB (ACL 2025 main Oral) and Spava.☆35Jan 30, 2026Updated last month
- Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI, derived from Ling.☆107Aug 5, 2025Updated 7 months ago
- ☆145Sep 12, 2025Updated 5 months ago
- ☆285Jan 29, 2026Updated last month
- Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"☆392Jan 19, 2025Updated last year
- This the implementation of LeCo☆31Jan 20, 2025Updated last year
- JORA: JAX Tensor-Parallel LoRA Library (ACL 2024)☆35Apr 25, 2024Updated last year
- ☆33Oct 31, 2024Updated last year
- JudgeLRM: Large Reasoning Models as a Judge☆41Jan 29, 2026Updated last month