LightChen233 / Awesome-Long-Chain-of-Thought-ReasoningLinks
Latest Advances on Long Chain-of-Thought Reasoning
β343Updated last week
Alternatives and similar repositories for Awesome-Long-Chain-of-Thought-Reasoning
Users that are interested in Awesome-Long-Chain-of-Thought-Reasoning are comparing it to the libraries listed below
Sorting:
- π A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyondβ228Updated this week
- Stop Overthinking: A Survey on Efficient Reasoning for Large Language Modelsβ414Updated 2 weeks ago
- β208Updated last week
- Awesome RL Reasoning Recipes ("Triple R")β605Updated this week
- Awesome RL-based LLM Reasoningβ506Updated last month
- Awesome Agent Trainingβ131Updated this week
- A Survey on Multimodal Retrieval-Augmented Generationβ206Updated this week
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learningβ518Updated last week
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It containsβ¦β218Updated 2 weeks ago
- Paper list for Efficient Reasoning.β463Updated last week
- β198Updated last week
- Collect every awesome work about r1!β372Updated last month
- A series of technical report on Slow Thinking with LLMβ685Updated this week
- Latest Advances on System-2 Reasoningβ1,041Updated last month
- β100Updated last month
- This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-basβ¦β820Updated this week
- β193Updated last week
- Generative AI Act II: Test Time Scaling Drives Cognition Engineeringβ183Updated last month
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)β250Updated last month
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuningβ141Updated 5 months ago
- Multimodal Chain-of-Thought Reasoning: A Comprehensive Surveyβ613Updated 2 weeks ago
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"β120Updated last week
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.β409Updated last month
- Survey on LLM Agents (Published on CoLing 2025)β283Updated 3 weeks ago
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learningβ541Updated last week
- Explore the Multimodal βAha Momentβ on 2B Modelβ589Updated 2 months ago
- A comprehensive collection of process reward models.β85Updated 2 weeks ago
- llm & rlβ134Updated last week
- Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.β760Updated 3 weeks ago
- MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learningβ631Updated last week