LightChen233 / Awesome-Long-Chain-of-Thought-ReasoningLinks
Latest Advances on Long Chain-of-Thought Reasoning
β390Updated 3 weeks ago
Alternatives and similar repositories for Awesome-Long-Chain-of-Thought-Reasoning
Users that are interested in Awesome-Long-Chain-of-Thought-Reasoning are comparing it to the libraries listed below
Sorting:
- π A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyondβ252Updated 2 weeks ago
- Stop Overthinking: A Survey on Efficient Reasoning for Large Language Modelsβ464Updated last week
- Awesome RL-based LLM Reasoningβ526Updated last month
- β242Updated last month
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It containsβ¦β228Updated 2 weeks ago
- Paper list for Efficient Reasoning.β509Updated this week
- Awesome RL Reasoning Recipes ("Triple R")β697Updated last week
- Awesome Agent Trainingβ164Updated this week
- This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-basβ¦β922Updated last week
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learningβ573Updated 3 weeks ago
- β222Updated this week
- β220Updated last month
- A series of technical report on Slow Thinking with LLMβ699Updated 2 weeks ago
- Survey on LLM Agents (Published on CoLing 2025)β314Updated last month
- Multimodal Chain-of-Thought Reasoning: A Comprehensive Surveyβ663Updated this week
- Generative AI Act II: Test Time Scaling Drives Cognition Engineeringβ188Updated 2 months ago
- Collect every awesome work about r1!β388Updated last month
- β241Updated 2 weeks ago
- A Survey on Multimodal Retrieval-Augmented Generationβ231Updated 3 weeks ago
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"β125Updated last week
- Paper List of Inference/Test Time Scaling/Computingβ264Updated this week
- Latest Advances on System-2 Reasoningβ1,128Updated 2 weeks ago
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)β265Updated last week
- β101Updated this week
- R1-onevision, a visual language model capable of deep CoT reasoning.β528Updated 2 months ago
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learningβ566Updated last month
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuningβ144Updated 6 months ago
- MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learningβ665Updated 3 weeks ago
- llm & rlβ151Updated this week
- Building a comprehensive and handy list of papers for GUI agentsβ402Updated last week