modelscope / awesome-deep-reasoning
Collect every awesome work about R1!
☆349Updated this week
Alternatives and similar repositories for awesome-deep-reasoning:
Users that are interested in awesome-deep-reasoning are comparing it to the libraries listed below
- Latest Advances on Long Chain-of-Thought Reasoning☆241Updated last week
- ☆405Updated this week
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆300Updated last month
- MMR1: Advancing the Frontiers of Multimodal Reasoning☆155Updated last month
- Real-time updated, fine-grained reading list on LLM-synthetic-data.🔥☆253Updated 3 months ago
- ☆153Updated 3 weeks ago
- Explore the Multimodal “Aha Moment” on 2B Model☆577Updated last month
- ☆146Updated last month
- Awesome RL Reasoning Recipes ("Triple R")☆410Updated this week
- A visualization tool for deeper understanding and easier debugging of RLHF training.☆187Updated 2 months ago
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆477Updated this week
- R1-onevision, a visual language model capable of deep CoT reasoning.☆506Updated last week
- ☆673Updated last week
- Awesome RL-based LLM Reasoning☆450Updated last week
- MM-EUREKA: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning☆574Updated this week
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning☆173Updated last month
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆244Updated last week
- ☆126Updated 3 months ago
- A lightweight reproduction of DeepSeek-R1-Zero with in-depth analysis of self-reflection behavior.☆229Updated last week
- Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme☆115Updated 2 weeks ago
- A series of technical reports on Slow Thinking with LLM☆644Updated last week
- Awesome Agent Training☆33Updated last week
- ☆518Updated 3 months ago
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆230Updated last month
- Related works and background techniques about OpenAI o1☆221Updated 3 months ago
- A journey to real multimodal R1! We are running large-scale experiments.☆295Updated last month
- ☆698Updated this week
- Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey☆507Updated this week
- ☆283Updated last month
- Awesome-RAG-VIsion: a curated list of advanced retrieval augmented generation (RAG) for Computer Vision☆136Updated last week