modelscope / awesome-deep-reasoning
Collect every awesome work about r1!
☆306Updated last week
Alternatives and similar repositories for awesome-deep-reasoning:
Users that are interested in awesome-deep-reasoning are comparing it to the libraries listed below
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆278Updated last week
- Explore the Multimodal “Aha Moment” on 2B Model☆524Updated last week
- MMR1: Advancing the Frontiers of Multimodal Reasoning☆145Updated last week
- A visuailzation tool to make deep understaning and easier debugging for RLHF training.☆177Updated last month
- ☆113Updated 2 months ago
- ☆186Updated this week
- ☆124Updated 3 weeks ago
- R1-onevision, a visual language model capable of deep CoT reasoning.☆464Updated last week
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning☆158Updated last week
- ☆117Updated 2 weeks ago
- ☆518Updated last week
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆376Updated this week
- A Comprehensive Survey on Long Context Language Modeling☆86Updated last week
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning☆306Updated 3 weeks ago
- MM-EUREKA: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning☆425Updated last week
- Real-time updated, fine-grained reading list on LLM-synthetic-data.🔥☆238Updated 2 months ago
- This is the first paper to explore how to effectively use RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-sta…☆346Updated this week
- Awesome RL-based LLM Reasoning☆341Updated this week
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆223Updated last month
- A series of technical report on Slow Thinking with LLM☆595Updated last week
- The official GitHub page for the survey paper "A Survey on Data Augmentation in Large Model Era"☆124Updated 8 months ago
- The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"☆148Updated last week
- ☆172Updated last month
- The related works and background techniques about Openai o1☆217Updated 2 months ago
- Awesome Reasoning in MLLMs: Papers and Projects about learning to reason with MLLMs, including Chain-of-Thought (CoT), OpenAl o1, and Dee…☆45Updated last week
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆229Updated last month
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆135Updated this week
- OpenSeek aims to unite the global open source community to drive collaborative innovation in algorithms, data and systems to develop next…☆124Updated this week
- A Survey on Efficient Reasoning for LLMs☆116Updated this week
- A jounery to real multimodel R1 ! We are doing on large-scale experiment☆280Updated 2 weeks ago