[Blog 1] Recording a bug of grpo_trainer in some R1 projects
☆22Feb 23, 2025Updated last year
Alternatives and similar repositories for R1-Video-fixbug
Users that are interested in R1-Video-fixbug are comparing it to the libraries listed below
Sorting:
- [IEEE-TIP R2]Advancing Pre-trained Teacher: Towards Robust Feature Discrepancy for Anomaly Detection☆15Jan 30, 2026Updated last month
- ✨First Open-Source R1-like Video-LLM [2025/02/18]☆381Feb 23, 2025Updated last year
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]☆183Jun 5, 2025Updated 9 months ago
- Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?☆88Jul 13, 2025Updated 7 months ago
- [ICME 2023, Oral] HybridPoint: Point cloud registration based on hybrid point sampling and matching☆29Mar 14, 2024Updated last year
- Official Repository: A Comprehensive Benchmark for Logical Reasoning in MLLMs☆45Jun 17, 2025Updated 8 months ago
- [AAAI 2025] RCTrans: Radar-Camera Transformer via Radar Densiffer and Sequential Decoder for 3D Object Detection☆41Mar 14, 2025Updated 11 months ago
- ☆58Oct 2, 2025Updated 5 months ago
- Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning☆141Aug 21, 2025Updated 6 months ago
- ☆11Sep 2, 2024Updated last year
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"☆25Jul 21, 2025Updated 7 months ago
- ☆107Jun 10, 2025Updated 8 months ago
- [TKDE 2024] Robust Knowledge Adaptation for Dynamic Graph Neural Networks☆11Apr 11, 2024Updated last year
- End-to-end implementation of the Social Graph Network (SGN), described in the Structural Reasoning for Image-based Social Relation Recogn…☆13Apr 3, 2024Updated last year
- ☆23Feb 12, 2026Updated 3 weeks ago
- ☆12Feb 27, 2025Updated last year
- ☆17Dec 23, 2025Updated 2 months ago
- Imagine While Reasoning in Space: Multimodal Visualization-of-Thought (ICML 2025)☆69Apr 12, 2025Updated 10 months ago
- ☆10Jan 28, 2024Updated 2 years ago
- REverse-Engineered Reasoning for Open-Ended Generation☆93Sep 10, 2025Updated 5 months ago
- Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency☆60Jun 6, 2025Updated 8 months ago
- Explore the Multimodal “Aha Moment” on 2B Model☆623Mar 18, 2025Updated 11 months ago
- python 实现的微信自动回复机器人☆11Nov 16, 2019Updated 6 years ago
- ☆14Dec 13, 2021Updated 4 years ago
- v1: Learning to Point Visual Tokens for Multimodal Grounded Reasoning☆18Oct 6, 2025Updated 4 months ago
- Official Implementation for "SiLVR : A Simple Language-based Video Reasoning Framework"☆19Jan 18, 2026Updated last month
- 人工智能:爬山法、随机重启爬山法、模拟退火算法、遗传算法、启发式搜索方法解决八数码和八皇后问题☆11Jul 15, 2021Updated 4 years ago
- Official code for DAM: Dynamic Adapter Merging for Continual Video QA Learning☆14Apr 25, 2024Updated last year
- 数据库实践课设:利用C#和SQL-Server实现简易的选课系统☆10Oct 11, 2020Updated 5 years ago
- ☆16Oct 11, 2025Updated 4 months ago
- MM-Eureka V0 also called R1-Multimodal-Journey, Latest version is in MM-Eureka☆324Jun 21, 2025Updated 8 months ago
- R1-like Video-LLM for Temporal Grounding☆133Jun 20, 2025Updated 8 months ago
- A Pytorch implementation of Diffusion-Based Probabilistic Uncertainty Estimation for Active Domain Adaptation☆15Nov 28, 2023Updated 2 years ago
- [ICLR 2024] Adaptive Replay Ratio implementation from 'Revisiting Plasticity in Visual RL: Data, Modules and Training Stages'.☆13Oct 9, 2024Updated last year
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆18Feb 29, 2024Updated 2 years ago
- [IJCV] Progressive Visual Prompt Learning with Contrastive Feature Re-formation☆15Aug 10, 2024Updated last year
- ☆16Sep 25, 2025Updated 5 months ago
- [COLING 2025🔥] Evolver: Chain-of-Evolution Prompting to Boost Large Multimodal Models for Hateful Meme Detection☆17Jan 21, 2025Updated last year
- Cog wrapper for playgroundai/playground-v2.5-1024px-aesthetic☆17Nov 25, 2024Updated last year