[NeurIPS 2025 Spotlight] Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning
☆84Sep 19, 2025Updated 6 months ago
Alternatives and similar repositories for CLS-RL
Users that are interested in CLS-RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [EMNLP 2024] Implementation of vision-language model fine-tuning via simple parameter-efficient modification☆18Nov 24, 2024Updated last year
- [NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation☆111Sep 18, 2025Updated 6 months ago
- Co-Reinforcement Learning for Unified Multimodal Understanding and Generation☆43Jul 22, 2025Updated 8 months ago
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]☆187Jun 5, 2025Updated 10 months ago
- [Blog 1] Recording a bug of grpo_trainer in some R1 projects☆23Feb 23, 2025Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆14Sep 14, 2023Updated 2 years ago
- [ICLR2026] This is the first paper to explore how to effectively use R1-like RL for MLLMs and introduce Vision-R1, a reasoning MLLM that…☆1,108Mar 20, 2026Updated 3 weeks ago
- Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.☆844May 14, 2025Updated 10 months ago
- Encourage Medical LLM to engage in deep thinking similar to DeepSeek-R1.☆26Apr 24, 2025Updated 11 months ago
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"☆33Mar 26, 2025Updated last year
- The official code for "TextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt Tuning" | [AAAI2025]☆50Mar 13, 2025Updated last year
- VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models☆78Jul 13, 2024Updated last year
- Codebase for VidHal: Benchmarking Hallucinations in Vision LLMs☆14Apr 19, 2025Updated 11 months ago
- code for EMNLP2018 paper 'Associative-multichannel-autoencoder for multimodal word representation'☆13Aug 24, 2018Updated 7 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆108Jun 10, 2025Updated 10 months ago
- [AAAI 2026] Relation-R1: Progressively Cognitive Chain-of-Thought Guided Reinforcement Learning for Unified Relation Comprehension☆18Mar 6, 2026Updated last month
- Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’☆2,287Oct 29, 2025Updated 5 months ago
- [CVPR25] CoLLM: A Large Language Model for Composed Image Retrieval☆28Mar 26, 2025Updated last year
- ☆34Mar 24, 2026Updated 2 weeks ago
- Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency☆62Jun 6, 2025Updated 10 months ago
- Official implementation of "Why are Visually-Grounded Language Models Bad at Image Classification?" (NeurIPS 2024)☆99Oct 19, 2024Updated last year
- [ICLR2025] Enhancing Cognition and Explainability of Multimodal Foundation Models with Self-Synthesized Data☆28Mar 3, 2025Updated last year
- [NeurIPS 2025] Official code implementation of Perception R1: Pioneering Perception Policy with Reinforcement Learning☆290Jul 15, 2025Updated 8 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- [CVPR 2025 Highlight] Official repository for CoMM Dataset☆52Dec 31, 2024Updated last year
- Audio Entailment: Deductive Reasoning for Audio Understanding☆17Dec 10, 2024Updated last year
- ☆24Jun 18, 2025Updated 9 months ago
- ☆41Jun 9, 2025Updated 10 months ago
- R1-onevision, a visual language model capable of deep CoT reasoning.☆578Apr 13, 2025Updated 11 months ago
- Consistent Prompting for Rehearsal-Free Continual Learning [CVPR2024]☆35Jun 12, 2025Updated 10 months ago
- (CVPR 2025) PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction☆143Mar 6, 2025Updated last year
- GRPO Algorithm for Llava Architecture (Based on Verl)☆49May 9, 2025Updated 11 months ago
- The official implement of "Grounded Chain-of-Thought for Multimodal Large Language Models"☆21Jul 21, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- [TMLR 25] SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models☆148Oct 10, 2025Updated 6 months ago
- ✨✨ [ICLR 2026] R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning☆283May 9, 2025Updated 11 months ago
- Witness the aha moment of VLM with less than $3.☆4,046May 19, 2025Updated 10 months ago
- [ICCV 2025 Highlight] Official PyTorch implementation of "SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segment…☆19Jan 18, 2026Updated 2 months ago
- TinyLLaVA-Video-R1: Towards Smaller LMMs for Video Reasoning☆115Dec 24, 2025Updated 3 months ago
- Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).☆101Feb 11, 2025Updated last year
- ☆11Apr 23, 2023Updated 2 years ago