ZJU-REAL / cooper
☆22 Updated last week
Alternatives and similar repositories for cooper
Users who are interested in cooper are comparing it to the libraries listed below.
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM" ☆16 Updated 2 weeks ago
- Code for Let LLMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604 ☆46 Updated last month
- Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models ☆40 Updated 2 weeks ago
- Official Code for "Learning to Reason via Mixture-of-Thought for Logical Reasoning" ☆24 Updated 3 months ago
- Unsupervised GRPO ☆43 Updated 2 months ago
- SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward ☆72 Updated 2 weeks ago
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion ☆13 Updated 5 months ago
- ☆35 Updated last month
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts" ☆15 Updated 5 months ago
- High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning ☆47 Updated last month
- Code and data for paper "Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation". ☆18 Updated 3 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework ☆69 Updated 2 months ago
- This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?" ☆28 Updated 8 months ago
- Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better ☆36 Updated 2 months ago
- ☆16 Updated 7 months ago
- ☆30 Updated last month
- ☆23 Updated 2 months ago
- ☆18 Updated 3 months ago
- M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning ☆40 Updated last month
- ☆48 Updated 3 months ago
- SFT+RL boosts multimodal reasoning ☆27 Updated 2 months ago
- MM-Instruct: Generated Visual Instructions for Large Multimodal Model Alignment ☆35 Updated last year
- Quick Long Video Understanding ☆62 Updated 2 months ago
- [EMNLP-2025] ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration ☆49 Updated this week
- ☆53 Updated last week
- (ACL-2025 main conference) Dolphin: Moving Towards Closed-loop Auto-research through Thinking, Practice, and Feedback ☆26 Updated 2 months ago
- [ICML 2025] VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models ☆32 Updated 2 months ago
- CS194-196 Course Project ☆15 Updated 6 months ago
- ☆21 Updated 9 months ago
- ∞-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation ☆16 Updated 6 months ago