XenoZLH / Shuffle-R1Links
Official code repository of Shuffle-R1
☆18Updated this week
Alternatives and similar repositories for Shuffle-R1
Users that are interested in Shuffle-R1 are comparing it to the libraries listed below
Sorting:
- The official implementation of "PixelThink: Towards Efficient Chain-of-Pixel Reasoning" (arXiv 2025)☆36Updated 2 months ago
- [ICCV 23] A Simple Vision Transformer for Weakly Semi-supervised 3D Object Detection☆12Updated last year
- ☆23Updated 4 months ago
- Official repository of the paper "High-Quality Mask Tuning Matters for Open-Vocabulary Segmentation"☆38Updated 4 months ago
- The first decoder-only multimodal state space model☆95Updated 2 months ago
- [ICCV 2025] GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding☆66Updated last month
- [ICCV 2025] MOVE: Motion-Guided Few-Shot Video Object Segmentation☆14Updated 2 weeks ago
- [NeurIPS 2024 Oral] RG-SAN: Rule-Guided Spatial Awareness Network for End-to-End 3D Referring Expression Segmentation☆18Updated 7 months ago
- [IJCV 2024]☆16Updated 9 months ago
- Official implementation of "Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness".☆47Updated 3 weeks ago
- This is a PyTorch implementation of MCLN proposed by our paper "Multi-branch Collaborative Learning Network for 3D Visual Grounding"(ECCV…☆20Updated 10 months ago
- Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better☆36Updated last month
- [ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression☆45Updated 10 months ago
- CrossLMM: Decoupling Long Video Sequences from LMMs via Dual Cross-Attention Mechanisms☆23Updated 2 months ago
- 🚀 Video Compression Commander: Plug-and-Play Inference Acceleration for Video Large Language Models☆28Updated 2 months ago
- ☆46Updated 2 months ago
- Offical implementation of "Re-Aligning Language to Visual Objects with an Agentic Workflow"☆27Updated 3 months ago
- ☆12Updated 6 months ago
- [CVPR 2025] Official PyTorch Implementation of GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmenta…☆48Updated last month
- ☆36Updated last month
- ☆12Updated 8 months ago
- [AAAI 2024] The official implementation of the paper "3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Refer…☆42Updated last year
- [CVPR 2025 Highlight🔥] Official code repository for "Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuni…☆103Updated last week
- This is the project for 'USG'.☆22Updated 4 months ago
- [ICCV'25] "Harnessing Uncertainty-aware Bounding Boxes for Unsupervised 3D Object Detection".☆19Updated 9 months ago
- [MM2024 Oral] 3D-GRES: Generalized 3D Referring Expression Segmentation☆37Updated 7 months ago
- ☆30Updated last year
- Make Large Multimodal Models excel in object detection, ICCV 2025☆33Updated last week
- [ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model☆18Updated last year
- [AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention☆110Updated last year