limezc / Awesome-CVPR-2025-PapersLinks
☆16Updated 3 months ago
Alternatives and similar repositories for Awesome-CVPR-2025-Papers
Users that are interested in Awesome-CVPR-2025-Papers are comparing it to the libraries listed below
Sorting:
- [CVPR'2022, TPAMI'2024] LAVT: Language-Aware Vision Transformer for Referring Segmentation☆22Updated 8 months ago
- Official repository of the paper "High-Quality Mask Tuning Matters for Open-Vocabulary Segmentation"☆40Updated 6 months ago
- [ICCV25 Oral] Token Activation Map to Visually Explain Multimodal LLMs☆82Updated last month
- Official code implementation of Perception R1: Pioneering Perception Policy with Reinforcement Learning☆261Updated 2 months ago
- 😎 A curated list of CVPR 2025 Oral paper. Total 96☆48Updated 2 months ago
- Vision Manus: Your versatile Visual AI assistant☆276Updated last month
- [CVPR 2025] Official implementation of paper "MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders".☆40Updated 4 months ago
- CVPR 2025 Parper Collections☆41Updated 6 months ago
- ☆56Updated 6 months ago
- The official implementation of [CVPR 2025] "5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks".☆364Updated 3 months ago
- A vue-based project page template for academic papers. (in development) https://junyaohu.github.io/academic-project-page-template-vue☆292Updated 2 months ago
- [ICLR2025] Text4Seg: Reimagining Image Segmentation as Text Generation☆145Updated 3 weeks ago
- [ICLR 2025 Spotlight] Multimodality Helps Few-shot 3D Point Cloud Semantic Segmentation☆52Updated 5 months ago
- 😎 up-to-date & curated list of awesome 3D Visual Grounding papers, methods & resources.☆222Updated last week
- The first decoder-only multimodal state space model☆97Updated 4 months ago
- ☆107Updated 9 months ago
- [AAAI 2025] AL-Ref-SAM 2: Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video…☆88Updated 9 months ago
- X-SAM: From Segment Anything to Any Segmentation☆281Updated last week
- [CVPR2024] GSVA: Generalized Segmentation via Multimodal Large Language Models☆149Updated last year
- [CVPR 2025] DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception☆126Updated 3 months ago
- [ICCV2025] Official repository of the paper "Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary…☆83Updated this week
- [CVPR2024] Official Pytorch Implementation of SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation.☆180Updated last year
- [NeurIPS 2025]⭐️ Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning.☆210Updated this week
- [CVPR 2025 Highlight🔥] Official code repository for "Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuni…☆113Updated 2 months ago
- [CVPR 2025] DFormerv2: Geometry Self-Attention for RGBD Semantic Segmentation & [ICLR 2024] DFormer & [NeuriPS 2025] OmniSegmentor☆364Updated 2 weeks ago
- [NeurIPS 2025] Official repository for “FlowCut: Rethinking Redundancy via Information Flow for Efficient Vision-Language Models”☆23Updated 2 weeks ago
- [CVPR 2025] A Unified Image-Dense Annotation Generation Model for Underwater Scenes☆43Updated 5 months ago
- [AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention☆114Updated last year
- [TPAMI 2025] Towards Visual Grounding: A Survey☆236Updated last month
- Collection of Highlight papers☆41Updated last year