jiaqihuang01 / DETRIS
[AAAI-2025] The official code of Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation
☆67May 21, 2025Updated 8 months ago
Alternatives and similar repositories for DETRIS
Users that are interested in DETRIS are comparing it to the libraries listed below
- [EMNLP 2024 Main] MaPPER: Multimodal Prior-guided Parameter Efficient Tuning for Referring Expression Comprehension☆16Jan 6, 2025Updated last year
- Transactions on Multimedia (TMM25)☆19Apr 8, 2025Updated 10 months ago
- [ICCV-2023] The official code of Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation☆137Jun 26, 2025Updated 7 months ago
- ECCV24 "ReMamber: Referring Image Segmentation with Mamba Twister" official repository.☆44Jul 11, 2024Updated last year
- ☆23Aug 20, 2024Updated last year
- Official PyTorch implementation of “MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation”☆18Dec 5, 2024Updated last year
- [CVPR 2025] Hybrid Global-Local Representation with Augmented Spatial Guidance for Zero-Shot Referring Image Segmentation☆31Jun 27, 2025Updated 7 months ago
- Code for "CARIS: Context-Aware Referring Image Segmentation" [ACM MM2023]☆28Nov 28, 2024Updated last year
- (TIP 2024) Towards Robust Referring Image Segmentation☆36Mar 2, 2024Updated last year
- Related papers about Referring Image Segmentation (RIS)☆16Dec 26, 2023Updated 2 years ago
- [ICME 2024 Oral] DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding☆23Feb 26, 2025Updated 11 months ago
- [ECCV 2024] SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation☆49Mar 20, 2025Updated 10 months ago
- ☆13Jul 8, 2024Updated last year
- ☆12Dec 17, 2024Updated last year
- This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentati…☆72Jun 3, 2024Updated last year
- [MM2024] LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention☆12Dec 24, 2024Updated last year
- Segment This Thing is an efficient image segmentation model that uses a biologically-inspired foveated tokenization to reduce inference …☆56Jun 16, 2025Updated 8 months ago
- ☆28Jul 22, 2024Updated last year
- Fine-Grained Pixel-Text Alignment for Open-Vocabulary Semantic Segmentation☆15Sep 24, 2025Updated 4 months ago
- This repo is the official pytorch implementation of the paper: CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-V…☆40Sep 10, 2025Updated 5 months ago
- [ECCV 2024] SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding☆64Oct 22, 2024Updated last year
- [TIP 2025] Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation☆58Dec 22, 2025Updated last month
- ☆14Oct 30, 2023Updated 2 years ago
- [CVPR 2025] Fine-Grained Image-Text Correspondence with Cost Aggregation for Open-Vocabulary Part Segmentation☆22Nov 17, 2025Updated 3 months ago
- Code release for "Segment, Select, Correct: A Framework for Weakly-Supervised Referring Segmentation"☆14Oct 23, 2023Updated 2 years ago
- The official implementation of our paper ''IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Prima…☆20Apr 6, 2025Updated 10 months ago
- [NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion☆100Oct 29, 2025Updated 3 months ago
- ☆30Jun 14, 2024Updated last year
- [CVPR2024] Open-Vocabulary Semantic Segmentation with Image Embedding Balancing☆40Jan 12, 2026Updated last month
- Chain_of_Thoughts_3D_Visual_Grounding☆19Apr 20, 2024Updated last year
- [CVPR2024] GSVA: Generalized Segmentation via Multimodal Large Language Models☆162Sep 12, 2024Updated last year
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆42Aug 4, 2024Updated last year
- (CVPR 2025 Highlight) Official repository of paper "AODRaw: Towards RAW Object Detection in Diverse Conditions" (https://arxiv.org/pdf/24…☆24Apr 6, 2025Updated 10 months ago
- 3D Traffic Light & Sign Dataset☆24Mar 24, 2025Updated 10 months ago
- [ECCV'24] Official PyTorch implementation of In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation☆49Sep 24, 2024Updated last year
- Awesome paper for multi-modal llm with grounding ability☆19Oct 11, 2025Updated 4 months ago
- [ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model☆19Jul 20, 2024Updated last year
- ☆21Jan 17, 2025Updated last year
- ☆45Oct 3, 2023Updated 2 years ago