[ICCV2025] DeRIS: Decoupling Perception and Cognition for Enhanced Referring Image Segmentation through Loopback Synergy
☆42 · Nov 21, 2025 · Updated 4 months ago
Alternatives and similar repositories for DeRIS
Users that are interested in DeRIS are comparing it to the libraries listed below.
- [ICCV2025] PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination ☆32 · Oct 13, 2025 · Updated 5 months ago
- [AAAI2025 selected as oral] - Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints ☆44 · Jul 2, 2025 · Updated 8 months ago
- [PR2026] Drone Referring Localization: An Efficient Heterogeneous Spatial Feature Interaction Method For UAV Self-Localization ☆87 · Feb 19, 2026 · Updated last month
- paper list on Video Moment Retrieval (VMR), or Temporal Video Grounding (TVG), Video Grounding (VG), or Temporal Sentence Grounding in Vi… ☆36 · Dec 27, 2025 · Updated 2 months ago
- [ICCV 2025] MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation ☆22 · Sep 5, 2025 · Updated 6 months ago
- [ICCV 2025] RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping ☆41 · Nov 21, 2025 · Updated 4 months ago
- Official repository of OS-FPI ☆17 · Dec 22, 2024 · Updated last year
- [NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion ☆101 · Oct 29, 2025 · Updated 4 months ago
- ☆27 · Feb 21, 2025 · Updated last year
- ☆28 · Jul 22, 2024 · Updated last year
- RGBD image to point cloud and visualization ☆11 · Apr 17, 2018 · Updated 7 years ago
- EarthVL: A Progressive Earth Vision-Language Understanding and Generation Framework ☆36 · Jan 22, 2026 · Updated 2 months ago
- [BMVC 2022 Oral] Official PyTorch Implementation of "Open-vocabulary Semantic Segmentation with Frozen Vision-Language Models" https://ar… ☆19 · May 12, 2025 · Updated 10 months ago
- The official livox_driver publishes point clouds from Livox lidars on two kinds of topic: one in CustomMsg, the custom format defined by DJI Livox, and the other in pointcloud2, converted from CustomMsg. See: Livox lidar driver published point cloud formats CustomMsg, PointCloud2… ☆46 · Feb 26, 2024 · Updated 2 years ago
- 「TIP2023」Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments ☆211 · Dec 12, 2025 · Updated 3 months ago
- ☆23 · Sep 22, 2025 · Updated 6 months ago
- [CVPR2026 🌟] The first attempt to Marine Open Vocabulary Instance Segmentation ☆45 · Mar 17, 2026 · Updated last week
- [CVPR 2026] ZoomEarth: Active Perception for Ultra-High-Resolution Geospatial Vision-Language Tasks ☆32 · Mar 10, 2026 · Updated 2 weeks ago
- Paper List on Earth Observation in the Foundation Model Era ☆30 · Mar 15, 2026 · Updated last week
- A LaTeX thesis template for Xi'an University of Architecture and Technology ☆15 · Dec 10, 2024 · Updated last year
- ☆17 · Aug 7, 2024 · Updated last year
- Codebase for paper ToolVQA: A Dataset for Multi-step Reasoning VQA with External Tools ☆30 · Nov 3, 2025 · Updated 4 months ago
- [ECCV2024] FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance ☆17 · Sep 11, 2024 · Updated last year
- (TPAMI 2026) Complementary Text-Guided Attention for Zero-Shot Adversarial Robustness & (NeurIPS 2024) Text-Guided Attention is All Y… ☆16 · Mar 6, 2026 · Updated 2 weeks ago
- [ICLR'26] OF-Diff: Object Fidelity Diffusion for Remote Sensing Image Generation ☆28 · Feb 6, 2026 · Updated last month
- ☆18 · Apr 4, 2025 · Updated 11 months ago
- [RA-L + IROS2024] Learning to place unseen objects stably using large-scale simulation ☆21 · Jun 30, 2024 · Updated last year
- [CVPR'2022, TPAMI'2024] LAVT: Language-Aware Vision Transformer for Referring Segmentation ☆25 · Jan 21, 2025 · Updated last year
- Latest Papers, Codes and Datasets on VTG-LLMs. ☆86 · Nov 17, 2025 · Updated 4 months ago
- [ICCV 2025] Official implementation of "InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models" ☆54 · Feb 10, 2025 · Updated last year
- This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentati… ☆72 · Jun 3, 2024 · Updated last year
- OCID-VLG dataset and baselines ☆22 · Mar 12, 2024 · Updated 2 years ago
- This repo provides the training and testing code for our paper "A Modular Multimodal Architecture for Gaze Target Prediction: Application… ☆25 · Oct 18, 2022 · Updated 3 years ago
- Official Code for "Learning Pedestrian Group Representations for Multi-modal Trajectory Prediction (ECCV 2022)" ☆72 · Jul 16, 2025 · Updated 8 months ago
- Sora Generates Videos with Stunning Geometrical Consistency ☆51 · Mar 24, 2024 · Updated 2 years ago
- Improving One-stage Visual Grounding by Recursive Sub-query Construction, ECCV 2020 ☆90 · Sep 30, 2021 · Updated 4 years ago
- ☆39 · Jun 28, 2023 · Updated 2 years ago
- [CVPR2024] GSVA: Generalized Segmentation via Multimodal Large Language Models ☆164 · Sep 12, 2024 · Updated last year
- Sharingan: A Transformer Architecture for Multi-Person Gaze Following ☆28 · Nov 11, 2024 · Updated last year