[ICCV2025] DeRIS: Decoupling Perception and Cognition for Enhanced Referring Image Segmentation through Loopback Synergy
☆46Nov 21, 2025Updated 7 months ago
Alternatives and similar repositories for DeRIS
Users that are interested in DeRIS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [AAAI2025 selected as oral] - Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints☆45Jul 2, 2025Updated last year
- [PR2026] Drone Referring Localization: An Efficient Heterogeneous Spatial Feature Interaction Method For UAV Self-Localization☆94Feb 19, 2026Updated 4 months ago
- paper list on Video Moment Retrieval (VMR), or Temporal Video Grounding (TVG), Video Grounding (VG), or Temporal Sentence Grounding in Vi…☆41Dec 27, 2025Updated 6 months ago
- [ICCV 2025] MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation☆22Sep 5, 2025Updated 10 months ago
- [ICCV 2025] RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping☆49Nov 21, 2025Updated 7 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Official repository of OS-FPI☆17Dec 22, 2024Updated last year
- [NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion☆103Oct 29, 2025Updated 8 months ago
- [CVPR2024] Mask Grounding for Referring Image Segmentation☆29Jul 22, 2024Updated last year
- ☆28Feb 21, 2025Updated last year
- [CVPR 2026 Highlight] XL-VLA: Cross-Hand Latent Representation for Vision-Language-Action Models☆98Apr 15, 2026Updated 2 months ago
- python 语言程序设计基础(第二版) 嵩天 礼欣 黄天羽 著 书上代码☆12Dec 10, 2018Updated 7 years ago
- EarthVL: A Progressive Earth Vision-Language Understanding and Generation Framework☆43Jan 22, 2026Updated 5 months ago
- 官方livox_driver驱动livox雷达发出的点云topic有两种,一种是大疆览沃定制的格式CustomMsg格式,另一种是将CustomMsg格式 转换过的pointcloud2格式,参见 Livox雷达驱动程序发布点云格式CustomMsg、PointCloud2…☆47Feb 26, 2024Updated 2 years ago
- [BMVC 2022 Oral] Official PyTorch Implementation of "Open-vocabulary Semantic Segmentation with Frozen Vision-Language Models" https://ar…☆19May 12, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆19Apr 11, 2026Updated 2 months ago
- ☆31Sep 22, 2025Updated 9 months ago
- Official repository of the "ReSTR: Convolution-Free Referring Image Segmentation Using Transformers (CVPR'22)"☆15Dec 13, 2024Updated last year
- [AAAI2026 demo] Official repo of “AirNavigation: Let UAV Navigation Tells Its Own Story”☆22Nov 1, 2025Updated 8 months ago
- Paper List on Earth Observation in the Foundation Model Era☆31Jun 15, 2026Updated 2 weeks ago
- [CVPR 2026] ZoomEarth: Active Perception for Ultra-High-Resolution Geospatial Vision-Language Tasks☆40Jun 18, 2026Updated 2 weeks ago
- ☆18Aug 7, 2024Updated last year
- 「TCSVT2021」A Transformer-Based Feature Segmentation and Region Alignment Method For UAV-View Geo-Localization☆123Mar 7, 2024Updated 2 years ago
- [ECCV2024]FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance☆18Sep 11, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Margin-based Vision Transformer☆69Apr 7, 2026Updated 2 months ago
- ☆18May 18, 2026Updated last month
- auto sign cursor☆20Feb 18, 2025Updated last year
- [CVPR'2022, TPAMI'2024] LAVT: Language-Aware Vision Transformer for Referring Segmentation☆26Jan 21, 2025Updated last year
- [ICLR'26] OF-Diff: Object Fidelity Diffusion for Remote Sensing Image Generation☆37Feb 6, 2026Updated 4 months ago
- [ICCV 2025] Official implementation of "InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models"☆56Feb 10, 2025Updated last year
- ☆26Jun 15, 2021Updated 5 years ago
- This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentati…☆74Jun 3, 2024Updated 2 years ago
- A codebase for flexible and efficient Image Text Representation Alignment☆24Jun 20, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- OCID-VLG dataset and baselines☆25Mar 12, 2024Updated 2 years ago
- This repo provides the training and testing code for our paper "A Modular Multimodal Architecture for Gaze Target Prediction: Application…☆25Oct 18, 2022Updated 3 years ago
- Official Code for "Learning Pedestrian Group Representations for Multi-modal Trajectory Prediction (ECCV 2022)"☆75Jul 16, 2025Updated 11 months ago
- Latest Papers, Codes and Datasets on VTG-LLMs.☆94Jun 12, 2026Updated 3 weeks ago
- Sora Generates Videos with Stunning Geometrical Consistency☆51Mar 24, 2024Updated 2 years ago
- ☆92Apr 15, 2022Updated 4 years ago
- Improving One-stage Visual Grounding by Recursive Sub-query Construction, ECCV 2020☆91Sep 30, 2021Updated 4 years ago