[ICCV2025] DeRIS: Decoupling Perception and Cognition for Enhanced Referring Image Segmentation through Loopback Synergy
☆44Nov 21, 2025Updated 6 months ago
Alternatives and similar repositories for DeRIS
Users that are interested in DeRIS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICCV2025] PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination☆32Oct 13, 2025Updated 7 months ago
- [AAAI2025 selected as oral] - Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints☆44Jul 2, 2025Updated 10 months ago
- [PR2026] Drone Referring Localization: An Efficient Heterogeneous Spatial Feature Interaction Method For UAV Self-Localization☆90Feb 19, 2026Updated 3 months ago
- paper list on Video Moment Retrieval (VMR), or Temporal Video Grounding (TVG), Video Grounding (VG), or Temporal Sentence Grounding in Vi…☆39Dec 27, 2025Updated 4 months ago
- [ICCV 2025] MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation☆22Sep 5, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [ICCV 2025] RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping☆47Nov 21, 2025Updated 6 months ago
- Official repository of OS-FPI☆17Dec 22, 2024Updated last year
- [NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion☆103Oct 29, 2025Updated 6 months ago
- [CVPR2024] Mask Grounding for Referring Image Segmentation☆29Jul 22, 2024Updated last year
- ☆28Feb 21, 2025Updated last year
- RGBD image to point cloud and visualization☆11Apr 17, 2018Updated 8 years ago
- [CVPR 2026 Highlight] XL-VLA: Cross-Hand Latent Representation for Vision-Language-Action Models☆89Apr 15, 2026Updated last month
- python 语言程序设计基础(第二版) 嵩天 礼欣 黄天羽 著 书上代码☆12Dec 10, 2018Updated 7 years ago
- RS-Paper-Hub: A curated collection of remote sensing papers from arXiv. 遥感论文社:打造遥感领域的专属论文集(如卫星、无人机、地面基站)(http://rspaper.top/)☆41Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- EarthVL: A Progressive Earth Vision-Language Understanding and Generation Framework☆42Jan 22, 2026Updated 4 months ago
- 官方livox_driver驱动livox雷达发出的点云topic有两种,一种是大疆览沃定制的格式CustomMsg格式,另一种是将CustomMsg格式 转换过的pointcloud2格式,参见 Livox雷达驱动程序发布点云格式CustomMsg、PointCloud2…☆46Feb 26, 2024Updated 2 years ago
- 「TIP2023」Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments☆221Dec 12, 2025Updated 5 months ago
- [BMVC 2022 Oral] Official PyTorch Implementation of "Open-vocabulary Semantic Segmentation with Frozen Vision-Language Models" https://ar…☆19May 12, 2025Updated last year
- ☆19Apr 11, 2026Updated last month
- Official repository of the "ReSTR: Convolution-Free Referring Image Segmentation Using Transformers (CVPR'22)"☆14Dec 13, 2024Updated last year
- [AAAI2026 demo] Official repo of “AirNavigation: Let UAV Navigation Tells Its Own Story”☆22Nov 1, 2025Updated 6 months ago
- Paper List on Earth Observation in the Foundation Model Era☆31Apr 12, 2026Updated last month
- A LaTeX thesis template for Xi'an University of Architecture and Technology☆17Dec 10, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [CVPR 2026] ZoomEarth: Active Perception for Ultra-High-Resolution Geospatial Vision-Language Tasks☆37Apr 9, 2026Updated last month
- ☆18Aug 7, 2024Updated last year
- Codebase for paper ToolVQA: A Dataset for Multi-step Reasoning VQA with External Tools☆29Nov 3, 2025Updated 6 months ago
- Encoder Fusion Network with Co-Attention Embedding for Referring Image Segmentation, CVPR2021☆20Aug 17, 2021Updated 4 years ago
- (TPAMI 2026) Complementary Text-Guided Attention for Zero-Shot Adversarial Robustness & & (NeurIPS 2024) Text-Guided Attention is All Y…☆22Mar 23, 2026Updated 2 months ago
- Margin-based Vision Transformer☆69Apr 7, 2026Updated last month
- ☆226May 18, 2026Updated last week
- ☆18May 18, 2026Updated last week
- auto sign cursor☆20Feb 18, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [CVPR'2022, TPAMI'2024] LAVT: Language-Aware Vision Transformer for Referring Segmentation☆25Jan 21, 2025Updated last year
- [RA-L + IROS2024] Learning to place unseen objects stably using large-scale simulation☆21Jun 30, 2024Updated last year
- ☆26Jun 15, 2021Updated 4 years ago
- This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentati…☆73Jun 3, 2024Updated last year
- OCID-VLG dataset and baselines☆25Mar 12, 2024Updated 2 years ago
- This repo provides the training and testing code for our paper "A Modular Multimodal Architecture for Gaze Target Prediction: Application…☆25Oct 18, 2022Updated 3 years ago
- Official Code for "Learning Pedestrian Group Representations for Multi-modal Trajectory Prediction (ECCV 2022)"☆74Jul 16, 2025Updated 10 months ago