A lightweight codebase for referring expression comprehension and segmentation
☆57May 21, 2022Updated 3 years ago
Alternatives and similar repositories for SimREC
Users who are interested in SimREC are comparing it to the libraries listed below.
- RefTeacher is a strong baseline method for Semi-Supervised Referring Expression Comprehension.☆13May 26, 2023Updated 2 years ago
- ☆39Jun 28, 2023Updated 2 years ago
- Official Implementation for paper "Referring Transformer: A One-step Approach to Multi-task Visual Grounding" NeurIPS 2021☆67May 26, 2022Updated 3 years ago
- [ACM MM 22] Correspondence Matters for Video Referring Expression Comprehension☆15Sep 4, 2022Updated 3 years ago
- ☆87Apr 15, 2022Updated 3 years ago
- Preliminary code for reviewers☆13Mar 30, 2021Updated 4 years ago
- SeqTR: A Simple yet Universal Network for Visual Grounding☆144Oct 30, 2024Updated last year
- ☆225Apr 13, 2023Updated 2 years ago
- [NeurIPS 2023] Official implementations of "Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models"☆523Jan 27, 2024Updated 2 years ago
- ☆14Jul 13, 2021Updated 4 years ago
- The official PyTorch code for "Relation-aware Instance Refinement for Weakly Supervised Visual Grounding" accepted by CVPR2021☆27Oct 9, 2021Updated 4 years ago
- [AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding☆57Nov 28, 2022Updated 3 years ago
- Official implementation of "Towards Efficient Visual Adaption via Structural Re-parameterization".☆188Apr 18, 2024Updated last year
- Encoder Fusion Network with Co-Attention Embedding for Referring Image Segmentation, CVPR2021☆19Aug 17, 2021Updated 4 years ago
- Gender/Age attribute grounding in a weakly supervised manner.☆12Jun 23, 2019Updated 6 years ago
- Iterative shrinking for referring expression grounding using deep reinforcement learning☆14Nov 27, 2021Updated 4 years ago
- ☆196Feb 27, 2024Updated 2 years ago
- A collection of papers about Referring Image Segmentation.☆810Jan 28, 2026Updated last month
- MAttNet: Modular Attention Network for Referring Expression Comprehension☆298Nov 29, 2022Updated 3 years ago
- [CVPR 2023] Official implementation of "SAP-DETR: Bridging the Gap between Salient Points and Queries-Based Transformer Detector for Fast…☆30May 28, 2023Updated 2 years ago
- [CVPR 2023] Code for "Improving Visual Grounding by Encouraging Consistent Gradient-based Explanations"☆19Oct 10, 2023Updated 2 years ago
- ☆20Apr 2, 2024Updated last year
- awesome grounding: A curated list of research papers in visual grounding☆1,125Sep 21, 2025Updated 5 months ago
- (TIP 2024) Towards Robust Referring Image Segmentation☆36Mar 2, 2024Updated 2 years ago
- A curated list of research papers in Referring Expression Comprehension (REC)☆46May 13, 2021Updated 4 years ago
- ☆65Oct 11, 2023Updated 2 years ago
- A summary of zero-shot image recognition methods, from the perspective of element-wise representation and reasoning, covering public…☆20Oct 12, 2024Updated last year
- Referring Expression Datasets API☆561Aug 27, 2024Updated last year
- Code for CVPR23 Highlight "I2MVFormer: Large Language Model Generated Multi-View Document Supervision for Zero-Shot Image Classification"…☆20Aug 1, 2023Updated 2 years ago
- ☆1,047Oct 3, 2022Updated 3 years ago
- Source code of our MGPN in SIGIR 2022☆18Jun 8, 2022Updated 3 years ago
- A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning☆26Jan 20, 2022Updated 4 years ago
- The project is an official implementation of our paper "RSGNet: Relation based Skeleton Graph Network for Crowded Scenes Pose Estimation…☆10Dec 9, 2020Updated 5 years ago
- [ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"☆50Jan 30, 2026Updated last month
- Referring Image Segmentation Benchmarking with Segment Anything Model (SAM)☆38Apr 7, 2023Updated 2 years ago
- [CVPR 2023] Official code for "Zero-shot Referring Image Segmentation with Global-Local Context Features"☆129Mar 17, 2025Updated last year
- [AAAI 2024] GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval☆20May 10, 2024Updated last year
- An official implementation for MS-DETR in ACL'23☆17Jun 3, 2023Updated 2 years ago
- [CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding☆153Jul 13, 2024Updated last year