henghuiding / Vision-Language-TransformerLinks
[ICCV2021 & TPAMI2023] Vision-Language Transformer and Query Generation for Referring Segmentation
☆358Updated 3 years ago
Alternatives and similar repositories for Vision-Language-Transformer
Users that are interested in Vision-Language-Transformer are comparing it to the libraries listed below
Sorting:
- A benchmark dataset for GRES and GREC [CVPR2023 Highlight]☆235Updated 2 years ago
- [ICCV 2023] MOSE: A New Dataset for Video Object Segmentation in Complex Scenes☆356Updated last year
- [ICCV 2023] MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions☆518Updated last month
- [CVPR2023 Highlight] GRES: Generalized Referring Expression Segmentation☆689Updated 2 years ago
- [ICCV'23 Oral] The introduction and toolkit for EqBen Benchmark☆124Updated last year
- ☆158Updated 2 years ago
- [CVPR-2024] Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation☆86Updated last year
- Multimodal Referring Segmentation☆136Updated last week
- [CVPR 2022 Oral] Towards Fewer Annotations: Active Learning via Region Impurity and Prediction Uncertainty for Domain Adaptive Semantic S…☆150Updated last year
- [CVPR-2023] Primitive Generation and Semantic-related Alignment for Universal Zero-Shot Segmentation☆190Updated 2 years ago
- [CVPR-2023] Semantic-Promoted Debiasing and Background Disambiguation for Zero-Shot Instance Segmentation☆18Updated 2 years ago
- [TPAMI 2023 ESI Highly Cited Paper] SePiCo: Semantic-Guided Pixel Contrast for Domain Adaptive Semantic Segmentation https://arxiv.org/ab…☆119Updated last year
- [TIP-2023] Prototype Adaption and Projection for Few- and Zero-shot 3D Point Cloud Semantic Segmentation☆80Updated 2 years ago
- [ICCV2019] Boundary-Aware Feature Propagation for Scene Segmentation☆79Updated 5 years ago
- [ECCV2022] Factorizing Knowledge in Neural Networks☆89Updated 3 years ago
- [NeurIPS2022] Deep Model Reassembly☆252Updated 2 years ago
- This is an official implementation of our NeurIPS 22 paper“QueryPose: Sparse Multi-Person Pose Regression via Spatial-Aware Part-Level Qu…☆49Updated 2 years ago
- Awesome weakly-supervised image semantic segmentation;scribble,bounding box, point, image tag, and heterogeneous of them. 2016-2025☆167Updated 3 weeks ago
- [CVPR2023 Highlight] Consistent-Teacher: Towards Reducing Inconsistent Pseudo-targets in Semi-supervised Object Detection☆311Updated 2 years ago
- HRViT ("Multi-Scale High-Resolution Vision Transformer for Semantic Segmentation"), CVPR 2022.☆196Updated 3 years ago
- ☆410Updated last year
- ☆173Updated last year
- A curated list of Causality in Computer Vision☆242Updated 3 years ago
- Official pytorch implementation of paper "Inception Convolution with Efficient Dilation Search" (CVPR 2021 Oral).☆109Updated 3 years ago
- [CVPR 2023 (Highlight)] Offical implementation of the paper "RepMode: Learning to Re-parameterize Diverse Experts for Subcellular Structu…☆166Updated last year
- Official Implementation for paper "Referring Transformer: A One-step Approach to Multi-task Visual Grounding" Neurips 2021☆68Updated 3 years ago
- ☆61Updated 3 years ago
- A lightweight codebase for referring expression comprehension and segmentation☆55Updated 3 years ago
- ☆211Updated 2 years ago
- SeqTR: A Simple yet Universal Network for Visual Grounding☆144Updated 10 months ago