henghuiding / Vision-Language-Transformer
[ICCV2021 & TPAMI2023] Vision-Language Transformer and Query Generation for Referring Segmentation
☆359Updated 3 years ago
Alternatives and similar repositories for Vision-Language-Transformer
Users who are interested in Vision-Language-Transformer are comparing it to the libraries listed below.
- A benchmark dataset for GRES and GREC [CVPR2023 Highlight]☆238Updated 2 years ago
- [ICCV 2023] MOSE: A New Dataset for Video Object Segmentation in Complex Scenes☆358Updated last month
- [CVPR2023 Highlight] GRES: Generalized Referring Expression Segmentation☆692Updated 2 years ago
- [ICCV 2023 & TPAMI 2025] MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions☆520Updated 2 months ago
- [ICCV'23 Oral] The introduction and toolkit for EqBen Benchmark☆123Updated last year
- ☆158Updated 2 years ago
- [CVPR-2024] Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation☆85Updated last year
- [CVPR-2023] Primitive Generation and Semantic-related Alignment for Universal Zero-Shot Segmentation☆189Updated 2 years ago
- Multimodal Referring Segmentation☆169Updated last month
- [CVPR 2022 Oral] Towards Fewer Annotations: Active Learning via Region Impurity and Prediction Uncertainty for Domain Adaptive Semantic S…☆152Updated 2 years ago
- [CVPR-2023] Semantic-Promoted Debiasing and Background Disambiguation for Zero-Shot Instance Segmentation☆18Updated 2 years ago
- [NeurIPS 2025] Composed Person Retrieval (CPR) is a new cross-modal retrieval task that aims to identify individuals in large-scale perso…☆63Updated last week
- [TPAMI 2023 ESI Highly Cited Paper] SePiCo: Semantic-Guided Pixel Contrast for Domain Adaptive Semantic Segmentation https://arxiv.org/ab…☆119Updated last year
- HRViT ("Multi-Scale High-Resolution Vision Transformer for Semantic Segmentation"), CVPR 2022.☆197Updated 3 years ago
- [TIP-2023] Prototype Adaption and Projection for Few- and Zero-shot 3D Point Cloud Semantic Segmentation☆81Updated 2 years ago
- [ICCV2019] Boundary-Aware Feature Propagation for Scene Segmentation☆79Updated 5 years ago
- This is an official implementation of our NeurIPS 22 paper “QueryPose: Sparse Multi-Person Pose Regression via Spatial-Aware Part-Level Qu…☆49Updated 2 years ago
- [CVPR2023 Highlight] Consistent-Teacher: Towards Reducing Inconsistent Pseudo-targets in Semi-supervised Object Detection☆312Updated 2 years ago
- Awesome weakly-supervised image semantic segmentation: scribble, bounding box, point, image tag, and heterogeneous combinations of them. 2016-2025☆170Updated this week
- [ECCV2022] Factorizing Knowledge in Neural Networks☆91Updated 3 years ago
- Official pytorch implementation of paper "Inception Convolution with Efficient Dilation Search" (CVPR 2021 Oral).☆109Updated 3 years ago
- [ICCV 2025] MOVE: Motion-Guided Few-Shot Video Object Segmentation☆77Updated last month
- [CVPR'2022] SAM-DETR & SAM-DETR++: Official PyTorch Implementation☆298Updated 3 years ago
- A curated list of Causality in Computer Vision☆243Updated 4 years ago
- [NeurIPS2022] Deep Model Reassembly☆253Updated 2 years ago
- [CVPR 2023 (Highlight)] Official implementation of the paper "RepMode: Learning to Re-parameterize Diverse Experts for Subcellular Structu…☆165Updated 2 years ago
- Referring Video Object Segmentation / Multi-Object Tracking Repo☆88Updated 2 years ago
- ☆420Updated last year
- [ACM MM-2024] RefMask3D: Language-Guided Transformer for 3D Referring Segmentation☆65Updated last year
- ☆214Updated 2 years ago