TencentARC / SGAT4PASS
This is the official implementation of the paper SGAT4PASS: Spherical Geometry-Aware Transformer for PAnoramic Semantic Segmentation (IJCAI 2023)
☆23Updated last year
Related projects: ⓘ
- Official implementation for "Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter"☆25Updated 8 months ago
- [ICCV2023] DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models☆150Updated 10 months ago
- [CVPR 2024] Official implementation of "Universal Segmentation at Arbitrary Granularity with Language Instruction"☆75Updated 6 months ago
- Official implementation of SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference☆110Updated 8 months ago
- Code of our CVPR2024 paper - DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data☆38Updated 5 months ago
- Official Repo for PosSAM: Panoptic Open-vocabulary Segment Anything☆41Updated 5 months ago
- ☆163Updated 6 months ago
- [CVPR-W 2023] Official Implementation of One-shot Unsupervised Domain Adaptation with Personalized Diffusion Models☆72Updated 8 months ago
- AI-Generated Images as Data Source: The Dawn of Synthetic Era☆141Updated 9 months ago
- Official PyTorch implementation of GeoDiffusion in ICLR 2024 (https://arxiv.org/abs/2306.04607)☆58Updated 4 months ago
- ☆104Updated 3 months ago
- ☆14Updated 9 months ago
- This work is accepted by CVPR2023☆36Updated last year
- [ICCV 2023] Official code release of our paper "Referring Image Segmentation Using Text Supervision"☆58Updated 3 months ago
- Diffusion-TTA improves pre-trained discriminative models such as image classifiers or segmentors using pre-trained generative models.☆49Updated 5 months ago
- [ICCV 2023] Phasic Content Fusing Diffusion Model with Directional Distribution Consistency for Few-Shot Model Adaption☆50Updated 9 months ago
- [CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"☆45Updated 4 months ago
- [Arxiv 2024] Official code for Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation☆14Updated 2 months ago
- Open-vocabulary Object Segmentation with Diffusion Models☆168Updated last year
- [ICCV-2023] The official code of Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation☆93Updated 7 months ago
- Looking 3D: Anomaly Detection with 2D-3D Alignment (CVPR24)☆17Updated last month
- Official implementation of 'CLIP-DINOiser: Teaching CLIP a few DINO tricks' paper.☆168Updated 2 months ago
- [ICCV 2023] CLIP2Point: Transfer CLIP to Point Cloud Classification with Image-Depth Pre-training☆97Updated 8 months ago
- MADTP: Multimodal Alignment-Guided Dynamic Token Pruning for Accelerating Vision-Language Transformer☆24Updated 2 weeks ago
- [CVPR'24] Neural Clustering based Visual Representation Learning☆31Updated 5 months ago
- ☆54Updated last year
- DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention☆106Updated 3 months ago
- [CVPR2024] GSVA: Generalized Segmentation via Multimodal Large Language Models☆58Updated last week
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction☆161Updated 7 months ago
- Official Implementation for ICCV'23 paper Coarse-to-Fine Amodal Segmentation with Shape Prior (C2F-Seg).☆40Updated 8 months ago