asudahkzj / WnetView external linksLinks
Wnet: Audio-Guided Video Object Segmentation via Wavelet-Based Cross-Modal Denoising Networks
☆24Sep 6, 2022Updated 3 years ago
Alternatives and similar repositories for Wnet
Users that are interested in Wnet are comparing it to the libraries listed below
Sorting:
- Referring Video Object Segmentation / Multi-Object Tracking Repo☆90Jul 27, 2023Updated 2 years ago
- [TPAMI 2023] Local-Global Context Aware Transformer for Language-Guided Video Segmentation☆48Jan 20, 2024Updated 2 years ago
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]☆30Mar 13, 2024Updated last year
- [ICCV 2023] OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation☆57Oct 7, 2023Updated 2 years ago
- [NeurIPS 2023] The official implementation of SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation☆33Mar 16, 2024Updated last year
- ☆13Jul 20, 2024Updated last year
- [TCSVT 2024] Temporally Consistent Referring Video Object Segmentation with Hybrid Memory☆19Apr 9, 2025Updated 10 months ago
- ☆19May 27, 2023Updated 2 years ago
- PyTorch implementation of MED-VT: Multiscale Encoder-Decoder Video Transformer with Application to Object Segmentation☆27Oct 22, 2024Updated last year
- Code for the paper "Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation", ECCV 2024☆47Sep 28, 2024Updated last year
- ☆26Oct 8, 2021Updated 4 years ago
- [CVPR2022] Official Implementation of ReferFormer☆352Feb 15, 2025Updated last year
- The official PyTorch implementation of the CVPR 2023 paper "Contrastive Grouping with Transformer for Referring Image Segmentation".☆50Apr 17, 2024Updated last year
- CVPR2022 - Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation☆24Aug 12, 2022Updated 3 years ago
- [ICLR 2025] Knowing Your Target: Target-Aware Transformer Makes Better Spatio-Temporal Video Grounding☆40Mar 18, 2025Updated 10 months ago
- [AAAI 2025] Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.☆25Dec 30, 2024Updated last year
- VITA: Video Instance Segmentation via Object Token Association (NeurIPS 2022)☆105Jan 4, 2024Updated 2 years ago
- Code for the VOST dataset☆26Oct 1, 2023Updated 2 years ago
- Refer-Youtube-VOS dataset☆26Jan 30, 2024Updated 2 years ago
- Official PyTorch implementation of PiClick: Picking the desired mask in click-based interactive segmentation.☆26Jul 2, 2024Updated last year
- ALGM applied to Segmenter☆31May 27, 2024Updated last year
- [CVPRW'23 Best Paper Award] Zero-shot Unsupervised Transfer Instance Segmentation☆24Aug 22, 2023Updated 2 years ago
- [ICCV 2023] Spectrum-guided Multi-granularity Referring Video Object Segmentation.☆111Apr 9, 2025Updated 10 months ago
- The official implementation of our work Hawkeye: Discovering and Grounding Implicit Anomalous Sentiment in Recon-videos via Scene-enhanc…☆12Oct 14, 2024Updated last year
- The official PyTorch implementation of oral paper "FocusCut: Diving into a Focus View in Interactive Segmentation" in CVPR 2022.☆29Dec 18, 2023Updated 2 years ago
- [CVPR 2023] Official implementation of "SAP-DETR: Bridging the Gap between Salient Points and Queries-Based Transformer Detector for Fast…☆30May 28, 2023Updated 2 years ago
- Video Instance Segmentation with a Propose-Reduce Paradigm (ICCV 2021)☆43Aug 5, 2023Updated 2 years ago
- [ECCV 2024 Oral] ActionVOS: Actions as Prompts for Video Object Segmentation☆31Dec 4, 2024Updated last year
- (TIP 2024) Towards Robust Referring Image Segmentation☆36Mar 2, 2024Updated last year
- [CVPR 2024 Challenge] 1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation☆32Oct 18, 2024Updated last year
- Open-vocabulary Semantic Segmentation☆33Feb 16, 2024Updated 2 years ago
- [ICCV 2023] CTVIS: Consistent Training for Online Video Instance Segmentation☆80Oct 15, 2023Updated 2 years ago
- 「AAAI 2024」 Referred by Multi-Modality: A Unified Temporal Transformers for Video Object Segmentation☆82Jun 13, 2025Updated 8 months ago
- The repository of VG-Refiner paper☆17Dec 9, 2025Updated 2 months ago
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking☆13Apr 12, 2023Updated 2 years ago
- Realistic Full-Body Anonymization with Surface-Guided GANs☆39Jun 1, 2023Updated 2 years ago
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆18Jul 10, 2025Updated 7 months ago
- Official Code for VideoLT: Large-scale Long-tailed Video Recognition (ICCV 2021)☆34Apr 9, 2022Updated 3 years ago
- Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23)☆37Jan 1, 2024Updated 2 years ago