[2026 AAAI] Think Before You Segment: An Object-aware Reasoning Agent for Referring Audio-Visual Segmentation
☆19Nov 8, 2025Updated 4 months ago
Alternatives and similar repositories for TGS-Agent
Users who are interested in TGS-Agent are comparing it to the libraries listed below.
- [2024 ECCV] Label-anticipated Event Disentanglement for Audio-Visual Video Parsing☆14Nov 17, 2024Updated last year
- [2022 TPAMI] Contrastive Positive Sample Propagation along the Audio-Visual Event Line☆32Mar 6, 2023Updated 3 years ago
- [CVPR 2025] 🔥 Official impl. of "Audio-Visual Instance Segmentation".☆45Jun 5, 2025Updated 9 months ago
- The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024☆18Oct 11, 2024Updated last year
- [2025 CVPR] Towards Open-Vocabulary Audio-Visual Event Localization☆42Mar 7, 2025Updated last year
- The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024☆50Oct 12, 2025Updated 5 months ago
- [ICCV2025] All in One: Visual-Description-Guided Unified Point Cloud Segmentation☆28Jul 25, 2025Updated 7 months ago
- ☆18Nov 15, 2024Updated last year
- [ICCV 2025] Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation☆87Sep 29, 2025Updated 5 months ago
- ACM MM 2022 - PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative Grounding☆11Aug 12, 2022Updated 3 years ago
- [AAAI 2024] AVSegFormer: Audio-Visual Segmentation with Transformer☆73Mar 6, 2025Updated last year
- [CVPR 2025] Crab: A Unified Audio-Visual Scene Understanding Model with Explicit Cooperation☆83Dec 24, 2025Updated 3 months ago
- This repository provides code for "Polyp Segmentation via Semantic Enhanced Perceptual Network" IEEE TCSVT-2024.☆15Dec 26, 2025Updated 2 months ago
- The official code of GEMINI - Homeomorphism Prior for Representation Learning (accepted by TPAMI 2025)☆12Apr 1, 2025Updated 11 months ago
- A collection of Chinese LaTeX templates☆28Aug 15, 2018Updated 7 years ago
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]☆30Mar 13, 2024Updated 2 years ago
- Official Implementation of "Open-Vocabulary Audio-Visual Semantic Segmentation" [ACM MM 2024 Oral].☆35Nov 2, 2024Updated last year
- Video Reasoning Segmentation☆28Nov 29, 2024Updated last year
- MATLAB implementation for our TPAMI paper: Quanxue Gao;Pu Zhang;Wei Xia;Deyan Xie;Xinbo Gao;Dacheng Tao. Enhanced Tensor RPCA and Its App…☆14Aug 12, 2021Updated 4 years ago
- The Pytorch implementation of "NCAGC: A Neighborhood Contrast Framework for Attributed Graph Clustering"☆17Jul 28, 2024Updated last year
- SmartCLIP: A training method to improve CLIP with both short and long texts☆40Jun 18, 2025Updated 9 months ago
- Wavelet convolutional neural network combines a multi-resolution analysis and convolutional neural network into a single model to achie…☆10Feb 19, 2022Updated 4 years ago
- T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation☆36Sep 16, 2025Updated 6 months ago
- Official code for WACV 2024 paper, "Annotation-free Audio-Visual Segmentation"☆38Oct 11, 2024Updated last year
- Generate power grid dynamic simulation data automatically for machine learning applications using Python and Modelica models.☆16Sep 1, 2022Updated 3 years ago
- The official code of TAMP - Imaging foundation model for universal enhancement of non-ideal measurement CT☆24Dec 15, 2025Updated 3 months ago
- [ICCV 2023] OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation☆58Oct 7, 2023Updated 2 years ago
- Tensorflow 1.13.X implementation for our NN paper: Wei Xia, Sen Wang, Ming Yang, Quanxue Gao, Jungong Han, Xinbo Gao: Multi-view graph em…☆17Mar 1, 2022Updated 4 years ago
- [CVPR2025] Official implementation of RAM☆29Nov 4, 2025Updated 4 months ago
- Pointers to large-scale underwater datasets and relevant resources.☆13May 22, 2025Updated 10 months ago
- [CVPR 2024 Highlight] Official implementation of the paper: Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-…☆40Apr 20, 2025Updated 11 months ago
- Code for our EMNLP-2022 paper: "Towards Robust Visual Question Answering: Making the Most of Biased Samples via Contrastive Learning"☆16Feb 22, 2023Updated 3 years ago
- [AAAI 26 Demo] Official repo for CAT-V - Caption Anything in Video: Object-centric Dense Video Captioning with Spatiotemporal Multimodal P…☆65Jan 27, 2026Updated last month
- [2021 CVPR] Positive Sample Propagation along the Audio-Visual Event Line☆42Jul 5, 2022Updated 3 years ago
- MATLAB implementation for our IEEE TC paper: Wei Xia; Xiangdong Zhang; Quanxue Gao; Xiaochuang Shu; Jungong Han; Xinbo Gao. Multiview Sub…☆19May 31, 2022Updated 3 years ago
- [ICML2024] Official PyTorch implementation of CoMC: Language-Driven Cross-Modal Classifier for Zero-Shot Multi-Label Image Recognition☆16Jul 9, 2024Updated last year
- Panning for gold in 💩☆44Updated this week
- ☆24Apr 9, 2024Updated last year
- Practical New Tasks and Inspiring Modeling Solutions for Diverse Open Vision Problems☆139Oct 2, 2025Updated 5 months ago