[2026 AAAI] Think Before You Segment: An Object-aware Reasoning Agent for Referring Audio-Visual Segmentation
☆20 Nov 8, 2025 Updated 7 months ago
Alternatives and similar repositories for TGS-Agent
Users that are interested in TGS-Agent are comparing it to the libraries listed below.
- [2024 ECCV] Label-anticipated Event Disentanglement for Audio-Visual Video Parsing ☆14 Nov 17, 2024 Updated last year
- [2022 TPAMI] Contrastive Positive Sample Propagation along the Audio-Visual Event Line ☆32 Mar 6, 2023 Updated 3 years ago
- The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024 ☆18 Oct 11, 2024 Updated last year
- [CVPR 2025] 🔥 Official impl. of "Audio-Visual Instance Segmentation". ☆48 Jun 5, 2025 Updated last year
- [2025 CVPR] Towards Open-Vocabulary Audio-Visual Event Localization ☆45 Mar 7, 2025 Updated last year
- The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024 ☆50 Oct 12, 2025 Updated 8 months ago
- ☆18 Nov 15, 2024 Updated last year
- [ICCV2025] All in One: Visual-Description-Guided Unified Point Cloud Segmentation ☆33 Jul 25, 2025 Updated 10 months ago
- [ICCV 2025] Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation ☆92 Sep 29, 2025 Updated 8 months ago
- ACM MM 2022 - PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative Grounding ☆11 Aug 12, 2022 Updated 3 years ago
- [AAAI 2024] AVSegFormer: Audio-Visual Segmentation with Transformer ☆75 Mar 6, 2025 Updated last year
- [CVPR 2025] Crab: A Unified Audio-Visual Scene Understanding Model with Explicit Cooperation ☆85 Dec 24, 2025 Updated 5 months ago
- The official code of GEMINI - Homeomorphism Prior for Representation Learning (accepted by TPAMI 2025) ☆12 Apr 1, 2025 Updated last year
- A collection of Chinese LaTeX templates ☆31 Aug 15, 2018 Updated 7 years ago
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023] ☆31 Mar 13, 2024 Updated 2 years ago
- This repository provides code for "Polyp Segmentation via Semantic Enhanced Perceptual Network" IEEE TCSVT-2024. ☆18 Mar 31, 2026 Updated 2 months ago
- Official Implementation of "Open-Vocabulary Audio-Visual Semantic Segmentation" [ACM MM 2024 Oral]. ☆37 Nov 2, 2024 Updated last year
- Video Reasoning Segmentation ☆27 Nov 29, 2024 Updated last year
- MATLAB implementation for our TPAMI paper: Quanxue Gao;Pu Zhang;Wei Xia;Deyan Xie;Xinbo Gao;Dacheng Tao. Enhanced Tensor RPCA and Its App… ☆17 Aug 12, 2021 Updated 4 years ago
- The Pytorch implementation of "NCAGC: A Neighborhood Contrast Framework for Attributed Graph Clustering" ☆17 Jul 28, 2024 Updated last year
- Wavelet convolutional neural network combines a multi-resolution analysis and convolutional neural network into a single model to achie… ☆10 Feb 19, 2022 Updated 4 years ago
- SmartCLIP: A training method to improve CLIP with both short and long texts ☆42 Jun 18, 2025 Updated 11 months ago
- T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation ☆35 Sep 16, 2025 Updated 8 months ago
- Official code for WACV 2024 paper, "Annotation-free Audio-Visual Segmentation" ☆38 Oct 11, 2024 Updated last year
- Generate power grid dynamic simulation data automatically for machine learning applications using Python and Modelica models. ☆16 Sep 1, 2022 Updated 3 years ago
- The official code of TAMP - Imaging foundation model for universal enhancement of non-ideal measurement CT ☆25 Mar 24, 2026 Updated 2 months ago
- [ICCV 2023] OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation ☆58 Oct 7, 2023 Updated 2 years ago
- Tensorflow 1.13.X implementation for our NN paper: Wei Xia, Sen Wang, Ming Yang, Quanxue Gao, Jungong Han, Xinbo Gao: Multi-view graph em… ☆18 Mar 1, 2022 Updated 4 years ago
- [CVPR2025] Official implementation of RAM ☆29 Nov 4, 2025 Updated 7 months ago
- Pointers to large-scale underwater datasets and relevant resources. ☆11 May 22, 2025 Updated last year
- [CVPR 2024 Highlight] Official implementation of the paper: Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-… ☆40 Apr 20, 2025 Updated last year
- Code for our EMNLP-2022 paper: "Towards Robust Visual Question Answering: Making the Most of Biased Samples via Contrastive Learning" ☆16 Feb 22, 2023 Updated 3 years ago
- MATLAB implementation for our IEEE TC paper: Wei Xia; Xiangdong Zhang; Quanxue Gao; Xiaochuang Shu; Jungong Han; Xinbo Gao. Multiview Sub… ☆19 May 31, 2022 Updated 4 years ago
- [ICML2024] Official PyTorch implementation of CoMC: Language-Driven Cross-Modal Classifier for Zero-Shot Multi-Label Image Recognition ☆17 Jul 9, 2024 Updated last year
- [AAAI 26 Demo] Official repo for CAT-V - Caption Anything in Video: Object-centric Dense Video Captioning with Spatiotemporal Multimodal P… ☆66 Jan 27, 2026 Updated 4 months ago
- [2021 CVPR] Positive Sample Propagation along the Audio-Visual Event Line ☆42 Jul 5, 2022 Updated 3 years ago
- ☆24 Apr 9, 2024 Updated 2 years ago
- Panning for gold in 💩 ☆49 Updated this week
- Practical New Tasks and Inspiring Modeling Solutions for Diverse Open Vision Problems ☆139 Oct 2, 2025 Updated 8 months ago