lxa9867/QSD

[CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"

☆12

Alternatives and similar repositories for QSD

Users interested in QSD are comparing it to the repositories listed below.

lxa9867 / R2VOS
Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]
☆30 · Mar 13, 2024 · Updated 2 years ago
lxa9867 / PaintSeg
[NeurIPS 2023] Official Implementation of "PaintSeg: Painting Pixels for Training-free Segmentation"
☆14 · Dec 31, 2023 · Updated 2 years ago
OpenNLPLab / MMVAE-AVS
Multimodal Variational Auto-encoder based Audio-Visual Segmentation [ICCV2023].
☆20 · Sep 19, 2024 · Updated last year
jinxiang-liu / anno-free-AVS
Official code for WACV 2024 paper, "Annotation-free Audio-Visual Segmentation"
☆38 · Oct 11, 2024 · Updated last year
Xiaohao-Xu / Ambiguity-in-Space
[ECCV 2026] One Scene, Two Depths: Probing Geometric Ambiguity in Monocular Foundation Models (Layered 3D Spatial Understanding)
☆23 · Jul 10, 2026 · Updated 2 weeks ago
GeWu-Lab / Stepping-Stones
The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024
☆18 · Oct 11, 2024 · Updated last year
yannqi / COMBO-AVS
[CVPR 2024 Highlight] Official implementation of the paper: Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-…
☆40 · Apr 20, 2025 · Updated last year
GeWu-Lab / Ref-AVS
The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024
☆50 · Oct 12, 2025 · Updated 9 months ago
qiuk2 / AAR
[Official Implementation] Acoustic Autoregressive Modeling 🔥
☆74 · Aug 24, 2024 · Updated last year
cyh-0 / CAVP
Official code for "A Closer Look at Audio-Visual Segmentation"
☆97 · Oct 31, 2025 · Updated 8 months ago
stoneMo / SLAVC
Official Codebase of "A Closer Look at Weakly-Supervised Audio-Visual Source Localization" (NeurIPS 2022)
☆21 · Dec 6, 2022 · Updated 3 years ago
ChangyaoTian / ADDP
The official implementation of ADDP (ICLR 2024)
☆12 · Mar 27, 2024 · Updated 2 years ago
shuheikurita / RefEgo
☆13 · Jul 20, 2024 · Updated 2 years ago
Xiaohao-Xu / MAC-Ego3D
[CVPR 2025] MAC-Ego3D: Multi-Agent Gaussian Consensus for Real-Time Collaborative Ego-Motion and Photorealistic 3D Reconstruction
☆75 · Apr 10, 2025 · Updated last year
ttgeng233 / UnAV
Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline (CVPR 2023)
☆73 · Jan 4, 2026 · Updated 6 months ago
ywyeli / lidar-camera-placement
[ICRA'24] Influence of Camera-LiDAR Configuration on 3D Object Detection for Autonomous Driving
☆21 · Sep 14, 2024 · Updated last year
hxixixh / mix-and-localize
☆23 · Mar 20, 2024 · Updated 2 years ago
Xiaohao-Xu / SLAM-under-Perturbation
[ICLR 2025] Scalable Benchmarking and Robust Learning for Noise-Free Ego-Motion and 3D Reconstruction from Noisy Video
☆58 · Nov 29, 2025 · Updated 7 months ago
lxtGH / TemporalPyramidRouting
Temporal Pyramid Routing for Video Instance Segmentation (T-PAMI 2022)
☆25 · Jul 6, 2023 · Updated 3 years ago
cyh-0 / BoMD
Official code for "BoMD: Bag of Multi-label Descriptors for Noisy Chest X-ray Classification"
☆26 · Apr 11, 2024 · Updated 2 years ago
SII-Ferenas / PGSeg
This is the official code of "Uncovering Prototypical Knowledge for Weakly Open-Vocabulary Semantic Segmentation, NeurIPS 23"
☆27 · Dec 7, 2023 · Updated 2 years ago
bo-miao / SgMg
[ICCV 2023] Spectrum-guided Multi-granularity Referring Video Object Segmentation.
☆112 · Apr 9, 2025 · Updated last year
kaist-ami / LaughTalk
☆14 · Dec 8, 2025 · Updated 7 months ago
MapleBoat / PASDF
Bridging 3D Anomaly Localization and Repair via High-Quality Continuous Geometric Representation (ICCV2025)
☆15 · Dec 17, 2025 · Updated 7 months ago
kaiw7 / STG-CMA
Towards Efficient Audio-Visual Learners via Empowering Pre-trained Vision Transformers with Cross-Modal Adaptation
☆15 · Apr 13, 2024 · Updated 2 years ago
jmnian / WRAG
Code for paper "W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering"
☆16 · Oct 2, 2025 · Updated 9 months ago
kaist-ami / AVHBench
[ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models"
☆25 · Mar 8, 2026 · Updated 4 months ago
SitongGong / Veason-R1
Official code of Veason-R1
☆15 · Jul 14, 2026 · Updated last week
DLUT-yyc / Isomer
[ICCV2023] Isomer: Isomerous Transformer for Zero-Shot Video Object Segmentation
☆30 · Nov 21, 2023 · Updated 2 years ago
JiabenChen / iQuery
[CVPR 2023] iQuery: Instruments as Queries for Audio-Visual Sound Separation
☆73 · Jul 25, 2023 · Updated 2 years ago
OpenNLPLab / FNAC_AVL
[CVPR 2023] Official implementation of our paper - Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learnin…
☆29 · Apr 10, 2023 · Updated 3 years ago
stoneMo / AVGN
Official implementation for AVGN
☆42 · Mar 24, 2023 · Updated 3 years ago
FudanCVL / SAAS
[AAAI 2026] Segment Anything Across Shots: A Method and Benchmark
☆29 · Nov 16, 2025 · Updated 8 months ago
jianzongwu / robust-ref-seg
(TIP 2024) Towards Robust Referring Image Segmentation
☆40 · Mar 2, 2024 · Updated 2 years ago
Vv2077 / visual-chatgpt
Official repo for the paper: Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models
☆10 · Apr 17, 2023 · Updated 3 years ago
skhcjh231 / MATR_codebase
☆22 · Mar 7, 2025 · Updated last year
Hanzy1996 / OpenSeg-R
OpenSeg-R: Improving Open-Vocabulary Segmentation via Step-by-Step Visual Reasoning
☆29 · May 24, 2025 · Updated last year
soham97 / ADIFF
Explaining audio differences using language
☆16 · Feb 11, 2025 · Updated last year
zihuixue / MKE
[ICCV 2021] Multimodal Knowledge Expansion
☆10 · Aug 28, 2021 · Updated 4 years ago