JiabenChen/iQuery

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/JiabenChen/iQuery)

JiabenChen / iQuery

[CVPR 2023] iQuery: Instruments as Queries for Audio-Visual Sound Separation

☆73

Alternatives and similar repositories for iQuery

Users that are interested in iQuery are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

merlresearch / Gear-NeRF
View on GitHub
This repository contains the implementation of the paper: "Gear-NeRF: Free-Viewpoint Rendering and Tracking with Motion-aware Spatio-Tem…
☆18Sep 4, 2024Updated last year
DarlingHang / radiance_field_propagation
View on GitHub
[NeurIPS 2022] Unsupervised Multi-View Object Segmentation Using Radiance Field Propagation
☆14Nov 9, 2022Updated 3 years ago
YapengTian / CCOL-CVPR21
View on GitHub
Cyclic Co-Learning of Sounding Object Visual Grounding and Sound Separation
☆26Nov 24, 2021Updated 4 years ago
lxa9867 / QSD
View on GitHub
[CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"
☆12Feb 27, 2024Updated 2 years ago
YYX666660 / LAVSS
View on GitHub
Code for LAVSS: Location-Guided Audio-Visual Spatial Audio Separation
☆19Feb 25, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
stoneMo / OneAVM
View on GitHub
Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)
☆12Jun 1, 2023Updated 3 years ago
hxixixh / mix-and-localize
View on GitHub
☆23Mar 20, 2024Updated 2 years ago
Adonis-galaxy / DSPoint
View on GitHub
Official pytorch implementation of "DSPoint: Dual-scale Point Cloud Recognition with High-frequency Fusion"
☆20Feb 4, 2025Updated last year
daochenzha / neuroshard
View on GitHub
[MLSys 2023] Pre-train and Search: Efficient Embedding Table Sharding with Pre-trained Neural Cost Models
☆16May 5, 2023Updated 3 years ago
Jiaxin-Pei / Potato-Prolific-Dataset
View on GitHub
☆17Jun 14, 2023Updated 3 years ago
LoieSun / Auto-ACD
View on GitHub
code for A Large-scale Dataset for Audio-Language Representation Learning
☆14Sep 18, 2024Updated last year
ZrrSkywalker / Point-M2AE
View on GitHub
[NeurIPS 2022] Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training
☆228May 4, 2023Updated 3 years ago
ZrrSkywalker / MonoDETR
View on GitHub
[ICCV 2023] The first DETR model for monocular 3D object detection with depth-guided transformer
☆444Jul 15, 2025Updated last year
SAGNIKMJR / ego-AV-spatial-correspondence
View on GitHub
[CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'
☆14Jun 16, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
stoneMo / AVGN
View on GitHub
Official implementation for AVGN
☆41Mar 24, 2023Updated 3 years ago
OpenNLPLab / MMVAE-AVS
View on GitHub
Multimodal Variational Auto-encoder based Audio-Visual Segmentation [ICCV2023].
☆20Sep 19, 2024Updated last year
JieZheng-ShanghaiTech / PiLSL
View on GitHub
PiLSL is a pairwise interaction learning-based graph neural network (GNN) model for prediction of synthetic lethality (SL) as anti-cancer…
☆13Dec 4, 2024Updated last year
zexupan / avse_hybrid_loss
View on GitHub
☆16Jun 15, 2022Updated 4 years ago
SitongGong / Veason-R1
View on GitHub
Official code of Veason-R1
☆15Jul 14, 2026Updated last week
daochenzha / autosmote
View on GitHub
[CIKM 2022] Towards Automated Over-Sampling for Imbalanced Classification
☆10Mar 20, 2023Updated 3 years ago
facebookresearch / soundvista
View on GitHub
soundvista
☆16Dec 31, 2025Updated 6 months ago
YanjieZe / rl3d
View on GitHub
[RA-L 2023 & IROS 2023] Visual Reinforcement Learning with Self-Supervised 3D Representations
☆86Mar 8, 2023Updated 3 years ago
datamllab / autovideo
View on GitHub
AutoVideo: An Automated Video Action Recognition System
☆343Jun 22, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
sony / CLIPSep
View on GitHub
☆43Feb 21, 2023Updated 3 years ago
BASHLab / OWL
View on GitHub
☆15May 25, 2026Updated last month
roudimit / MUSIC_dataset
View on GitHub
MUSIC Dataset from The Sound of Pixels (ECCV '18)
☆137Aug 12, 2022Updated 3 years ago
neu-vi / SportsSloMo
View on GitHub
SportsSloMo: A New Benchmark and Baseline Models for Human-centric Video Frame Interpolation, CVPR 2024 (https://arxiv.org/abs/2308.16876…
☆79Apr 4, 2024Updated 2 years ago
vvvb-github / AVSegFormer
View on GitHub
[AAAI 2024] AVSegFormer: Audio-Visual Segmentation with Transformer
☆74Mar 6, 2025Updated last year
jinbae-s / ACVIS
View on GitHub
[ICASSP 2026] The official pytorch implementation of ACVIS
☆15Jan 19, 2026Updated 6 months ago
kaiw7 / STG-CMA
View on GitHub
Towards Efficient Audio-Visual Learners via Empowering Pre-trained Vision Transformers with Cross-Modal Adaptation
☆15Apr 13, 2024Updated 2 years ago
facebookresearch / learning-audio-visual-dereverberation
View on GitHub
Code for paper Learning Audio-Visual Dereverberation
☆32Aug 10, 2022Updated 3 years ago
ruohaoguo / avis
View on GitHub
[CVPR 2025] 🔥 Official impl. of "Audio-Visual Instance Segmentation".
☆49Jun 5, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
shlizee / savvy
View on GitHub
Repository for SAVVY(Spatial Awareness via Audio-Visual LLMs through Seeing and Hearing) Benchmark and SAVVY model
☆25May 30, 2026Updated last month
rhgao / co-separation
View on GitHub
Co-Separating Sounds of Visual Objects (ICCV 2019)
☆98Jul 25, 2023Updated 2 years ago
ZrrSkywalker / I2P-MAE
View on GitHub
[CVPR 2023] Learning 3D Representations from 2D Pre-trained Models via Image-to-Point Masked Autoencoders
☆230Aug 10, 2023Updated 2 years ago
KawhiZhao / Egocentric-Audio-Visual-Speaker-Localization
View on GitHub
Code for paper Audio Visual Speaker Localization from EgoCentric Views
☆11Jul 3, 2024Updated 2 years ago
WikiChao / DAVIS
View on GitHub
[🏆 IJCV 2025 & ACCV 2024 Best Paper Honorable Mention] Official pytorch implementation of the paper "High-Quality Visually-Guided Sound …
☆33Mar 30, 2026Updated 3 months ago
DarlingHang / ChatCam
View on GitHub
This repository contains the implementation of the paper: "ChatCam: Empowering Camera Control through Conversational AI", NeurIPS 2024.
☆23Nov 15, 2024Updated last year
TheKangChen / crosstalk-cancellation
View on GitHub
Binaural audio reproduction through loudspeakers. Also known as crosstalk cancellation.
☆11Sep 12, 2024Updated last year