[CVPR 2023] iQuery: Instruments as Queries for Audio-Visual Sound Separation
☆72Jul 25, 2023Updated 2 years ago
Alternatives and similar repositories for iQuery
Users that are interested in iQuery are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains the implementation of the paper: "Gear-NeRF: Free-Viewpoint Rendering and Tracking with Motion-aware Spatio-Tem…☆18Sep 4, 2024Updated last year
- [NeurIPS 2022] Unsupervised Multi-View Object Segmentation Using Radiance Field Propagation☆14Nov 9, 2022Updated 3 years ago
- Cyclic Co-Learning of Sounding Object Visual Grounding and Sound Separation☆26Nov 24, 2021Updated 4 years ago
- [CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"☆12Feb 27, 2024Updated 2 years ago
- Code for LAVSS: Location-Guided Audio-Visual Spatial Audio Separation☆18Feb 25, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆23Mar 20, 2024Updated 2 years ago
- Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)☆12Jun 1, 2023Updated 2 years ago
- Official pytorch implementation of "DSPoint: Dual-scale Point Cloud Recognition with High-frequency Fusion"☆19Feb 4, 2025Updated last year
- code for A Large-scale Dataset for Audio-Language Representation Learning☆14Sep 18, 2024Updated last year
- [MLSys 2023] Pre-train and Search: Efficient Embedding Table Sharding with Pre-trained Neural Cost Models☆16May 5, 2023Updated 2 years ago
- ☆16Jun 14, 2023Updated 2 years ago
- [NeurIPS 2022] Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training☆227May 4, 2023Updated 2 years ago
- [CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'☆13Jun 16, 2024Updated last year
- [ICCV 2023] The first DETR model for monocular 3D object detection with depth-guided transformer☆439Jul 15, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆15Jun 15, 2022Updated 3 years ago
- Official implementation for AVGN☆41Mar 24, 2023Updated 3 years ago
- PiLSL is a pairwise interaction learning-based graph neural network (GNN) model for prediction of synthetic lethality (SL) as anti-cancer…☆12Dec 4, 2024Updated last year
- [CVPR 2024 Highlight] Official implementation of the paper: Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-…☆40Apr 20, 2025Updated 11 months ago
- Repo for ICML'23 paper SurCo Learning Linear Surrogates For Combinatorial Nonlinear Optimization Problems☆18Jul 11, 2023Updated 2 years ago
- Sound Separation, Omni modal☆28Sep 15, 2025Updated 6 months ago
- [CIKM 2022] Towards Automated Over-Sampling for Imbalanced Classification☆10Mar 20, 2023Updated 3 years ago
- MUSIC Dataset from The Sound of Pixels (ECCV '18)☆136Aug 12, 2022Updated 3 years ago
- ☆43Feb 21, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- [AAAI 2024] AVSegFormer: Audio-Visual Segmentation with Transformer☆74Mar 6, 2025Updated last year
- AutoVideo: An Automated Video Action Recognition System☆341Jun 22, 2023Updated 2 years ago
- SportsSloMo: A New Benchmark and Baseline Models for Human-centric Video Frame Interpolation, CVPR 2024 (https://arxiv.org/abs/2308.16876…☆78Apr 4, 2024Updated 2 years ago
- (ICCV2023) Official implementation of 'ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding with GPT and Prototype Guidance'…☆59Apr 18, 2024Updated last year
- RATE: Real-time Asynchronous Feature Tracking with Event Cameras. M. Ikura, et.al., IROS2024☆14Oct 22, 2024Updated last year
- [NeurIPS 2025] Separate Anything in Audio with Zero Training☆57Nov 3, 2025Updated 5 months ago
- Co-Separating Sounds of Visual Objects (ICCV 2019)☆99Jul 25, 2023Updated 2 years ago
- Code for paper Audio Visual Speaker Localization from EgoCentric Views☆11Jul 3, 2024Updated last year
- ☆14Jul 1, 2023Updated 2 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Official code for WACV 2024 paper, "Annotation-free Audio-Visual Segmentation"☆38Oct 11, 2024Updated last year
- Binaural audio reproduction through loudspeakers. Also known as crosstalk cancellation.☆11Sep 12, 2024Updated last year
- This repository contains the implementation of the paper: "ChatCam: Empowering Camera Control through Conversational AI", NeurIPS 2024.☆21Nov 15, 2024Updated last year
- Convert a mono channel recording into binaural playback with headphones and loudspeakers☆13Dec 6, 2023Updated 2 years ago
- PyBlend: a package for Blender with Python 🎨☆132Apr 8, 2024Updated 2 years ago
- ☆14Jul 1, 2024Updated last year
- A toolkit for researchers in the multimodal sound separation.☆16Oct 20, 2023Updated 2 years ago