[CVPR 2023] iQuery: Instruments as Queries for Audio-Visual Sound Separation
☆72 · Jul 25, 2023 · Updated 2 years ago
Alternatives and similar repositories for iQuery
Users who are interested in iQuery are comparing it to the repositories listed below.
- Cyclic Co-Learning of Sounding Object Visual Grounding and Sound Separation ☆26 · Nov 24, 2021 · Updated 4 years ago
- [CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition" ☆12 · Feb 27, 2024 · Updated 2 years ago
- Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023) ☆12 · Jun 1, 2023 · Updated 2 years ago
- Code for "A Large-scale Dataset for Audio-Language Representation Learning" ☆14 · Sep 18, 2024 · Updated last year
- ☆15 · Jun 15, 2022 · Updated 3 years ago
- Code for LAVSS: Location-Guided Audio-Visual Spatial Audio Separation ☆18 · Feb 25, 2025 · Updated last year
- ☆22 · Mar 20, 2024 · Updated last year
- [AAAI 2024] AVSegFormer: Audio-Visual Segmentation with Transformer ☆73 · Mar 6, 2025 · Updated 11 months ago
- Sound Separation, Omni modal ☆28 · Sep 15, 2025 · Updated 5 months ago
- [CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos' ☆13 · Jun 16, 2024 · Updated last year
- Code for paper Audio Visual Speaker Localization from EgoCentric Views ☆11 · Jul 3, 2024 · Updated last year
- [CIKM 2022] Towards Automated Over-Sampling for Imbalanced Classification ☆10 · Mar 20, 2023 · Updated 2 years ago
- RATE: Real-time Asynchronous Feature Tracking with Event Cameras. M. Ikura et al., IROS 2024 ☆14 · Oct 22, 2024 · Updated last year
- Code for paper Learning Audio-Visual Dereverberation ☆30 · Aug 10, 2022 · Updated 3 years ago
- ☆14 · Jul 1, 2024 · Updated last year
- Convert a mono channel recording into binaural playback with headphones and loudspeakers ☆13 · Dec 6, 2023 · Updated 2 years ago
- [MLSys 2023] Pre-train and Search: Efficient Embedding Table Sharding with Pre-trained Neural Cost Models ☆16 · May 5, 2023 · Updated 2 years ago
- Binaural audio reproduction through loudspeakers. Also known as crosstalk cancellation. ☆11 · Sep 12, 2024 · Updated last year
- [RA-L 2023 & IROS 2023] Visual Reinforcement Learning with Self-Supervised 3D Representations ☆84 · Mar 8, 2023 · Updated 2 years ago
- An easy calibration toolbox for VECtor Benchmark ☆29 · Jan 17, 2024 · Updated 2 years ago
- Official code for WACV 2024 paper, "Annotation-free Audio-Visual Segmentation" ☆37 · Oct 11, 2024 · Updated last year
- MUSIC Dataset from The Sound of Pixels (ECCV '18) ☆136 · Aug 12, 2022 · Updated 3 years ago
- [NeurIPS 2025] Separate Anything in Audio with Zero Training ☆56 · Nov 3, 2025 · Updated 4 months ago
- Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues" ☆16 · May 19, 2023 · Updated 2 years ago
- Co-Separating Sounds of Visual Objects (ICCV 2019) ☆99 · Jul 25, 2023 · Updated 2 years ago
- ☆14 · Jul 1, 2023 · Updated 2 years ago
- ☆43 · Feb 21, 2023 · Updated 3 years ago
- ☆17 · Oct 2, 2023 · Updated 2 years ago
- [CVPR 2025 Highlight] Real-time Free-view Human Rendering from Sparse-view RGB Videos using Double Unprojected Textures ☆29 · Jun 20, 2025 · Updated 8 months ago
- Towards Long Form Audio-visual Video Understanding ☆15 · Jan 16, 2026 · Updated last month
- SportsSloMo: A New Benchmark and Baseline Models for Human-centric Video Frame Interpolation, CVPR 2024 (https://arxiv.org/abs/2308.16876… ☆77 · Apr 4, 2024 · Updated last year
- Official Codebase of "A Closer Look at Weakly-Supervised Audio-Visual Source Localization" (NeurIPS 2022) ☆20 · Dec 6, 2022 · Updated 3 years ago
- PyBlend: a package for Blender with Python 🎨 ☆132 · Apr 8, 2024 · Updated last year
- numerical stability testing ground for eventail solver ☆19 · Jul 30, 2024 · Updated last year
- ☆21 · Oct 10, 2024 · Updated last year
- Implementation of Deep Reinforcement Learning Benchmark Algorithms, including DQN, Double DQN, Dueling DQN, Reinforce, Actor-Critic, A2C,… ☆18 · Nov 5, 2021 · Updated 4 years ago
- ☆28 · Dec 29, 2023 · Updated 2 years ago
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024) ☆26 · Feb 22, 2024 · Updated 2 years ago
- A collection of audio codecs with a standardized API ☆35 · May 27, 2025 · Updated 9 months ago