Official implementation of the paper How to Listen? Rethinking Visual Sound Localization
☆18Apr 25, 2022Updated 3 years ago
Alternatives and similar repositories for rethinking-visual-sound-localization
Users that are interested in rethinking-visual-sound-localization are comparing it to the libraries listed below
Sorting:
- ☆17Nov 22, 2022Updated 3 years ago
- A simplified version for DMC (Deep Multimodal Clustering for Unsupervised Audiovisual Learning)☆19May 27, 2020Updated 5 years ago
- Official Codebase of "Localizing Visual Sounds the Easy Way" (ECCV 2022)☆40Oct 2, 2022Updated 3 years ago
- PyTorch implementation of ECCV 2020 paper "Foley Music: Learning to Generate Music from Videos "☆40Dec 15, 2020Updated 5 years ago
- [2025 CVPR] Towards Open-Vocabulary Audio-Visual Event Localization☆42Mar 7, 2025Updated last year
- ☆27Mar 21, 2024Updated last year
- Project website for "Telling left from right: Learning spatial correspondence between sight and sound"☆25Jun 6, 2022Updated 3 years ago
- Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"☆115Nov 16, 2020Updated 5 years ago
- Code for Look for the Change paper published at CVPR 2022☆36Oct 26, 2022Updated 3 years ago
- Localizing Visual Sounds the Hard Way☆83Jul 6, 2022Updated 3 years ago
- Collection of Deep Reinforcement Learning Jupyter Notebooks. Each notebook is self-contained and presents single algorithm. These include…☆38Mar 7, 2020Updated 6 years ago
- code for our ACM MM 2020 best paper "PiRhDy: Learning Pitch-, Rhythm-, and Dynamics-aware Embeddings for Symbolic Music"☆32Mar 13, 2022Updated 3 years ago
- ☆30Jun 14, 2022Updated 3 years ago
- ICCV 2021☆34May 11, 2022Updated 3 years ago
- Data generator for creating synthetic audio mixtures suitable for DCASE Challenge 2022 Task 3☆45Apr 5, 2023Updated 2 years ago
- Residual Quantization Autoencoder, used for interpreting LLMs☆14Jan 1, 2025Updated last year
- 为了方便大家考研☆10Sep 8, 2021Updated 4 years ago
- Multi-Agent LLM System for Digital Scam Protection☆12Dec 19, 2024Updated last year
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆19Nov 3, 2025Updated 4 months ago
- ☆15Apr 7, 2025Updated 11 months ago
- Repository for the code assignment of the Deep Learning 1 course, Fall 2021 edition☆10Oct 31, 2022Updated 3 years ago
- Train a Mixture of Factor Analyzers (MFA) / Mixture of Probabilistic PCA (MPPCA) - low-rank-plus-diagonal GMMs using pytorch☆41Oct 30, 2022Updated 3 years ago
- This is the repository for Learning to Generate Piano Music With Sustain Pedals☆12Nov 23, 2023Updated 2 years ago
- ☆12Apr 26, 2025Updated 10 months ago
- Federated reconnaissance mini-ImageNet benchmark and baseline models☆13Sep 2, 2021Updated 4 years ago
- Image Search Engine with HuggingFace Sentence Transformer☆12Aug 31, 2023Updated 2 years ago
- [CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'☆13Jun 16, 2024Updated last year
- IBM Quantum Challenge Fall 2023☆10May 23, 2023Updated 2 years ago
- SING: SDE Inference via Natural Gradients☆36Dec 9, 2025Updated 2 months ago
- ☆15Aug 19, 2024Updated last year
- a fast and customizable CUDA int4 tensor core gemm☆15Aug 2, 2024Updated last year
- ☆10Jun 3, 2019Updated 6 years ago
- Compilation of ML/AI Resources for Members of MITxHarvard Women in AI☆11Mar 28, 2022Updated 3 years ago
- Code for Augment & Reduce, a scalable stochastic algorithm for large categorical distributions☆10May 16, 2018Updated 7 years ago
- Official codebase for NeurIPS 2022 paper End-to-end Learning to Index and Search in Large Output Spaces☆12Apr 19, 2023Updated 2 years ago
- Probabilistic Finite Volume Method based on Affine Gaussian Process inference☆11Jun 10, 2024Updated last year
- Machine Learning Reading Group☆11Sep 15, 2023Updated 2 years ago
- ☆10Apr 17, 2024Updated last year
- An implementation of Compositional Attention: Disentangling Search and Retrieval by MILA☆14Jun 1, 2022Updated 3 years ago