Official Repository for "Audio-Visual Spatial Integration and Recursive Attention for Robust Sound Source Localization" (ACM MM 2023)
☆18Nov 14, 2023Updated 2 years ago
Alternatives and similar repositories for SIRA-SSL
Users that are interested in SIRA-SSL are comparing it to the libraries listed below
Sorting:
- Official Repository for "Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge" (CVPR 2024)☆13Sep 1, 2024Updated last year
- Official Repository for "Watch Video, Catch Keyword: Context-aware Keyword Attention for Moment Retrieval and Highlight Detection" (AAAI …☆14Mar 1, 2025Updated last year
- Official Repository for "Learning Trimodal Relation for Audio-Visual Question Answering with Missing Modality" (ECCV 2024)☆16Oct 29, 2024Updated last year
- Repository for the 2023 WACV paper: "Hear The Flow: Optical Flow-Based Self-Supervised Visual Sound Source Localization"☆12Dec 21, 2022Updated 3 years ago
- Official Repository for "Multispectral Pedestrian Detection with Sparsely Annotated Label" (AAAI 2025)☆29Apr 28, 2025Updated 10 months ago
- Codebase and Dataset for the paper: Learning to Localize Sound Source in Visual Scenes☆97Dec 4, 2024Updated last year
- 🏠🔍 Auto check for new apartments in Hamburg from various real estate provides☆16Jun 2, 2024Updated last year
- ☆17Aug 11, 2023Updated 2 years ago
- Official implementation of the paper How to Listen? Rethinking Visual Sound Localization☆17Apr 25, 2022Updated 3 years ago
- Official repository of "Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer", AAAI 2024☆27Mar 26, 2024Updated last year
- Complex-valued neural networks for DOA estimation☆29Jan 25, 2023Updated 3 years ago
- A python implementation of “Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization” [TASLP 2021]☆27Feb 11, 2023Updated 3 years ago
- Official Repository for "MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection" (ECCV 2024)☆61Oct 18, 2024Updated last year
- [CVPR 2023] Official implementation of our paper - Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learnin…☆27Apr 10, 2023Updated 2 years ago
- Repository related to Cranfield's AAI MSCs GDP☆11Apr 8, 2023Updated 2 years ago
- Repository of the WACV'24 paper "Can CLIP Help Sound Source Localization?"☆34Feb 21, 2025Updated last year
- Localizing Visual Sounds the Hard Way☆82Jul 6, 2022Updated 3 years ago
- Cross-Modal Relation-Aware Networks for Audio-Visual Event Localization, ACM MM 2020☆32Nov 6, 2020Updated 5 years ago
- Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation☆41Dec 23, 2023Updated 2 years ago
- Audio-Visual Corruption Modeling of our paper "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling an…☆35Jun 20, 2023Updated 2 years ago
- PyTorch code for "Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes" (CVPR, 2022…☆32Jul 8, 2024Updated last year
- cross modal background suppression for audio-visual event localization☆36Mar 18, 2022Updated 3 years ago
- LED : Light Enhanced Depth Estimation at Night☆13Dec 9, 2025Updated 2 months ago
- Data Programming for Text Detection in Documents using SPEAR☆12Mar 26, 2025Updated 11 months ago
- Multi-Agent LLM System for Digital Scam Protection☆12Dec 19, 2024Updated last year
- P1AC: Revisiting Absolute Pose From a Single Affine Correspondence☆11Mar 19, 2024Updated last year
- ☆10Updated this week
- ☆12Jun 26, 2024Updated last year
- ☆10Nov 15, 2023Updated 2 years ago
- Official codebase for "Context Aware Deep Learning for Multi Modal Depression Detection" [ICASSP 2019, Oral]☆11Dec 26, 2024Updated last year
- Repository for the code assignment of the Deep Learning 1 course, Fall 2021 edition☆10Oct 31, 2022Updated 3 years ago
- A tutorial for Sound Source Localization researchers and practitioners. The purpose of this repo is to organize the world’s resources for…☆54Mar 17, 2023Updated 2 years ago
- Probabilistic Finite Volume Method based on Affine Gaussian Process inference☆11Jun 10, 2024Updated last year
- ☆10Jun 13, 2022Updated 3 years ago
- Notes for CS294/194-196: Large Language Model Agents (Fall 2024, UC Berkeley), summarizing 12 lectures on LLM fundamentals, reasoning, pl…☆14Jan 7, 2025Updated last year
- Repository of PIXAR, a Pixel-based Auto-Regressive Language Model☆17Sep 15, 2025Updated 5 months ago
- IBM Quantum Challenge Fall 2023☆10May 23, 2023Updated 2 years ago
- Code for WACV24 work for multiview acoustic-visual detection☆13Mar 22, 2024Updated last year
- SING: SDE Inference via Natural Gradients☆36Dec 9, 2025Updated 2 months ago