liyidi / soundnet_localize_sound_source
soundnet and localize sound source
☆12 · Dec 7, 2020 · Updated 5 years ago
Alternatives and similar repositories for soundnet_localize_sound_source
Users who are interested in soundnet_localize_sound_source are comparing it to the libraries listed below.
- Codebase and Dataset for the paper: Learning to Localize Sound Source in Visual Scenes ☆97 · Dec 4, 2024 · Updated last year
- Cross-Modal Relation-Aware Networks for Audio-Visual Event Localization, ACM MM 2020 ☆32 · Nov 6, 2020 · Updated 5 years ago
- Convert an image to stereographic projection (Polar Coordinates) ☆10 · Oct 15, 2022 · Updated 3 years ago
- [2021 CVPR] Positive Sample Propagation along the Audio-Visual Event Line ☆42 · Jul 5, 2022 · Updated 3 years ago
- ☆12 · Jun 2, 2025 · Updated 8 months ago
- CHiME-5 Baseline Array Synchronisation ☆12 · Sep 24, 2018 · Updated 7 years ago
- ☆11 · Jun 15, 2022 · Updated 3 years ago
- SalNet on Keras: A deep convolutional network for saliency prediction ☆11 · Jun 23, 2017 · Updated 8 years ago
- Video frame extraction ☆13 · Aug 30, 2015 · Updated 10 years ago
- SST-Sal: A spherical spatio-temporal approach for saliency prediction in 360º videos ☆15 · Aug 31, 2023 · Updated 2 years ago
- The goal is to estimate the location of a sound source using a microphone array. The LMS method is used to estimate time delays. Steepest descent al… ☆14 · Oct 27, 2017 · Updated 8 years ago
- A JUCE-based stereo expander and harmonic exciter using Mid-Side processing and tube-based distortion for increased stereo width and harm… ☆15 · Nov 21, 2024 · Updated last year
- Developing an algorithm using MATLAB to detect the unknown location (coordinates) of a sound source in a closed room using a series of mic… ☆14 · Jan 10, 2018 · Updated 8 years ago
- My system for the DCASE 2022 Task 3 Sound Event Localization and Detection. ☆12 · Nov 12, 2022 · Updated 3 years ago
- A Spatial-Temporal Recurrent Neural Network for Video Saliency Prediction (TIP2021) ☆12 · Jul 7, 2022 · Updated 3 years ago
- ☆12 · Apr 26, 2018 · Updated 7 years ago
- (TMM 2022) Quality Assessment for Omnidirectional Video: A Spatio-Temporal Distortion Modeling Approach ☆12 · Jun 16, 2021 · Updated 4 years ago
- 2D residual U-Net (ResUNet) and a lead combiner (LC) for 12-lead ECG Abnormality Classification ☆14 · Jan 4, 2024 · Updated 2 years ago
- ☆14 · Aug 17, 2024 · Updated last year
- Delay estimation logic extracted from WebRTC ☆18 · Jan 11, 2021 · Updated 5 years ago
- Repository for the 2023 WACV paper: "Hear The Flow: Optical Flow-Based Self-Supervised Visual Sound Source Localization" ☆12 · Dec 21, 2022 · Updated 3 years ago
- Fast Artistic Videos in PyTorch ☆14 · Oct 3, 2023 · Updated 2 years ago
- The improved version of our previous work SalGAN360, which predicts visual saliency on 360° images ☆15 · Jan 19, 2021 · Updated 5 years ago
- Dual-Path Attention and Recurrent Network for speech separation ☆19 · Sep 12, 2024 · Updated last year
- ☆14 · Aug 8, 2019 · Updated 6 years ago
- ☆23 · Jul 17, 2024 · Updated last year
- This is the first part of a TDOA system used for estimating the time differences. ☆17 · Feb 18, 2015 · Updated 10 years ago
- Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellation ☆23 · Nov 12, 2025 · Updated 3 months ago
- ☆24 · Mar 18, 2024 · Updated last year
- Official PyTorch implementation of our paper "Spherical Vision Transformer for 360° Video Saliency Prediction" (BMVC 2023) ☆21 · Mar 27, 2024 · Updated last year
- Saliency prediction on 360° images with SalGAN ☆16 · Jan 5, 2021 · Updated 5 years ago
- Repository for implementation of SalNet360 in Caffe ☆18 · Jul 5, 2018 · Updated 7 years ago
- Particle-filter-based object tracking ☆17 · Mar 9, 2020 · Updated 5 years ago
- ☆18 · Apr 10, 2023 · Updated 2 years ago
- Offline CGMM and CGMM with spatial prior distribution in an online manner ☆20 · Apr 19, 2019 · Updated 6 years ago
- ☆21 · Dec 25, 2020 · Updated 5 years ago
- The large-scale eye-tracking database called LEDOV for video saliency ☆19 · Sep 26, 2019 · Updated 6 years ago
- This repo aims to perform sound localization in complex audiovisual scenes, where there are multiple objects making sounds. ☆95 · Oct 18, 2021 · Updated 4 years ago
- Cross-modal active contrastive coding ☆22 · Mar 17, 2021 · Updated 4 years ago