A dataset collected from synchronized ad-hoc microphone arrays
☆19Apr 24, 2023Updated 2 years ago
Alternatives and similar repositories for Libri-adhoc40
Users that are interested in Libri-adhoc40 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A python implementation of “Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization” [TASLP 2021]☆27Feb 11, 2023Updated 3 years ago
- Code repository for the paper Robust Sound Source Tracking Using SRP-PHAT and 3D Convolutional Neural Networks☆89Mar 24, 2023Updated 3 years ago
- Full implementation of "End-to-end microphone permutation and number invariant multi-channel speech separation" (Interspeech 2020)☆76Sep 14, 2021Updated 4 years ago
- Graph Neural Networks for Sound Source Localization☆26Oct 31, 2023Updated 2 years ago
- ☆138Oct 25, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022]☆61Sep 28, 2024Updated last year
- Pytorch implemention of SDNet☆23Jun 1, 2021Updated 4 years ago
- ☆10Mar 13, 2022Updated 4 years ago
- A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurI…☆159Apr 29, 2025Updated 11 months ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆33Mar 16, 2023Updated 3 years ago
- This is the public repository for SALSA-Lite features for polyphonic sound event localization and detection using microphone arrays.☆14Dec 3, 2021Updated 4 years ago
- [ECCV 2024] We provide the Pytorch implementation of "Object-Aware NIR-to-Visible Translation".☆15Mar 2, 2025Updated last year
- Colorization of infrared images based on feature fusion and contrastive learning☆12Nov 16, 2021Updated 4 years ago
- In this paper, we propose Filter Gradient Decent (FGD), an efficient stochastic optimization algorithm that makes a consistent estimation…☆12May 18, 2021Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Cross-Modal Relation-Aware Networks for Audio-Visual Event Localization, ACM MM 2020☆32Nov 6, 2020Updated 5 years ago
- ☆14Nov 5, 2021Updated 4 years ago
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆40Oct 11, 2024Updated last year
- ☆23Jul 6, 2025Updated 9 months ago
- cross modal background suppression for audio-visual event localization☆36Mar 18, 2022Updated 4 years ago
- Official Pytorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023)☆43Jul 24, 2023Updated 2 years ago
- Official Repository for paper "Ambisonizer: Neural Upmixing as Spherical Harmonics Generation"☆16May 27, 2024Updated last year
- This is the microphone array generalization investigation based on previous Narrow Band Deep Filtering methods.☆38Mar 12, 2024Updated 2 years ago
- Baseline method for sound event localization task of DCASE 2021 challenge☆42Jun 15, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Tsinghua University SPMI Lab array processing toolkit☆18Nov 23, 2016Updated 9 years ago
- ☆39Oct 14, 2022Updated 3 years ago
- ☆17Mar 9, 2023Updated 3 years ago
- Training data simulation☆59May 6, 2024Updated last year
- Color Based Probabilistic Tracking☆11May 12, 2023Updated 2 years ago
- This repo provides the network code and the processed samples of the manuscript "Glance and Gaze: A Collaborative Learning Framework for …☆72Feb 10, 2022Updated 4 years ago
- FNSE-SBGAN: Far-field Speech Enhancement with Schrödinger Bridge and Generative Adversarial Networks☆18May 12, 2025Updated 11 months ago
- Pushing the limits of acoustic motion tracking☆14Jul 31, 2020Updated 5 years ago
- ☆15Dec 15, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation☆15Aug 1, 2024Updated last year
- Code for Discriminative Sounding Objects Localization (NeurIPS 2020)☆60Jan 19, 2022Updated 4 years ago
- ☆10Jan 26, 2021Updated 5 years ago
- Simple sinc interpolation in PyTorch.☆15Jul 8, 2023Updated 2 years ago
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆18Jun 17, 2022Updated 3 years ago
- DNN based binaural sound localization model, using GCC-PHAT as features☆22Jun 13, 2023Updated 2 years ago
- transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.☆304Jun 15, 2021Updated 4 years ago