A dataset collected from synchronized ad-hoc microphone arrays
☆19 · Apr 24, 2023 · Updated 2 years ago
Alternatives and similar repositories for Libri-adhoc40
Users that are interested in Libri-adhoc40 are comparing it to the libraries listed below.
- A python implementation of “Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization” [TASLP 2021] ☆27 · Feb 11, 2023 · Updated 3 years ago
- Code repository for the paper Robust Sound Source Tracking Using SRP-PHAT and 3D Convolutional Neural Networks ☆88 · Mar 24, 2023 · Updated 3 years ago
- Full implementation of "End-to-end microphone permutation and number invariant multi-channel speech separation" (Interspeech 2020) ☆76 · Sep 14, 2021 · Updated 4 years ago
- Graph Neural Networks for Sound Source Localization ☆26 · Oct 31, 2023 · Updated 2 years ago
- A python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022] ☆60 · Sep 28, 2024 · Updated last year
- PyTorch implementation of SDNet ☆23 · Jun 1, 2021 · Updated 4 years ago
- ☆10 · Mar 13, 2022 · Updated 4 years ago
- A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurI… ☆155 · Apr 29, 2025 · Updated 10 months ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS ☆33 · Mar 16, 2023 · Updated 3 years ago
- This is the public repository for SALSA-Lite features for polyphonic sound event localization and detection using microphone arrays. ☆14 · Dec 3, 2021 · Updated 4 years ago
- ☆135 · Oct 25, 2021 · Updated 4 years ago
- [ECCV 2024] We provide the PyTorch implementation of "Object-Aware NIR-to-Visible Translation". ☆15 · Mar 2, 2025 · Updated last year
- Colorization of infrared images based on feature fusion and contrastive learning ☆12 · Nov 16, 2021 · Updated 4 years ago
- In this paper, we propose Filter Gradient Descent (FGD), an efficient stochastic optimization algorithm that makes a consistent estimation… ☆12 · May 18, 2021 · Updated 4 years ago
- Cross-Modal Relation-Aware Networks for Audio-Visual Event Localization, ACM MM 2020 ☆32 · Nov 6, 2020 · Updated 5 years ago
- ☆14 · Nov 5, 2021 · Updated 4 years ago
- ☆23 · Jul 6, 2025 · Updated 8 months ago
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult… ☆40 · Oct 11, 2024 · Updated last year
- cross modal background suppression for audio-visual event localization ☆36 · Mar 18, 2022 · Updated 4 years ago
- Official PyTorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023) ☆43 · Jul 24, 2023 · Updated 2 years ago
- Official Repository for paper "Ambisonizer: Neural Upmixing as Spherical Harmonics Generation" ☆15 · May 27, 2024 · Updated last year
- This is the microphone array generalization investigation based on previous Narrow Band Deep Filtering methods. ☆38 · Mar 12, 2024 · Updated 2 years ago
- Baseline method for sound event localization task of DCASE 2021 challenge ☆42 · Jun 15, 2021 · Updated 4 years ago
- Tsinghua University SPMI Lab array processing toolkit ☆18 · Nov 23, 2016 · Updated 9 years ago
- ☆39 · Oct 14, 2022 · Updated 3 years ago
- ☆17 · Mar 9, 2023 · Updated 3 years ago
- Training data simulation ☆58 · May 6, 2024 · Updated last year
- Color Based Probabilistic Tracking ☆11 · May 12, 2023 · Updated 2 years ago
- This repo provides the network code and the processed samples of the manuscript "Glance and Gaze: A Collaborative Learning Framework for … ☆71 · Feb 10, 2022 · Updated 4 years ago
- FNSE-SBGAN: Far-field Speech Enhancement with Schrödinger Bridge and Generative Adversarial Networks ☆18 · May 12, 2025 · Updated 10 months ago
- Pushing the limits of acoustic motion tracking ☆14 · Jul 31, 2020 · Updated 5 years ago
- ☆15 · Dec 15, 2020 · Updated 5 years ago
- Code for Discriminative Sounding Objects Localization (NeurIPS 2020) ☆59 · Jan 19, 2022 · Updated 4 years ago
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation ☆15 · Aug 1, 2024 · Updated last year
- ☆10 · Jan 26, 2021 · Updated 5 years ago
- Simple sinc interpolation in PyTorch. ☆15 · Jul 8, 2023 · Updated 2 years ago
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis" ☆18 · Jun 17, 2022 · Updated 3 years ago
- DNN based binaural sound localization model, using GCC-PHAT as features ☆22 · Jun 13, 2023 · Updated 2 years ago
- transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming. ☆304 · Jun 15, 2021 · Updated 4 years ago