A dataset collected from synchronized ad-hoc microphone arrays
☆19Apr 24, 2023Updated 3 years ago
Alternatives and similar repositories for Libri-adhoc40
Users that are interested in Libri-adhoc40 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A python implementation of “Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization” [TASLP 2021]☆28Feb 11, 2023Updated 3 years ago
- Code repository for the paper Robust Sound Source Tracking Using SRP-PHAT and 3D Convolutional Neural Networks☆90Mar 24, 2023Updated 3 years ago
- Full implementation of "End-to-end microphone permutation and number invariant multi-channel speech separation" (Interspeech 2020)☆76Sep 14, 2021Updated 4 years ago
- Graph Neural Networks for Sound Source Localization☆29Oct 31, 2023Updated 2 years ago
- ☆144Oct 25, 2021Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022]☆66Sep 28, 2024Updated last year
- Pytorch implemention of SDNet☆23Jun 1, 2021Updated 5 years ago
- ☆10Mar 13, 2022Updated 4 years ago
- A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurI…☆173Apr 29, 2025Updated last year
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆33Mar 16, 2023Updated 3 years ago
- Acoustic echo cancel based on aec3 written in rust + other audio processing goodies☆43Jun 24, 2026Updated last week
- This is the public repository for SALSA-Lite features for polyphonic sound event localization and detection using microphone arrays.☆15Dec 3, 2021Updated 4 years ago
- [ECCV 2024] We provide the Pytorch implementation of "Object-Aware NIR-to-Visible Translation".☆17Mar 2, 2025Updated last year
- Colorization of infrared images based on feature fusion and contrastive learning☆12Nov 16, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- In this paper, we propose Filter Gradient Decent (FGD), an efficient stochastic optimization algorithm that makes a consistent estimation…☆12May 18, 2021Updated 5 years ago
- Cross-Modal Relation-Aware Networks for Audio-Visual Event Localization, ACM MM 2020☆33Nov 6, 2020Updated 5 years ago
- ☆16Nov 5, 2021Updated 4 years ago
- ☆24Jul 6, 2025Updated 11 months ago
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆40Oct 11, 2024Updated last year
- ☆20Jun 29, 2025Updated last year
- cross modal background suppression for audio-visual event localization☆36Mar 18, 2022Updated 4 years ago
- Official Pytorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023)☆43Jul 24, 2023Updated 2 years ago
- Official Repository for paper "Ambisonizer: Neural Upmixing as Spherical Harmonics Generation"☆19May 27, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- This is the microphone array generalization investigation based on previous Narrow Band Deep Filtering methods.☆38Mar 12, 2024Updated 2 years ago
- Baseline method for sound event localization task of DCASE 2021 challenge☆45Jun 15, 2021Updated 5 years ago
- Tsinghua University SPMI Lab array processing toolkit☆18Nov 23, 2016Updated 9 years ago
- ☆42Oct 14, 2022Updated 3 years ago
- ☆19Mar 9, 2023Updated 3 years ago
- Training data simulation☆60May 6, 2024Updated 2 years ago
- Color Based Probabilistic Tracking☆11May 12, 2023Updated 3 years ago
- This repo provides the network code and the processed samples of the manuscript "Glance and Gaze: A Collaborative Learning Framework for …☆72Feb 10, 2022Updated 4 years ago
- FNSE-SBGAN: Far-field Speech Enhancement with Schrödinger Bridge and Generative Adversarial Networks☆19May 12, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Pushing the limits of acoustic motion tracking☆14Jul 31, 2020Updated 5 years ago
- ☆15Dec 15, 2020Updated 5 years ago
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation☆15Aug 1, 2024Updated last year
- Code for Discriminative Sounding Objects Localization (NeurIPS 2020)☆61Jan 19, 2022Updated 4 years ago
- ☆10Jan 26, 2021Updated 5 years ago
- Simple sinc interpolation in PyTorch.☆15Jul 8, 2023Updated 2 years ago
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆18Jun 17, 2022Updated 4 years ago