Real-time binaural target sound extraction model.
☆97Mar 28, 2024Updated last year
Alternatives and similar repositories for SemanticHearing
Users that are interested in SemanticHearing are comparing it to the libraries listed below
Sorting:
- ☆13Oct 11, 2024Updated last year
- Project for speech bubble☆56Aug 15, 2025Updated 6 months ago
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆39Oct 11, 2024Updated last year
- A Diffusion Probabilistic Model for Target Sound Extraction☆40Sep 27, 2024Updated last year
- The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation☆336Jan 1, 2025Updated last year
- Spherical residual vector quantization (SRVQ)☆31Aug 25, 2024Updated last year
- Implementation of the paper "Binaural Sound Source Distance Estimation and Localization for a Moving Listener"☆17Mar 2, 2025Updated last year
- A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurI…☆153Apr 29, 2025Updated 10 months ago
- Decoding of the speech envelope from EEG using the VLAAI deep neural network☆15Sep 28, 2022Updated 3 years ago
- Binaural impulse responses captured in real rooms.☆37Mar 9, 2016Updated 9 years ago
- ☆25May 14, 2020Updated 5 years ago
- ☆209Dec 4, 2023Updated 2 years ago
- Implementation of CGMM-MVDR beamforming used for Clarity challenge☆14Jan 14, 2022Updated 4 years ago
- The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]☆141Feb 5, 2026Updated last month
- unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"☆15Nov 14, 2023Updated 2 years ago
- The Official PyTorch Implementation of "Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement" [Interspeech 2025]☆23Jun 9, 2025Updated 8 months ago
- Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"☆16May 19, 2023Updated 2 years ago
- ☆46Jun 6, 2021Updated 4 years ago
- Translating Synthetic RIRs to Real RIRs☆45Sep 15, 2023Updated 2 years ago
- Prediction of sound event bounding boxes (SEBBs)☆32Aug 2, 2024Updated last year
- implementation of Monaural Speech Enhancement with Recursive Learning in the Time Domain☆48Nov 4, 2020Updated 5 years ago
- Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellation☆24Nov 12, 2025Updated 3 months ago
- Source code for AAAI 22 paper: Hybrid Neural Networks for On-Device Directional Hearing☆19Apr 10, 2024Updated last year
- ☆33Nov 29, 2022Updated 3 years ago
- Easy to use Beamformers for multi-channel speech separation/enhancement☆210Jan 26, 2021Updated 5 years ago
- Toolbox for Evaluation of AEC/AES Systems☆33Feb 18, 2026Updated 2 weeks ago
- Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"☆43Oct 30, 2025Updated 4 months ago
- The code for multi-channel source separation and dereverberation such as FastMNMF1, FastMNMF2, and AR-FastMNMF2.☆210Oct 16, 2022Updated 3 years ago
- A deep neural network architecture for low-latency audio processing☆323Aug 15, 2023Updated 2 years ago
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆38Oct 27, 2025Updated 4 months ago
- Fully Quantized Neural Networks For Speech Enhancement☆63Feb 15, 2024Updated 2 years ago
- Clearbuds machine learning repository☆44Apr 14, 2025Updated 10 months ago
- Official repository for LMFCA-Net: A Lightweight Model for Multi-Channel Speech Enhancement with Efficient Narrow-Band and Cross-Band Att…☆29Feb 26, 2025Updated last year
- DNN-based hearing aid for real-time sound processing☆25May 25, 2023Updated 2 years ago
- Official data preparation scripts for the URGENT 2024 Challenge☆87May 21, 2025Updated 9 months ago
- A UNIFIED SPEECH ENHANCEMENT FRONT-END FOR ONLINE DEREVERBERATION, ACOUSTIC ECHO CANCELLATION, AND SOURCE SEPARATION☆124Jun 18, 2022Updated 3 years ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆114Jan 28, 2026Updated last month
- ☆12Jun 22, 2020Updated 5 years ago
- A Deep Convolutional Neural Network (DCNN) designed for the task of localizing human speech to 168 location classes using binaural microp…☆10Dec 16, 2017Updated 8 years ago