vb000/SemanticHearing

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/vb000/SemanticHearing)

vb000 / SemanticHearing

Real-time binaural target sound extraction model.

☆99

Alternatives and similar repositories for SemanticHearing

Users that are interested in SemanticHearing are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

introlab / uimvdr
View on GitHub
☆13Oct 11, 2024Updated last year
Audio-WestlakeU / NBSS
View on GitHub
The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation
☆362Jan 1, 2025Updated last year
chentuochao / Sound_Bubble
View on GitHub
Project for speech bubble
☆66Aug 15, 2025Updated 11 months ago
haidog-yaqub / DPMTSE
View on GitHub
A Diffusion Probabilistic Model for Target Sound Extraction
☆40Sep 27, 2024Updated last year
wanganran / HybridBeam
View on GitHub
Source code for AAAI 22 paper: Hybrid Neural Networks for On-Device Directional Hearing
☆19Apr 10, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Audio-WestlakeU / Mel-McNet
View on GitHub
The Official PyTorch Implementation of "Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement" [Interspeech 2025]
☆26May 14, 2026Updated 2 months ago
LiChenda / Multi-clue-TSE-data
View on GitHub
Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"
☆17May 19, 2023Updated 3 years ago
yluo42 / SRVQ
View on GitHub
Spherical residual vector quantization (SRVQ)
☆31Aug 25, 2024Updated last year
tencent-ailab / FRA-RIR
View on GitHub
☆214Dec 4, 2023Updated 2 years ago
Audio-WestlakeU / RealMAN
View on GitHub
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurI…
☆175Apr 29, 2025Updated last year
Audio-WestlakeU / SAR-SSL
View on GitHub
A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…
☆40Oct 11, 2024Updated last year
vb000 / Waveformer
View on GitHub
A deep neural network architecture for low-latency audio processing
☆326Aug 15, 2023Updated 2 years ago
exporl / vlaai
View on GitHub
Decoding of the speech envelope from EEG using the VLAAI deep neural network
☆14Sep 28, 2022Updated 3 years ago
francesclluis / direction-ambisonics-source-separation
View on GitHub
Deep learning for directional sound source separation from Ambisonics mixtures.
☆31Oct 1, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
IoSR-Surrey / RealRoomBRIRs
View on GitHub
Binaural impulse responses captured in real rooms.
☆41Mar 9, 2016Updated 10 years ago
fotisdr / DNN-HA
View on GitHub
DNN-based hearing aid for real-time sound processing
☆25May 25, 2023Updated 3 years ago
FrancoisGrondin / steernet
View on GitHub
☆27May 14, 2020Updated 6 years ago
nfurnon / disco
View on GitHub
Distributed semi-constrained microphone arrays
☆32May 4, 2024Updated 2 years ago
donghoney0416 / DeFTAN-II
View on GitHub
Official page of "DeFTAN-II: Efficient multichannel speech enhancement with subgroup processing", IEEE/ACM Transactions on Audio, Speech,…
☆34Nov 21, 2024Updated last year
vivjay30 / clearbuds
View on GitHub
Clearbuds machine learning repository
☆45Apr 14, 2025Updated last year
Honee-W / CPTNN
View on GitHub
unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"
☆15Nov 14, 2023Updated 2 years ago
echocatzh / py-aec-unified2021
View on GitHub
☆47Jun 6, 2021Updated 5 years ago
vkothapally / Subband-Beamformer
View on GitHub
☆33Nov 29, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
sekiguchi92 / SoundSourceSeparation
View on GitHub
The code for multi-channel source separation and dereverberation such as FastMNMF1, FastMNMF2, and AR-FastMNMF2.
☆213Oct 16, 2022Updated 3 years ago
danielkrause / Moving-Binaural-SDEL
View on GitHub
Implementation of the paper "Binaural Sound Source Distance Estimation and Localization for a Moving Listener"
☆22Mar 2, 2025Updated last year
K-STMLab / SSL4PR
View on GitHub
This repository contains the code for the paper "Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection fr…
☆12Dec 19, 2025Updated 7 months ago
anton-jeran / TS-RIR
View on GitHub
Translating Synthetic RIRs to Real RIRs
☆45Sep 15, 2023Updated 2 years ago
ConferencingSpeech / ConferencingSpeech2021
View on GitHub
Conferencing Speech Challenge
☆95Apr 6, 2021Updated 5 years ago
facebookresearch / R3VIVAL
View on GitHub
A Repository of Room Responses and 360 Videos of a Variable Acoustics Lab
☆46Mar 14, 2023Updated 3 years ago
BUTSpeechFIT / cgmm_mvdr_online
View on GitHub
Implementation of CGMM-MVDR beamforming used for Clarity challenge
☆14Jan 14, 2022Updated 4 years ago
facebookresearch / SS2_HRTF
View on GitHub
SS2 HRTF Dataset - Reality Labs Research Audio
☆18May 22, 2026Updated 2 months ago
amourgela / hearinglosssimulationplugin
View on GitHub
Hearing loss simulation VST plugin
☆14Mar 14, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
michaelneri / audio-distance-estimation
View on GitHub
Official repository of the work "Speaker Distance Estimation in Enclosures from Single-Channel Audio" published to IEEE/ACM Transactions …
☆40Jun 29, 2026Updated 3 weeks ago
Qingzheng-Wang / Dual-Window-SE
View on GitHub
An implement of STFT-Domain Neural Speech Enhancement with Very Low Algorithmic Latency of Zhong-Qiu Wang et al.
☆16Nov 21, 2023Updated 2 years ago
BingYang-20 / DP-RTF-Learning
View on GitHub
A python implementation of “Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization” [TASLP 2021]
☆28Feb 11, 2023Updated 3 years ago
thomeou / SALSA-Lite
View on GitHub
This is the public repository for SALSA-Lite features for polyphonic sound event localization and detection using microphone arrays.
☆15Dec 3, 2021Updated 4 years ago
Audio-WestlakeU / FN-SSL
View on GitHub
The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]
☆159Mar 10, 2026Updated 4 months ago
Enny1991 / beamformers
View on GitHub
Easy to use Beamformers for multi-channel speech separation/enhancement
☆216Jan 26, 2021Updated 5 years ago
echocatzh / MTFAA-Net
View on GitHub
Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement
☆233Sep 30, 2022Updated 3 years ago