cogmhear/Intelligibility-Oriented-Audio-Visual-Speech-Enhancement

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/cogmhear/Intelligibility-Oriented-Audio-Visual-Speech-Enhancement)

cogmhear / Intelligibility-Oriented-Audio-Visual-Speech-Enhancement

Towards Intelligibility-Oriented Audio-Visual Speech Enhancement

☆15

Alternatives and similar repositories for Intelligibility-Oriented-Audio-Visual-Speech-Enhancement

Users that are interested in Intelligibility-Oriented-Audio-Visual-Speech-Enhancement are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

cogmhear / avse_challenge
View on GitHub
COG-MHEAR Audio-Visual Speech Enhancement Challenge
☆48Feb 17, 2026Updated 5 months ago
dyahayumgw / HAAQI-Net
View on GitHub
HAAQI-Net is a novel DNN-based non-intrusive method for assessing music audio quality in hearing aid users.
☆18Sep 26, 2025Updated 10 months ago
aminEdraki / py-intelligibility
View on GitHub
Python implementation of a few speech intelligibility prediction algorithms
☆15May 29, 2024Updated 2 years ago
dhimasryan / STOI-Net
View on GitHub
☆29Nov 7, 2023Updated 2 years ago
JuanFMontesinos / VoViT
View on GitHub
VoViT: Low Latency Graph-based Audio-Visual VoiceSeparation Transformer
☆35Mar 18, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
BUTSpeechFIT / torch_msbg_mbstoi
View on GitHub
Differentiable implementation of MSBG hearing loss model and MBSTOI intelligibility metric for Clarity Enhancement challenge.
☆21Nov 19, 2021Updated 4 years ago
suhangpro / cnn-finetune
View on GitHub
Fine-tuning CNNs with MatConvNet
☆11Sep 29, 2017Updated 8 years ago
zexupan / MuSE
View on GitHub
☆42Nov 22, 2024Updated last year
TeaPoly / PLCPA-ASYM-Loss
View on GitHub
The power-law compressed phase-aware asymmetric (PLCPA-ASYM) loss
☆15Sep 4, 2023Updated 2 years ago
XiangzhuKong / CA-Dense-UNet
View on GitHub
An unofficial code reproduction of Channel Attention Dense U-Net for Multichannel Speech Enhancement
☆13Jul 17, 2023Updated 3 years ago
zexupan / USEV
View on GitHub
☆14Jul 1, 2024Updated 2 years ago
darkjazz / musiclynx
View on GitHub
☆15Jun 8, 2026Updated last month
choijeongsoo / lip2speech-unit
View on GitHub
[Interspeech 2023] Intelligible Lip-to-Speech Synthesis with Speech Units
☆47Oct 26, 2024Updated last year
RanaCM / DSU-AVO
View on GitHub
Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023
☆12May 13, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
kamo-naoyuki / pySIIB
View on GitHub
A python implementation of Speech intelligibility in bits (SIIB)
☆26Apr 4, 2022Updated 4 years ago
danmic / av-se
View on GitHub
Deep-Learning-Based Audio-Visual Speech Enhancement and Separation
☆222Apr 16, 2023Updated 3 years ago
sevagh / xumx-sliCQ
View on GitHub
music demixing with the sliCQ Transform and PyTorch
☆32Nov 10, 2023Updated 2 years ago
prerak23 / RoomParamEstim
View on GitHub
This is the code for the WASPAA 2021 paper "Blind Room Parameter Estimation Using Multiple Multichannel Speech Recordings
☆17Nov 9, 2022Updated 3 years ago
enzodesena / rim
View on GitHub
Matlab implementation of the popular room acoustic model "image method", with the addition of randomisation to remove sweeping echoes (if…
☆14Aug 29, 2020Updated 5 years ago
LiChenda / Multi-clue-TSE-data
View on GitHub
Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"
☆17May 19, 2023Updated 3 years ago
christhetree / scrapl
View on GitHub
Scattering Transform with Random Paths for Machine Learning
☆16Apr 9, 2026Updated 3 months ago
koichi-saito-sony / ismir2024_tutorial_demo
View on GitHub
☆18Nov 8, 2024Updated last year
andresperezEUT / ambisonic_rt_estimation
View on GitHub
Ambisonic Blind Reverberation Time Estimation
☆12Jun 14, 2020Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
metabrainz / listenbrainz-labs
View on GitHub
A collection tools/scripts to explore the ListenBrainz data using Apache Spark.
☆16Jan 19, 2020Updated 6 years ago
nii-yamagishilab / NELE-GAN
View on GitHub
Implementation for paper: Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement
☆22Sep 21, 2021Updated 4 years ago
ASLP-lab / Smart-Glass-Challenge
View on GitHub
☆18Jun 16, 2026Updated last month
onolab-tmu / asp-tutorial-2022
View on GitHub
Ono laboratory audio signal processing exercise for beginners.
☆19May 10, 2023Updated 3 years ago
nicolasobin / binauralLocalization
View on GitHub
binaural sound source localization, ROUTE project - Sorbonne Université
☆14Feb 2, 2018Updated 8 years ago
zexupan / reentry
View on GitHub
☆18Nov 22, 2024Updated last year
pabdzadeh / voice-spoof-detection-system
View on GitHub
A voice spoofing detection system, based on paper presented at ICSPIS 2021
☆10Feb 11, 2022Updated 4 years ago
google-research-datasets / LLAMA1-Test-Set
View on GitHub
We introduce the LLAMA1 Test Set, a comprehensive open-domain world knowledge QA dataset for evaluating question-answering systems. We pr…
☆23Mar 14, 2024Updated 2 years ago
l3das / L3DAS23
View on GitHub
Official repository supporting the L3DAS23 IEEE ICASSP Grand Challenge
☆16Feb 10, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Audio-WestlakeU / NBSS
View on GitHub
The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation
☆363Jan 1, 2025Updated last year
jonashaag / pydct
View on GitHub
Short-Time Discrete Cosine Transform (DCT) for Python. SciPy, TensorFlow and PyTorch implementations.
☆28Feb 11, 2021Updated 5 years ago
int-brain-lab / analysis
View on GitHub
Initial repo for behavioral analyses
☆11Aug 24, 2022Updated 3 years ago
opensourcestories / story-questions
View on GitHub
repository for questions that are asked (or you want answered!) during storytelling sessions
☆12Sep 7, 2025Updated 10 months ago
lucacoma / NeuralBeamspaceDomainFilter
View on GitHub
Unofficial Implementation of "Liu, W., Li, A., Wang, X., Yuan, M., Chen, Y., Zheng, C., & Li, X. (2022). A Neural Beamspace-Domain Filter…
☆19Oct 21, 2022Updated 3 years ago
KranthiKumarR / Localize-to-Binauralize
View on GitHub
Localize to Binauralize: Audio Spatialization from Visual Sound Source Localization (ICCV 2021)
☆10Oct 11, 2021Updated 4 years ago
Overcautious / ADENet
View on GitHub
Accepted by TMM 2022
☆19Aug 18, 2022Updated 3 years ago