hmartelb/avlit

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hmartelb/avlit)

hmartelb / avlit

Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Model" (AVLIT)

☆20

Alternatives and similar repositories for avlit

Users that are interested in avlit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JusperLee / CTCNet
View on GitHub
An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits
☆82Apr 28, 2024Updated 2 years ago
neoncloud / mdctGAN
View on GitHub
Code for INTERSPEECH 2023 paper "mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra"
☆66Jun 3, 2023Updated 3 years ago
JusperLee / TFACM
View on GitHub
☆23Jul 16, 2025Updated last year
haidog-yaqub / DPMTSE
View on GitHub
A Diffusion Probabilistic Model for Target Sound Extraction
☆40Sep 27, 2024Updated last year
Andong-Li-speech / TaEr
View on GitHub
This is the implementation of the manuscript "Learning General All-Neural Speech Enhancement based on Taylor's Approximation Theory", whi…
☆14Nov 25, 2022Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
urgent-challenge / urgent2024_challenge
View on GitHub
Official data preparation scripts for the URGENT 2024 Challenge
☆90May 21, 2025Updated last year
YUCHEN005 / NASE
View on GitHub
Code for paper "Noise-aware Speech Enhancement using Diffusion Probabilistic Model"
☆89Jun 10, 2024Updated 2 years ago
Yip-Jia-Qi / codecformer
View on GitHub
☆21Jul 15, 2024Updated 2 years ago
fakufaku / diffusion-separation
View on GitHub
Single channel speech source separation by diffusion process (ICASSP 2023)
☆126Mar 15, 2024Updated 2 years ago
aispeech-lab / advr-avss
View on GitHub
Pytorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.
☆18Jul 11, 2022Updated 4 years ago
JusperLee / IIANet
View on GitHub
This is the demo of our paper "IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual Speech Separation".
☆110Mar 12, 2025Updated last year
Andong-Li-speech / G2Net
View on GitHub
The implementation of G2Net, the extension of GaGNet and is in submission to T-ASLP
☆19Apr 27, 2022Updated 4 years ago
guozixunnicolas / DENT_DDSP
View on GitHub
☆24Jun 30, 2023Updated 3 years ago
K-STMLab / SSL4PR
View on GitHub
This repository contains the code for the paper "Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection fr…
☆12Dec 19, 2025Updated 7 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
seongq / flowmse
View on GitHub
(ICASSP 2025, official code)FlowSE: Flow Matching-based Speech Enhancement
☆107Jul 23, 2025Updated last year
yangdongchao / Tim-TSENet
View on GitHub
The source code of Tim-TSENet
☆15Apr 22, 2022Updated 4 years ago
sp-uhh / sgmse-bbed
View on GitHub
Brownian Bridge with Exponential Diffusion Coefficient
☆43Nov 1, 2023Updated 2 years ago
JusperLee / Swift-Net
View on GitHub
Power-Guided Grouped SRU for Real-Time Causal Audio-Visual Speech Separation
☆26Updated this week
audiolabs / MonteCarloRIRSimulation
View on GitHub
Room impulse response simulation for various array architectures using Monte-Carlo simulation and quaternions (Python)
☆18Feb 25, 2026Updated 4 months ago
lin9x / AV-Sepformer
View on GitHub
☆65Jun 28, 2023Updated 3 years ago
zexupan / avse_hybrid_loss
View on GitHub
☆16Jun 15, 2022Updated 4 years ago
juhayna-zh / BSRNN-speech-preprocess
View on GitHub
A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.
☆15Aug 22, 2023Updated 2 years ago
dr-pato / SSGD
View on GitHub
Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"
☆15Dec 22, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
huaidanquede / MUSE-Speech-Enhancement
View on GitHub
Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…
☆58Mar 5, 2025Updated last year
RookieJunChen / Inter-SubNet
View on GitHub
The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.
☆102May 24, 2023Updated 3 years ago
JusperLee / S4M
View on GitHub
Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models
☆28Feb 25, 2026Updated 4 months ago
spkgyk / RTFS-Net
View on GitHub
Official code release for "RTFS-Net: Recurrent time-frequency modelling for efficient audio-visual speech separation", accepted ICLR 2024
☆51Oct 14, 2025Updated 9 months ago
Jiang-Yidi / TS-TalkNet
View on GitHub
INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues
☆61May 29, 2023Updated 3 years ago
Emrys365 / se-scaling
View on GitHub
Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…
☆41Aug 7, 2024Updated last year
iamycy / diffwave-sr
View on GitHub
☆87May 21, 2023Updated 3 years ago
sony / diffiner
View on GitHub
☆68Aug 16, 2023Updated 2 years ago
Kuray107 / S4ND-U-Net_speech_enhancement
View on GitHub
☆33May 17, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
NikolaiKyhne / RWSAMamba-UNet
View on GitHub
Official repository for the paper "Exploring Resolution-Wise Shared Attention in Hybrid Mamba-U-Nets for Improved Cross-Corpus Speech Enh…
☆19May 5, 2026Updated 2 months ago
sp-uhh / stcn-nmf
View on GitHub
VAE and STCN with NMF for single-channel speech enhancement
☆15Mar 24, 2021Updated 5 years ago
stoneMo / OneAVM
View on GitHub
Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)
☆12Jun 1, 2023Updated 3 years ago
Honee-W / CPTNN
View on GitHub
unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"
☆15Nov 14, 2023Updated 2 years ago
IsraelCohenLab / ConstantBeamwidthUCCA
View on GitHub
☆11Jun 6, 2022Updated 4 years ago
facebookresearch / learning-audio-visual-dereverberation
View on GitHub
Code for paper Learning Audio-Visual Dereverberation
☆32Aug 10, 2022Updated 3 years ago
YUCHEN005 / Unified-Enhance-Separation
View on GitHub
Code for paper "Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation"
☆45Jul 10, 2024Updated 2 years ago