kagaminccino/LAVSE

Python codes for Lite Audio-Visual Speech Enhancement.

☆95

Alternatives and similar repositories for LAVSE

Users who are interested in LAVSE are comparing it to the repositories listed below.

WilliamYu1993 / ICSE
INCREASING COMPACTNESS OF DEEP LEARNING BASED SPEECH ENHANCEMENT MODELS WITH PARAMETER PRUNING AND QUANTIZATION TECHNIQUES
☆15 · Oct 18, 2019 · Updated 6 years ago
yuwchen / CITISEN
☆33 · Apr 21, 2022 · Updated 4 years ago
danmic / av-se
Deep-Learning-Based Audio-Visual Speech Enhancement and Separation
☆222 · Apr 16, 2023 · Updated 3 years ago
WilliamYu1993 / BAMSE
Bone/Air conducted speech signal enhancement exploiting multi-modal framework
☆19 · Oct 15, 2020 · Updated 5 years ago
dr-pato / audio_visual_speech_enhancement
Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
☆112 · Mar 19, 2024 · Updated 2 years ago
aleXiehta / PhoneFortifiedPerceptualLoss
Improving Perceptual Quality by Phone-Fortified Perceptual Loss using Wasserstein Distance for Speech Enhancement
☆82 · Jun 28, 2021 · Updated 5 years ago
weichian0920 / MFA_DAE
Blind Monaural Source Separation on Heart and Lung Sounds Based on Periodic-Coded Deep Autoencoder
☆12 · Apr 8, 2021 · Updated 5 years ago
jerrygood0703 / DDAE
DDAE speech enhancement on spectrogram domain using Keras
☆25 · Aug 21, 2017 · Updated 8 years ago
khhungg / MECG-E
☆17 · Nov 25, 2024 · Updated last year
JasonSWFu / End-to-end-waveform-utterance-enhancement
End-to-end waveform utterance enhancement for direct evaluation metrics optimization by fully convolutional neural networks (TASLP 2018)
☆18 · Jul 12, 2019 · Updated 7 years ago
JuanFMontesinos / VoViT
VoViT: Low Latency Graph-based Audio-Visual Voice Separation Transformer
☆35 · Mar 18, 2023 · Updated 3 years ago
jonlu0602 / DeepDenoisingAutoencoder
Tensorflow implementation for Speech Enhancement (DDAE)
☆49 · Jul 20, 2018 · Updated 8 years ago
nii-yamagishilab / NELE-GAN
Implementation for paper: Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement
☆22 · Sep 21, 2021 · Updated 4 years ago
JasonSWFu / MetricGAN
MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement (ICML 2019, with Travel awar…
☆150 · Apr 19, 2021 · Updated 5 years ago
JuanFMontesinos / Acappella-YNet
Official implementation of A cappella: Audio-visual Singing Voice Separation, from BMVC21
☆18 · May 14, 2022 · Updated 4 years ago
sypdbhee / DWPT-NMF
Applying discrete wavelet packet transform (DWPT) and nonnegative matrix factorization (NMF) analysis to speech enhancement tasks. Conven…
☆12 · May 14, 2017 · Updated 9 years ago
yakovmon / Real-Time-Audio-Visual-Speech-Enhancement
☆13 · May 27, 2019 · Updated 7 years ago
ChangLee0903 / SERIL
Official Implementation of SERIL in Pytorch
☆27 · Sep 29, 2020 · Updated 5 years ago
zexupan / MuSE
☆42 · Nov 22, 2024 · Updated last year
zexupan / USEV
☆14 · Jul 1, 2024 · Updated 2 years ago
zexupan / reentry
☆18 · Nov 22, 2024 · Updated last year
dhimasryan / SpeechAssessmentModels
☆22 · Jan 18, 2024 · Updated 2 years ago
RoyChao19477 / SEMamba
This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)
☆273 · Dec 12, 2025 · Updated 7 months ago
dhimasryan / TMHINT-QI-VoiceMOS2023
☆17 · Oct 18, 2023 · Updated 2 years ago
khhungg / BSSE-SE
Boosting Self-Supervised Embeddings for Speech Enhancement
☆47 · Jun 23, 2022 · Updated 4 years ago
RoyChao19477 / PCS
Perceptual Contrast Stretching on Target Feature for Speech Enhancement (Accepted by INTERSPEECH 2022)
☆73 · May 11, 2024 · Updated 2 years ago
SolomidHero / speech-regeneration-enhancer
Pytorch implementation of paper "High Fidelity Speech Regeneration With Application to Speech Enhancement"
☆15 · May 8, 2021 · Updated 5 years ago
unilight / cdvae-vc
TensorFlow Implementation of CDVAE-VC.
☆54 · Mar 24, 2023 · Updated 3 years ago
XiaoyuBIE1994 / DVAE_SE
(TASLP 2022) Unsupervised speech enhancement using DVAEs
☆23 · Dec 16, 2024 · Updated last year
jerrygood0703 / speech-enhancement-WGAN
speech enhancement GAN on waveform/log-power-spectrum data using Improved WGAN
☆36 · Apr 16, 2018 · Updated 8 years ago
yuwchen / InQSS
☆15 · Oct 6, 2023 · Updated 2 years ago
cogmhear / avse_challenge
COG-MHEAR Audio-Visual Speech Enhancement Challenge
☆48 · Feb 17, 2026 · Updated 5 months ago
JasonSWFu / Quality-Net
Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM. (Interspeech, 2018, with Travel Grants)
☆92 · Jul 22, 2019 · Updated 7 years ago
wangkenpu / Conv-TasNet-PyTorch
A PyTorch implementation of Conv-TasNet
☆46 · Nov 25, 2019 · Updated 6 years ago
ConferencingSpeech / ConferencingSpeech2022
Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications
☆45 · Apr 11, 2022 · Updated 4 years ago
desh2608 / css
PyTorch implementation of Continuous Speech Separation
☆12 · Oct 5, 2022 · Updated 3 years ago
craigmacartney / Wave-U-Net-For-Speech-Enhancement
Improved speech enhancement with the Wave-U-Net, a deep convolutional neural network architecture for audio source separation, implemente…
☆224 · Mar 24, 2023 · Updated 3 years ago
JasonSWFu / VQscore
☆59 · Dec 2, 2024 · Updated last year
Andong-Li-speech / DARCN
The implementation of "A Recursive Network with Dynamic Attention for Monaural Speech Enhancement"
☆80 · Dec 8, 2022 · Updated 3 years ago