JusperLee/LRS3-For-Speech-Separation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/JusperLee/LRS3-For-Speech-Separation)

JusperLee / LRS3-For-Speech-Separation

Multi-modal speech separation task data generation script on LRS3 data set.

☆88

Alternatives and similar repositories for LRS3-For-Speech-Separation

Users that are interested in LRS3-For-Speech-Separation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JusperLee / Arxiv-New-Paper-Server
View on GitHub
Arxiv automatically obtains the latest article service.
☆11Apr 29, 2020Updated 6 years ago
JusperLee / awesome-speech-enhancement
View on GitHub
speech enhancement\speech seperation\sound source localization
☆15Apr 22, 2020Updated 6 years ago
JusperLee / ExamOnline
View on GitHub
This is a complete online exam system
☆10Dec 27, 2019Updated 6 years ago
zexupan / MuSE
View on GitHub
☆42Nov 22, 2024Updated last year
JusperLee / UtterancePIT-Speech-Separation
View on GitHub
According to funcwj's uPIT, the training code supporting multi-gpu is written, and the Dataloader is reconstructed.
☆67Apr 14, 2020Updated 6 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
JusperLee / DANet-For-Speech-Separation
View on GitHub
Pytorch implement of DANet For Speech Separation
☆21Jan 9, 2020Updated 6 years ago
JusperLee / Swift-Net
View on GitHub
Power-Guided Grouped SRU for Real-Time Causal Audio-Visual Speech Separation
☆26Jul 20, 2026Updated last week
JusperLee / Speech-Separation-Paper-Tutorial
View on GitHub
A must-read paper for speech separation based on neural networks
☆952Aug 11, 2025Updated 11 months ago
lin9x / AV-Sepformer
View on GitHub
☆65Jun 28, 2023Updated 3 years ago
JusperLee / Deep-Encoder-Decoder-Conv-TasNet
View on GitHub
A PyTorch implementation of " AN EMPIRICAL STUDY OF CONV-TASNET "
☆51Apr 20, 2020Updated 6 years ago
LiChenda / Multi-clue-TSE-data
View on GitHub
Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"
☆17May 19, 2023Updated 3 years ago
JusperLee / Deep-Clustering-for-Speech-Separation
View on GitHub
Pytorch implements Deep Clustering: Discriminative Embeddings For Segmentation And Separation
☆133Jul 14, 2020Updated 6 years ago
JusperLee / Calculate-SNR-SDR
View on GitHub
Script to calculate SNR and SDR using python
☆93Jul 7, 2020Updated 6 years ago
JusperLee / Look2hear
View on GitHub
A toolkit for researchers in the multimodal sound separation.
☆16Oct 20, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
TaoRuijie / SEANet
View on GitHub
Code for Audio-Visual Target Speaker Extraction with Selective Auditory Attention (TASLP)
☆32Feb 28, 2025Updated last year
JusperLee / CTCNet
View on GitHub
An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits
☆82Apr 28, 2024Updated 2 years ago
JusperLee / Dual-Path-RNN-Pytorch
View on GitHub
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation implemented by Pytorch
☆468Feb 14, 2023Updated 3 years ago
danmic / av-se
View on GitHub
Deep-Learning-Based Audio-Visual Speech Enhancement and Separation
☆222Apr 16, 2023Updated 3 years ago
JusperLee / Looking-to-Listen-at-the-Cocktail-Party
View on GitHub
Executable code based on Google articles
☆166Dec 8, 2022Updated 3 years ago
dr-pato / audio_visual_speech_enhancement
View on GitHub
Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
☆112Mar 19, 2024Updated 2 years ago
zexupan / reentry
View on GitHub
☆18Nov 22, 2024Updated last year
Yip-Jia-Qi / codecformer
View on GitHub
☆21Jul 15, 2024Updated 2 years ago
facebookresearch / VisualVoice
View on GitHub
Audio-Visual Speech Separation with Cross-Modal Consistency
☆250Jul 25, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
haidog-yaqub / DPMTSE
View on GitHub
A Diffusion Probabilistic Model for Target Sound Extraction
☆40Sep 27, 2024Updated last year
aispeech-lab / advr-avss
View on GitHub
Pytorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.
☆18Jul 11, 2022Updated 4 years ago
etzinis / optimal_condition_training
View on GitHub
Code and data recipes for the paper: Optimal Condition Training for Target Source Separation by Efthymios Tzinis, Gordon Wichern, Paris S…
☆14Feb 15, 2023Updated 3 years ago
Andong-Li-speech / Neural-Vocoders-as-Speech-Enhancers
View on GitHub
☆52Sep 10, 2024Updated last year
Beilong-Tang / TSELM
View on GitHub
Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models
☆60Apr 14, 2025Updated last year
hmartelb / avlit
View on GitHub
Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…
☆20Sep 1, 2023Updated 2 years ago
ZBang / USEF-TSE
View on GitHub
☆70Jul 5, 2025Updated last year
shincling / discreteSeparation
View on GitHub
The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".
☆12Oct 25, 2021Updated 4 years ago
ASLP-lab / Smart-Glass-Challenge
View on GitHub
☆18Jun 16, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
choijeongsoo / lip2speech-unit
View on GitHub
[Interspeech 2023] Intelligible Lip-to-Speech Synthesis with Speech Units
☆47Oct 26, 2024Updated last year
YUCHEN005 / GILA
View on GitHub
Code for paper "Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition"
☆18Jun 21, 2023Updated 3 years ago
merlresearch / tf-locoformer
View on GitHub
Transformer with Local Modeling by Convolution for Speech Separation and Enhancement
☆133Aug 8, 2025Updated 11 months ago
aispeech-lab / LiMuSE
View on GitHub
PyTorch implementation of LiMuSE
☆33Oct 11, 2022Updated 3 years ago
WangHelin1997 / LibriLightMix-WHAMR
View on GitHub
Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM
☆17Nov 7, 2024Updated last year
yangdongchao / Tim-TSENet
View on GitHub
The source code of Tim-TSENet
☆15Apr 22, 2022Updated 4 years ago
gemengtju / SpEx_Plus
View on GitHub
SpEx+(tied) source code
☆96Jul 6, 2023Updated 3 years ago