zexupan/MuSE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zexupan/MuSE)

zexupan / MuSE

☆42

Alternatives and similar repositories for MuSE

Users that are interested in MuSE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lin9x / AV-Sepformer
View on GitHub
☆65Jun 28, 2023Updated 3 years ago
zexupan / USEV
View on GitHub
☆14Jul 1, 2024Updated 2 years ago
zexupan / reentry
View on GitHub
☆18Nov 22, 2024Updated last year
Overcautious / ADENet
View on GitHub
Accepted by TMM 2022
☆19Aug 18, 2022Updated 3 years ago
aispeech-lab / advr-avss
View on GitHub
Pytorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.
☆18Jul 11, 2022Updated 4 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
zexupan / avse_hybrid_loss
View on GitHub
☆16Jun 15, 2022Updated 4 years ago
TaoRuijie / SEANet
View on GitHub
Code for Audio-Visual Target Speaker Extraction with Selective Auditory Attention (TASLP)
☆32Feb 28, 2025Updated last year
danmic / av-se
View on GitHub
Deep-Learning-Based Audio-Visual Speech Enhancement and Separation
☆222Apr 16, 2023Updated 3 years ago
gemengtju / SpEx_Plus
View on GitHub
SpEx+(tied) source code
☆96Jul 6, 2023Updated 3 years ago
facebookresearch / VisualVoice
View on GitHub
Audio-Visual Speech Separation with Cross-Modal Consistency
☆250Jul 25, 2023Updated 2 years ago
JusperLee / LRS3-For-Speech-Separation
View on GitHub
Multi-modal speech separation task data generation script on LRS3 data set.
☆88Feb 2, 2024Updated 2 years ago
Jiang-Yidi / FlatTrajectoryDistillation_FTD
View on GitHub
The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023)
☆18Mar 21, 2023Updated 3 years ago
smeetrs / deep_avsr
View on GitHub
A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
☆244Feb 15, 2024Updated 2 years ago
xiaoxiaomiao323 / MSA
View on GitHub
☆16Feb 19, 2026Updated 5 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
JusperLee / Looking-to-Listen-at-the-Cocktail-Party
View on GitHub
Executable code based on Google articles
☆166Dec 8, 2022Updated 3 years ago
JusperLee / AV-ConvTasNet
View on GitHub
Unofficial Time Domain Audio Visual Speech Separation Implementation
☆45Apr 19, 2023Updated 3 years ago
RanaCM / DSU-AVO
View on GitHub
Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023
☆12May 13, 2024Updated 2 years ago
LiChenda / Multi-clue-TSE-data
View on GitHub
Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"
☆17May 19, 2023Updated 3 years ago
JusperLee / CTCNet
View on GitHub
An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits
☆82Apr 28, 2024Updated 2 years ago
Aisaka0v0 / CLAPSep
View on GitHub
Query-conditioned target sound extraction model
☆30Mar 25, 2025Updated last year
JusperLee / Swift-Net
View on GitHub
Power-Guided Grouped SRU for Real-Time Causal Audio-Visual Speech Separation
☆26Updated this week
ahmadikalkhorani / AVCrossNet
View on GitHub
☆16Jul 4, 2024Updated 2 years ago
mutiann / speech_rankings
View on GitHub
A CSRankings-like index for speech researchers
☆35Oct 16, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Jiang-Yidi / TS-TalkNet
View on GitHub
INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues
☆61May 29, 2023Updated 3 years ago
ZBang / USEF-TSE
View on GitHub
☆70Jul 5, 2025Updated last year
JorisCos / LibriMix
View on GitHub
An open source dataset for source separation
☆499Feb 9, 2024Updated 2 years ago
haidog-yaqub / DPMTSE
View on GitHub
A Diffusion Probabilistic Model for Target Sound Extraction
☆40Sep 27, 2024Updated last year
JusperLee / IIANet
View on GitHub
This is the demo of our paper "IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual Speech Separation".
☆110Mar 12, 2025Updated last year
JuanFMontesinos / VoViT
View on GitHub
VoViT: Low Latency Graph-based Audio-Visual VoiceSeparation Transformer
☆35Mar 18, 2023Updated 3 years ago
xuchenglin28 / speaker_extraction_SpEx
View on GitHub
multi-scale time domain speaker extraction
☆81Jun 7, 2021Updated 5 years ago
X-LANCE / MSDWILD
View on GitHub
[INTERSPEECH 2022] This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild.
☆65Jan 24, 2024Updated 2 years ago
yangdongchao / Tim-TSENet
View on GitHub
The source code of Tim-TSENet
☆15Apr 22, 2022Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
spkgyk / RTFS-Net
View on GitHub
Official code release for "RTFS-Net: Recurrent time-frequency modelling for efficient audio-visual speech separation", accepted ICLR 2024
☆51Oct 14, 2025Updated 9 months ago
xuchenglin28 / speaker_extraction
View on GitHub
target speaker extraction and verification for multi-talker speech
☆210Jan 24, 2021Updated 5 years ago
cogmhear / Intelligibility-Oriented-Audio-Visual-Speech-Enhancement
View on GitHub
Towards Intelligibility-Oriented Audio-Visual Speech Enhancement
☆15Sep 6, 2024Updated last year
gemengtju / L-SpEx
View on GitHub
☆39Feb 23, 2022Updated 4 years ago
dr-pato / audio_visual_speech_enhancement
View on GitHub
Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
☆112Mar 19, 2024Updated 2 years ago
cogmhear / avse_challenge
View on GitHub
COG-MHEAR Audio-Visual Speech Enhancement Challenge
☆48Feb 17, 2026Updated 5 months ago
showlab / AVA-AVD
View on GitHub
☆22Nov 24, 2022Updated 3 years ago