lin9x/AV-Sepformer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lin9x/AV-Sepformer)

lin9x / AV-Sepformer

☆65

Alternatives and similar repositories for AV-Sepformer

Users that are interested in AV-Sepformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zexupan / MuSE
View on GitHub
☆42Nov 22, 2024Updated last year
gemengtju / L-SpEx
View on GitHub
☆39Feb 23, 2022Updated 4 years ago
aispeech-lab / advr-avss
View on GitHub
Pytorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.
☆18Jul 11, 2022Updated 4 years ago
gemengtju / SpEx_Plus
View on GitHub
SpEx+(tied) source code
☆96Jul 6, 2023Updated 3 years ago
spkgyk / RTFS-Net
View on GitHub
Official code release for "RTFS-Net: Recurrent time-frequency modelling for efficient audio-visual speech separation", accepted ICLR 2024
☆51Oct 14, 2025Updated 9 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
LiChenda / Multi-clue-TSE-data
View on GitHub
Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"
☆17May 19, 2023Updated 3 years ago
fakufaku / diffusion-separation
View on GitHub
Single channel speech source separation by diffusion process (ICASSP 2023)
☆126Mar 15, 2024Updated 2 years ago
TaoRuijie / SEANet
View on GitHub
Code for Audio-Visual Target Speaker Extraction with Selective Auditory Attention (TASLP)
☆32Feb 28, 2025Updated last year
Aisaka0v0 / CLAPSep
View on GitHub
Query-conditioned target sound extraction model
☆30Mar 25, 2025Updated last year
ahmadikalkhorani / AVCrossNet
View on GitHub
☆16Jul 4, 2024Updated 2 years ago
xi-j / Mamba-TasNet
View on GitHub
☆116Oct 1, 2024Updated last year
backspacetg / distilXLSR
View on GitHub
Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model
☆13Mar 30, 2025Updated last year
ZBang / USEF-TSE
View on GitHub
☆70Jul 5, 2025Updated last year
yangdongchao / Tim-TSENet
View on GitHub
The source code of Tim-TSENet
☆15Apr 22, 2022Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
JusperLee / S4M
View on GitHub
Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models
☆28Feb 25, 2026Updated 4 months ago
Sanyuan-Chen / CSS_with_Conformer
View on GitHub
Code for the ICASSP-2021 paper: Continuous Speech Separation with Conformer.
☆120Mar 18, 2023Updated 3 years ago
hmartelb / avlit
View on GitHub
Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…
☆20Sep 1, 2023Updated 2 years ago
snuhcs / Papez
View on GitHub
Papez: Resource-Efficient Speech Separation with Auditory Working Memory (ICASSP 2023)
☆21Jun 25, 2023Updated 3 years ago
xuchenglin28 / speaker_extraction
View on GitHub
target speaker extraction and verification for multi-talker speech
☆210Jan 24, 2021Updated 5 years ago
zexupan / avse_hybrid_loss
View on GitHub
☆16Jun 15, 2022Updated 4 years ago
JusperLee / SPMamba
View on GitHub
☆227Dec 5, 2024Updated last year
JusperLee / AV-ConvTasNet
View on GitHub
Unofficial Time Domain Audio Visual Speech Separation Implementation
☆45Apr 19, 2023Updated 3 years ago
JusperLee / Swift-Net
View on GitHub
Power-Guided Grouped SRU for Real-Time Causal Audio-Visual Speech Separation
☆26Updated this week
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
zexupan / USEV
View on GitHub
☆14Jul 1, 2024Updated 2 years ago
Beilong-Tang / TSELM
View on GitHub
Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models
☆60Apr 14, 2025Updated last year
smeetrs / deep_avsr
View on GitHub
A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
☆244Feb 15, 2024Updated 2 years ago
JusperLee / CTCNet
View on GitHub
An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits
☆82Apr 28, 2024Updated 2 years ago
jyhan03 / dpccn
View on GitHub
This repository provides an implementation of the DPCCN model for single-channel speech separation. More details will be updated soon.
☆13Dec 8, 2021Updated 4 years ago
egruttadauria98 / SSpaVAlDo
View on GitHub
☆37Jan 6, 2026Updated 6 months ago
BUTSpeechFIT / speakerbeam
View on GitHub
☆146Oct 25, 2021Updated 4 years ago
xuchenglin28 / target_speaker_verification
View on GitHub
target speaker verification (tSV), ts-vector, universal speaker verification for single- and multi-talker speech
☆15Jan 26, 2021Updated 5 years ago
X-LANCE / MSDWILD
View on GitHub
[INTERSPEECH 2022] This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild.
☆65Jan 24, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Yip-Jia-Qi / codecformer
View on GitHub
☆21Jul 15, 2024Updated 2 years ago
JorisCos / LibriMix
View on GitHub
An open source dataset for source separation
☆499Feb 9, 2024Updated 2 years ago
liyunlongaaa / AD-TUNING
View on GitHub
AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…
☆11Feb 23, 2024Updated 2 years ago
Overcautious / ADENet
View on GitHub
Accepted by TMM 2022
☆19Aug 18, 2022Updated 3 years ago
Andong-Li-speech / Neural-Vocoders-as-Speech-Enhancers
View on GitHub
☆52Sep 10, 2024Updated last year
JusperLee / TDANet
View on GitHub
An efficient speech separation method
☆277Apr 11, 2024Updated 2 years ago
jyhan03 / icassp22-dataset
View on GitHub
Dataset simulation for DPCCN.
☆16Dec 25, 2022Updated 3 years ago