JusperLee / CTCNetLinks
An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits
☆78Updated last year
Alternatives and similar repositories for CTCNet
Users that are interested in CTCNet are comparing it to the libraries listed below
Sorting:
- An efficient speech separation method☆280Updated last year
- Official code release for "RTFS-Net: Recurrent time-frequency modelling for efficient audio-visual speech separation", accepted ICLR 2024☆43Updated last year
- ☆52Updated 2 years ago
- ☆83Updated 9 months ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆19Updated last year
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆96Updated 2 years ago
- ☆39Updated 7 months ago
- This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using…☆93Updated 7 months ago
- Some convenient scripts for your own use☆10Updated 4 years ago
- Pytorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.☆17Updated 2 years ago
- Official Implementation of LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models.☆19Updated 3 weeks ago
- Multi-modal speech separation task data generation script on LRS3 data set.☆82Updated last year
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆54Updated 2 years ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆59Updated 8 months ago
- ☆25Updated last year
- Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"☆68Updated 2 months ago
- [INTERSPEECH 2022] This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild.☆52Updated last year
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".☆134Updated 3 weeks ago
- ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'☆90Updated 2 years ago
- ☆171Updated 6 months ago
- ☆13Updated last year
- A PyTorch implementation of " AN EMPIRICAL STUDY OF CONV-TASNET "☆48Updated 5 years ago
- The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023☆116Updated 2 years ago
- VoViT: Low Latency Graph-based Audio-Visual VoiceSeparation Transformer☆34Updated 2 years ago
- Pytorch implement of DANet For Speech Separation☆20Updated 5 years ago
- Streaming Audiotransformers for online Audio tagging☆45Updated last year
- Query-conditioned target sound extraction model☆24Updated 3 months ago
- Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models☆22Updated last year
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆17Updated 3 years ago
- TODO☆39Updated last year