JusperLee / CTCNet
An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits
☆74 Updated 11 months ago
Alternatives and similar repositories for CTCNet:
Users who are interested in CTCNet are comparing it to the libraries listed below:
- An efficient speech separation method ☆273 Updated 11 months ago
- ☆71 Updated 6 months ago
- ☆50 Updated last year
- ☆33 Updated 4 months ago
- Some convenient scripts for your own use ☆10 Updated 4 years ago
- Data generation script for the multi-modal speech separation task on the LRS3 dataset. ☆81 Updated last year
- Official code release for "RTFS-Net: Recurrent time-frequency modelling for efficient audio-visual speech separation", accepted at ICLR 2024 ☆40 Updated last year
- ☆160 Updated 3 months ago
- This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using… ☆91 Updated 4 months ago
- ☆25 Updated last year
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo… ☆19 Updated last year
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection". ☆124 Updated 5 months ago
- Script to calculate SNR and SDR using Python ☆90 Updated 4 years ago
- PyTorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training. ☆17 Updated 2 years ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023. ☆95 Updated last year
- Source for the Interspeech 2024 paper "Scaling up masked audio encoder learning for general audio classification" ☆59 Updated last month
- The official repo for "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023 ☆111 Updated 2 years ago
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT ☆53 Updated last year
- Multi-GPU training code based on funcwj's uPIT, with the Dataloader rebuilt. ☆67 Updated 4 years ago
- PyTorch implementation of LiMuSE ☆30 Updated 2 years ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w… ☆51 Updated 5 months ago
- Query-conditioned target sound extraction model ☆20 Updated last week
- A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurI… ☆116 Updated 3 months ago
- ☆32 Updated 9 months ago
- ☆30 Updated last year
- PyTorch implementation of Deep Clustering: Discriminative Embeddings For Segmentation And Separation ☆131 Updated 4 years ago
- A PyTorch implementation of "An Empirical Study of Conv-TasNet" ☆46 Updated 4 years ago
- [ACL 2024] This is the PyTorch code for our paper "StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing" ☆81 Updated 4 months ago
- ☆64 Updated last year
- Unofficial Time Domain Audio Visual Speech Separation Implementation ☆45 Updated last year