JusperLee / CTCNet
An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits
☆76Updated last year
Alternatives and similar repositories for CTCNet
Users that are interested in CTCNet are comparing it to the libraries listed below
Sorting:
- An efficient speech separation method☆274Updated last year
- Official code release for "RTFS-Net: Recurrent time-frequency modelling for efficient audio-visual speech separation", accepted ICLR 2024☆41Updated last year
- Multi-modal speech separation task data generation script on LRS3 data set.☆82Updated last year
- ☆78Updated 7 months ago
- ☆50Updated last year
- ☆35Updated 5 months ago
- Some convenient scripts for your own use☆10Updated 4 years ago
- According to funcwj's uPIT, the training code supporting multi-gpu is written, and the Dataloader is reconstructed.☆67Updated 5 years ago
- Pytorch implements Deep Clustering: Discriminative Embeddings For Segmentation And Separation☆131Updated 4 years ago
- A PyTorch implementation of " AN EMPIRICAL STUDY OF CONV-TASNET "☆48Updated 5 years ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆19Updated last year
- ☆165Updated 5 months ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆96Updated last year
- Source code for Consistent ensemble distillation for audio tagging☆31Updated 10 months ago
- Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Pytorch's Implement☆466Updated last year
- This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using…☆92Updated 5 months ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆58Updated 7 months ago
- ☆25Updated last year
- Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models☆21Updated last year
- Pytorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.☆17Updated 2 years ago
- Script to calculate SNR and SDR using python☆91Updated 4 years ago
- Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"☆65Updated 3 weeks ago
- Unofficial Time Domain Audio Visual Speech Separation Implementation☆45Updated 2 years ago
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".☆128Updated 7 months ago
- Official Implementation of LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models.☆15Updated last week
- Speech Separation☆64Updated last year
- ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'☆89Updated last year
- Pytorch implement of DANet For Speech Separation☆20Updated 5 years ago
- PyTorch implementation of LiMuSE☆30Updated 2 years ago
- INTERSPEECH 2023: "DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models"☆114Updated last year