JusperLee / CTCNet
An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits
☆72Updated 9 months ago
Alternatives and similar repositories for CTCNet:
Users who are interested in CTCNet are comparing it to the libraries listed below:
- Official code release for "RTFS-Net: Recurrent time-frequency modelling for efficient audio-visual speech separation", accepted at ICLR 2024☆39Updated 11 months ago
- ☆49Updated last year
- ☆67Updated 4 months ago
- ☆32Updated 3 months ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆19Updated last year
- Some convenient scripts for your own use☆10Updated 3 years ago
- ☆25Updated last year
- ☆153Updated 2 months ago
- An efficient speech separation method☆271Updated 10 months ago
- PyTorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.☆17Updated 2 years ago
- Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models☆19Updated last year
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆53Updated last year
- Query-conditioned target sound extraction model☆20Updated 3 months ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆47Updated 4 months ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆95Updated last year
- ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'☆89Updated last year
- This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using…☆89Updated 2 months ago
- ☆30Updated last year
- A PyTorch implementation of "AN EMPIRICAL STUDY OF CONV-TASNET"☆45Updated 4 years ago
- ☆13Updated 7 months ago
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".☆120Updated 4 months ago
- [INTERSPEECH 2022] This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild.☆47Updated last year
- The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023☆111Updated last year
- Multi-modal speech separation task data generation script on LRS3 data set.☆81Updated last year
- Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"☆53Updated 2 weeks ago
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆34Updated 11 months ago
- Collection of resources for speech processing algorithms || NEWS: the official VoxCeleb link has recently stopped working, and an external download link has been added☆48Updated 2 years ago
- [SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model☆112Updated 4 months ago
- Unofficial PyTorch implementation of Masked Autoencoders that Listen☆66Updated 2 years ago
- VoViT: Low Latency Graph-based Audio-Visual Voice Separation Transformer☆34Updated last year