JusperLee / CTCNet
An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits
☆70Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for CTCNet
- Official code release for "RTFS-Net: Recurrent time-frequency modelling for efficient audio-visual speech separation", accepted ICLR 2024☆38Updated 8 months ago
- Some convenient scripts for your own use☆10Updated 3 years ago
- ☆46Updated last year
- An efficient speech separation method☆261Updated 7 months ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆95Updated last year
- ☆32Updated last week
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆19Updated last year
- Multi-modal speech separation task data generation script on LRS3 data set.☆77Updated 9 months ago
- Pytorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.☆17Updated 2 years ago
- ☆129Updated last month
- ☆26Updated 10 months ago
- ☆55Updated last month
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆52Updated last year
- Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models☆18Updated last year
- Pytorch implement of DANet For Speech Separation☆20Updated 4 years ago
- This is a complete online exam system☆10Updated 4 years ago
- an Audio-Visual Voice Activity Detection using Deep Learning☆48Updated 5 years ago
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".☆101Updated last month
- This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using…☆84Updated 11 months ago
- SpeechBrain中文文档☆12Updated 3 years ago
- ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'☆87Updated last year
- Official Implementation of "Inference and Denoise: Causal Inference-based Neural Speech Enhancement"☆27Updated last year
- Source code for Consistent ensemble distillation for audio tagging☆16Updated 4 months ago
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆34Updated 7 months ago
- VoViT: Low Latency Graph-based Audio-Visual VoiceSeparation Transformer☆34Updated last year
- Streaming Audiotransformers for online Audio tagging☆41Updated 5 months ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆36Updated last month
- Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement☆195Updated 2 years ago
- A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NIPS …☆95Updated last month
- Noise-Aware Speech Separation with Contrastive Learning☆16Updated 6 months ago