JusperLee / CTCNet
An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits
☆76 Updated 11 months ago
Alternatives and similar repositories for CTCNet:
Users who are interested in CTCNet are comparing it to the libraries listed below.
- An efficient speech separation method ☆272 Updated last year
- Official code release for "RTFS-Net: Recurrent time-frequency modelling for efficient audio-visual speech separation", accepted ICLR 2024 ☆40 Updated last year
- Multi-modal speech separation task data generation script on LRS3 data set. ☆81 Updated last year
- This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using… ☆92 Updated 4 months ago
- ☆50 Updated last year
- ☆75 Updated 6 months ago
- Some convenient scripts for your own use ☆10 Updated 4 years ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo… ☆19 Updated last year
- Source for the Interspeech 2024 paper "Scaling up masked audio encoder learning for general audio classification" ☆62 Updated this week
- ☆162 Updated 4 months ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w… ☆54 Updated 6 months ago
- ☆25 Updated last year
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023. ☆96 Updated last year
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT ☆53 Updated 2 years ago
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection". ☆128 Updated 6 months ago
- PyTorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training. ☆17 Updated 2 years ago
- Source code for Consistent ensemble distillation for audio tagging ☆30 Updated 9 months ago
- ☆33 Updated 5 months ago
- Script to calculate SNR and SDR using Python ☆90 Updated 4 years ago
- Streaming Audio Transformers for online audio tagging ☆44 Updated 10 months ago
- A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurI… ☆119 Updated 4 months ago
- Unofficial PyTorch implementation of Masked Autoencoders that Listen ☆66 Updated 2 years ago
- ☆30 Updated last year
- This repo provides the network code and the processed samples of the manuscript "Glance and Gaze: A Collaborative Learning Framework for … ☆67 Updated 3 years ago
- This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024) ☆190 Updated 4 months ago
- Official PyTorch code for Deep Audio-Signal Holistic Embeddings ☆87 Updated this week
- Noise-Aware Speech Separation with Contrastive Learning ☆17 Updated last year
- The official repo for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation ☆37 Updated this week
- Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models ☆21 Updated last year
- The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023 ☆112 Updated 2 years ago