merlresearch / tf-locoformer
Transformer with Local Modeling by Convolution for Speech Separation and Enhancement
☆39Updated 5 months ago
Alternatives and similar repositories for tf-locoformer:
Users that are interested in tf-locoformer are comparing it to the libraries listed below
- ☆48Updated last year
- ☆37Updated last week
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆37Updated last month
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆43Updated last week
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆22Updated 9 months ago
- Fully Quantized Neural Networks For Speech Enhancement☆60Updated 11 months ago
- ☆45Updated last month
- ☆15Updated 6 months ago
- [Interspeech 2024] Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement☆34Updated last month
- Code for the paper "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆46Updated last week
- ☆21Updated last year
- ☆23Updated this week
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆21Updated 3 months ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆58Updated last month
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆32Updated 5 months ago
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆36Updated 3 weeks ago
- Implementation of SpatialCodec.☆55Updated last year
- ☆60Updated last year
- This is the official implementation of the LiSenNet☆33Updated 2 months ago
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…☆42Updated 5 months ago
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆21Updated 3 weeks ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆44Updated 4 months ago
- The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement"☆51Updated last year
- Prediction of sound event bounding boxes (SEBBs)☆25Updated 5 months ago
- SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)☆63Updated last month
- Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement☆22Updated last year
- ☆32Updated 4 months ago
- ☆21Updated 8 months ago
- Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutiv…☆42Updated 9 months ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆46Updated 3 months ago