merlresearch / tf-locoformer
Transformer with Local Modeling by Convolution for Speech Separation and Enhancement
☆34 Updated 3 months ago
Related projects
Alternatives and complementary repositories for tf-locoformer
- ☆15 Updated 4 months ago
- ☆42 Updated last month
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha… ☆30 Updated 3 months ago
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction ☆33 Updated this week
- Spherical residual vector quantization (SRVQ) ☆26 Updated 2 months ago
- ☆21 Updated 7 months ago
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings ☆17 Updated last month
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation" ☆22 Updated 7 months ago
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation ☆11 Updated 3 months ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction ☆49 Updated 2 weeks ago
- ☆59 Updated last year
- Prosody and Pronunciation Modification Network ☆44 Updated 3 months ago
- Code for the paper "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching" ☆19 Updated 2 weeks ago
- logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source… ☆28 Updated 3 months ago
- A robust pitch tracker using synchro-squeezed FFT and frequency-domain autocorrelation ☆34 Updated 10 months ago
- ☆48 Updated last year
- ☆20 Updated 10 months ago
- Viterbi decoding in PyTorch ☆27 Updated last month
- Implementation of SpatialCodec. ☆54 Updated last year
- Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement ☆22 Updated last year
- Da - ECHO - RetrievAl - daTasEt ☆24 Updated 4 months ago
- Official repository for the paper "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" ☆10 Updated 2 months ago
- Fully Quantized Neural Networks For Speech Enhancement ☆60 Updated 9 months ago
- Multipurpose Multi Speaker Mixture Signal Generator ☆43 Updated last month
- We implemented the DEMUCS model for speech enhancement in the time-frequency domain, and additionally implemented HD-DEMUCS. ☆22 Updated last year
- Landing Page for Divide and Remaster v3 ☆13 Updated 4 months ago
- [Interspeech 2024] Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement ☆33 Updated last month
- ☆32 Updated 2 months ago
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe… ☆33 Updated 2 months ago
- Official implementation of Self-Remixing ☆11 Updated 9 months ago