anton-jeran / Speech2RIR
This is the official implementation of a reverberant-speech-to-room-impulse-response estimator
☆18Updated 3 months ago
Related projects
Alternatives and complementary repositories for Speech2RIR
- Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutiv…☆41Updated 7 months ago
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆13Updated 2 years ago
- Prediction of sound event bounding boxes (SEBBs)☆20Updated 3 months ago
- Code for the paper "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆17Updated last week
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆22Updated 7 months ago
- The official implementation of DMEL, the method presented in the paper "DMEL: The differentiable log-Mel spectrogram as a trainable layer …☆17Updated 5 months ago
- Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement☆22Updated last year
- A robust pitch tracker using synchro-squeezed FFT and frequency-domain autocorrelation☆34Updated 9 months ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆30Updated 10 months ago
- Transformer with Local Modeling by Convolution for Speech Separation and Enhancement☆34Updated 3 months ago
- Viterbi decoding in PyTorch☆27Updated last month
- Da - ECHO - RetrievAl - daTasEt☆24Updated 4 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆42Updated 2 months ago
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe…☆33Updated 2 months ago
- Repo for source code of EBEN: Extreme Bandwidth Extension Network☆69Updated last month
- PyTorch implementation of the invertible CQT based on non-stationary Gabor filters☆29Updated last year
- Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals☆14Updated 3 months ago
- Refined Band-split RNN - Based on https://arxiv.org/abs/2209.15174☆16Updated 9 months ago
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)☆12Updated 2 months ago
- BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models☆34Updated 3 weeks ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆21Updated last year
- We implemented the DEMUCS model for speech enhancement in the time-frequency domain, and additionally implemented HD-DEMUCS.☆21Updated last year
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆44Updated last week
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆28Updated this week
- Dynamic Mixing For Speech Processing (mix-on-the-fly)☆15Updated 2 years ago