anton-jeran / Speech2RIR
This is the official implementation of a reverberant speech to room impulse response estimator
☆20 Updated 5 months ago
Alternatives and similar repositories for Speech2RIR:
Users who are interested in Speech2RIR are comparing it to the repositories listed below
- ☆15 Updated 6 months ago
- Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutiv… ☆42 Updated 9 months ago
- ☆17 Updated 4 months ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation ☆34 Updated last year
- Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement ☆22 Updated last year
- Code for the paper "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching" ☆49 Updated this week
- ☆45 Updated last month
- ☆43 Updated 7 months ago
- The official implementation of DMEL the method presented in the paper "DMEL: The differentiable log-Mel spectrogram as a trainable layer … ☆18 Updated 3 weeks ago
- Viterbi decoding in PyTorch ☆27 Updated 3 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture. ☆44 Updated 4 months ago
- Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta… ☆31 Updated last month
- We implemented the DEMUCS model for speech enhancement in the time-frequency domain, and additionally implemented HD-DEMUCS. ☆21 Updated last year
- Streaming Vocos ☆19 Updated last week
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-… ☆24 Updated 4 months ago
- ☆21 Updated 8 months ago
- Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals ☆14 Updated 5 months ago
- Prediction of sound event bounding boxes (SEBBs) ☆25 Updated 5 months ago
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe… ☆33 Updated 4 months ago
- Translating Synthetic RIRs to Real RIRs ☆41 Updated last year
- ☆60 Updated last year
- BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models ☆38 Updated 3 months ago
- Prosody and Pronunciation Modification Network ☆47 Updated 5 months ago
- Landing Page for All Things Source Separation ☆19 Updated 2 months ago
- Alignment examples for Interspeech 2024 ☆18 Updated 6 months ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation" ☆22 Updated 9 months ago
- ☆37 Updated last week
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction ☆58 Updated last month
- Repo for source code of EBEN: Extreme Bandwidth Extension Network ☆71 Updated 3 weeks ago