crlandsc / torch-log-wmse
logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source separation systems.
☆36Updated 7 months ago
Alternatives and similar repositories for torch-log-wmse:
Users that are interested in torch-log-wmse are comparing it to the libraries listed below
- ☆43Updated 9 months ago
- [ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆57Updated 2 months ago
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆58Updated 2 years ago
- Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta…☆37Updated 4 months ago
- Speech Human Evaluation Estimation Toolkit (SHEET)☆60Updated 4 months ago
- Unofficial PyTorch implementation of "SCNet: Sparse Compression Network for Music Source Separation"☆53Updated 11 months ago
- Unofficial implementation of NANSY++ in Pytorch Lightning☆50Updated last year
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆91Updated 6 months ago
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆27Updated 3 months ago
- ☆22Updated 11 months ago
- ☆86Updated 3 months ago
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation☆85Updated last year
- Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutiv…☆43Updated 2 weeks ago
- Source Separation training codebase for the Sound Demixing Challenge 2023.☆40Updated last year
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆47Updated last week
- TAPE: An End-to-End Timbre-Aware Pitch Estimator☆22Updated last year
- ☆47Updated 4 months ago
- A DDSP-based neural voice synthesiser.☆114Updated 4 months ago
- ☆22Updated 6 months ago
- Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement☆22Updated last year
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆95Updated 8 months ago
- An open source platform for browser based speech and audio subjective quality tests.☆33Updated last year
- Robust Singing Voice Transcription and MIDI Extraction☆71Updated 4 months ago
- Official repo of ICASSP 2024 paper - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.☆51Updated 2 months ago
- SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)☆65Updated 2 months ago
- ☆83Updated last year
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986☆43Updated 5 months ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆64Updated last year