PyTorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Probabilistic Models
☆23 · Dec 14, 2023 · Updated 2 years ago
Alternatives and similar repositories for Diff-SV
Users who are interested in Diff-SV are comparing it to the repositories listed below.
- Learning Domain-Invariant Transformation for Speaker Verification. ☆11 · Jun 13, 2023 · Updated 2 years ago
- [ICASSP'24] Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verification ☆16 · Mar 20, 2024 · Updated last year
- PyTorch implementation of Extended U-Net for Speaker Verification in Noisy Environments ☆28 · Jul 24, 2023 · Updated 2 years ago
- ☆10 · Dec 22, 2023 · Updated 2 years ago
- ☆27 · Jan 17, 2024 · Updated 2 years ago
- Official repository of NeXt-TDNN for speaker verification ☆80 · Oct 10, 2024 · Updated last year
- This repository contains the official PyTorch implementation and pre-trained models for MR-RawNet. ☆17 · Jun 12, 2024 · Updated last year
- Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024). ☆25 · Sep 19, 2025 · Updated 5 months ago
- ☆26 · Nov 2, 2022 · Updated 3 years ago
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset ☆12 · Sep 29, 2025 · Updated 5 months ago
- ☆46 · Feb 16, 2023 · Updated 3 years ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition ☆11 · Mar 14, 2025 · Updated 11 months ago
- Repository of published DNN speech separation recipes for a number of datasets ☆12 · Jan 22, 2024 · Updated 2 years ago
- ☆15 · Nov 11, 2024 · Updated last year
- PyTorch implementation of RawNeXt: Speaker verification system for variable-duration utterance with deep layer aggregation and dynamic sc… ☆25 · Jun 22, 2022 · Updated 3 years ago
- ☆32 · Jan 9, 2024 · Updated 2 years ago
- The implementation of "End-to-End Neural Speaker Diarization with an Iterative Adaptive Attractor Estimation", which is accepted by Neura… ☆11 · Aug 27, 2023 · Updated 2 years ago
- Digital Speech Processing in PyTorch. ☆15 · Aug 12, 2022 · Updated 3 years ago
- An implementation of adaptive score normalization (AS-Norm) for speaker verification/recognition in PyTorch ☆10 · Oct 12, 2022 · Updated 3 years ago
- ☆67 · Aug 16, 2023 · Updated 2 years ago
- A robust pitch tracker using synchro-squeezed FFT and frequency-domain autocorrelation ☆36 · Jan 17, 2024 · Updated 2 years ago
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation ☆13 · Feb 18, 2026 · Updated last week
- ☆30 · Jun 12, 2025 · Updated 8 months ago
- A Python library for high-quality, fast, and customizable dynamic audio compression and peak limiting. ☆15 · Oct 24, 2025 · Updated 4 months ago
- Official implementation for the paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations" ☆41 · Aug 14, 2025 · Updated 6 months ago
- Megatts2 using HierSpeechpp's vocoder ☆18 · Dec 2, 2024 · Updated last year
- A neural vocoder supporting Ring Attention, Conformer and NSF. ☆24 · Aug 1, 2025 · Updated 7 months ago
- ☆38 · Feb 1, 2024 · Updated 2 years ago
- ☆68 · Jul 23, 2023 · Updated 2 years ago
- BAE-NET: A Low Complexity and High Fidelity Bandwidth-Adaptive Neural Network for Speech Super-Resolution ☆80 · Aug 20, 2024 · Updated last year
- An attempt to replicate the architecture of MiniMaxTTS described in its technical report ☆49 · Sep 2, 2025 · Updated 6 months ago
- Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech ☆19 · Feb 9, 2025 · Updated last year
- VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement ☆46 · Sep 12, 2024 · Updated last year
- Speech enhancement in noisy and reverberant environments using deep neural networks ☆22 · Oct 10, 2025 · Updated 4 months ago
- ☆21 · Jul 15, 2024 · Updated last year
- ☆24 · May 6, 2025 · Updated 9 months ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP] ☆25 · Jul 5, 2022 · Updated 3 years ago
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh… ☆54 · Mar 5, 2025 · Updated 11 months ago
- ☆19 · Mar 22, 2024 · Updated last year