nidwbin / AS-NormLinks
A implement of adaptive score normalization (AS-Norm) in speaker verification/recognition with pytorch
☆9Updated 2 years ago
Alternatives and similar repositories for AS-Norm
Users that are interested in AS-Norm are comparing it to the libraries listed below
Sorting:
- Learning Domain-Invariant Transformation for Speaker Verification.☆11Updated last year
- Official implementation of the ICASSP 2024 paper: Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verificat…☆16Updated last year
- Pytorch implementation of RawNeXt: Speaker verification system for variable-duration utterance with deep layer aggregation and dynamic sc…☆25Updated 2 years ago
- ☆13Updated 2 years ago
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification☆20Updated 8 months ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 2 years ago
- TDY-CNN for text-independent speaker verification☆17Updated 2 years ago
- Pytorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…☆23Updated last year
- Official implementation of the INTERSPEECH 2024 paper: Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detect…☆38Updated 6 months ago
- Code for calculate DNS_MOS.☆38Updated 2 years ago
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆37Updated 7 months ago
- Pytorch implementation of Extended U-Net for Speaker Verification in Noisy Environments☆28Updated last year
- ☆32Updated 2 years ago
- Exploring Binary Classification Loss for Speaker Verification☆16Updated last year
- MetricGAN+ PyTorch Implementation☆24Updated last year
- Unofficial Pytorch Lightning Implementation of "A New Framework for CNN-Based Speech Enhancement in the Time Domain"☆19Updated 2 years ago
- ☆32Updated 2 years ago
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆40Updated last month
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)☆21Updated 10 months ago
- ☆24Updated 2 years ago
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆57Updated 8 months ago
- ☆41Updated 4 months ago
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆26Updated last year
- flow matching based speech enhancement☆14Updated 5 months ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆58Updated 7 months ago
- Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation☆104Updated 2 years ago
- wsj0-{2, 3, 4, 5} mix generation scripts, in Python.☆60Updated 4 years ago
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION☆67Updated 9 months ago
- ☆33Updated 4 years ago
- This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).☆27Updated 5 months ago