nikvaessen / w2v2-speakerView external linksLinks
Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053
☆146May 10, 2022Updated 3 years ago
Alternatives and similar repositories for w2v2-speaker
Users that are interested in w2v2-speaker are comparing it to the libraries listed below
Sorting:
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆96Nov 20, 2024Updated last year
- ☆31Jul 13, 2023Updated 2 years ago
- ASR text preprocessing utility☆21Aug 5, 2024Updated last year
- ☆157Jan 9, 2023Updated 3 years ago
- Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)☆787Apr 11, 2024Updated last year
- An Open Source Tools for Speaker Recognition☆634Aug 5, 2024Updated last year
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆50Oct 27, 2022Updated 3 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆96Sep 15, 2021Updated 4 years ago
- INTERSPEECH 2023: "DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models"☆116Jan 26, 2024Updated 2 years ago
- UniSpeech - Large Scale Self-Supervised Learning for Speech☆478Apr 5, 2024Updated last year
- In defence of metric learning for speaker recognition☆1,161Mar 26, 2024Updated last year
- ☆13Sep 25, 2024Updated last year
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 2 years ago
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Mar 24, 2023Updated 2 years ago
- ☆25Mar 12, 2022Updated 3 years ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)☆104Nov 26, 2022Updated 3 years ago
- ICASSP 2023 Accepted☆189May 6, 2024Updated last year
- A PyTorch implementation of the universal neural vocoder☆67Nov 6, 2020Updated 5 years ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆34Jun 25, 2021Updated 4 years ago
- Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit☆1,192Jan 20, 2026Updated 3 weeks ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆77Jul 16, 2023Updated 2 years ago
- A repository for benchmarking neural vocoders by their quality and speed.☆212May 30, 2025Updated 8 months ago
- An ODE-based generative neural vocoder using Rectified Flow☆58Apr 29, 2023Updated 2 years ago
- A library for speech data augmentation in time-domain☆682Aug 30, 2021Updated 4 years ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.☆367Oct 12, 2021Updated 4 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)☆12Sep 2, 2024Updated last year
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995☆78Dec 3, 2024Updated last year
- Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".☆469Apr 24, 2024Updated last year
- NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis☆150Feb 11, 2023Updated 3 years ago
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer☆90Jun 9, 2022Updated 3 years ago
- DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning☆53Jan 18, 2024Updated 2 years ago
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code☆203Sep 4, 2022Updated 3 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit☆2,527Jun 13, 2025Updated 8 months ago
- ☆64May 23, 2022Updated 3 years ago
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models☆59Jul 1, 2025Updated 7 months ago
- **Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speec…☆102Apr 10, 2025Updated 10 months ago
- Torch implementation of Whisper-guided DDPM based Voice Conversion☆49Mar 7, 2023Updated 2 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Jul 16, 2020Updated 5 years ago