SkyDocs / speaker-identification
Speaker Identification using Neural Net.
☆20Updated last year
Alternatives and similar repositories for speaker-identification
Users that are interested in speaker-identification are comparing it to the libraries listed below
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorch☆212Updated 5 years ago
- Deep Learning - one shot learning for speaker recognition using Filter Banks☆170Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) database☆108Updated 3 weeks ago
- Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053☆145Updated 3 years ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆181Updated last year
- Speaker identification using voice MFCCs and GMM☆54Updated 5 years ago
- Advanced data structures for handling temporal segments with attached labels.☆124Updated 3 months ago
- ☆67Updated 6 months ago
- This project is about performing Speaker diarization for Hindi Language.☆58Updated 4 years ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆67Updated 3 years ago
- Speaker diarization python system based on binary key speaker modelling☆60Updated 3 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆68Updated 4 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆71Updated 3 years ago
- Speaker embedding (d-vector) trained with GE2E loss☆286Updated last year
- FastAudio is a learnable audio frontend designed by team Magnum for the ASVspoof 2021 challenge☆45Updated 2 years ago
- target speaker extraction and verification for multi-talker speech☆193Updated 4 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆80Updated 2 years ago
- Analyzing and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆15Updated 5 years ago
- Deep neural network (DNN) for noise reduction, removal of background music, and speech separation☆173Updated 3 years ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆92Updated 2 years ago
- Voice Activity Detection (VAD) using deep learning.☆202Updated 6 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆96Updated 4 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆92Updated 4 years ago
- Spot the conversation: speaker diarisation in the wild☆157Updated 3 years ago
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)☆218Updated 2 years ago
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆38Updated 2 years ago
- QuartzNet implementation in PyTorch [https://arxiv.org/abs/1910.10261]☆27Updated 4 years ago
- [InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei …☆209Updated 3 years ago
- A python library for voice activity detection (VAD) for speech/non-speech segmentation.☆89Updated 3 years ago
- Python3 code for the IEEE SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"☆12Updated 5 years ago