SkyDocs / speaker-identificationLinks
Speaker Identification using Neural Net.
☆20Updated last year
Alternatives and similar repositories for speaker-identification
Users that are interested in speaker-identification are comparing it to the libraries listed below
Sorting:
- Deep Learning - one shot learning for speaker recognition using Filter Banks☆170Updated last year
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorch☆211Updated 5 years ago
- Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053☆145Updated 3 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆66Updated 4 years ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆86Updated last year
- ♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).☆88Updated last year
- This project is about performing Speaker diarization for Hindi Language.☆49Updated 4 years ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆65Updated 2 years ago
- Speaker Identification System (upto 100% accuracy); built using Python 2.7 and python_speech_features library☆211Updated 5 years ago
- Spot the conversation: speaker diarisation in the wild☆145Updated 3 years ago
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆15Updated 5 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆96Updated 4 years ago
- Advanced data structures for handling temporal segments with attached labels.☆118Updated last week
- Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.☆108Updated 2 years ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆177Updated 9 months ago
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆136Updated 8 months ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆82Updated 2 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆107Updated last week
- target speaker extraction and verification for multi-talker speech☆181Updated 4 years ago
- Paper: https://arxiv.org/abs/1702.02285☆64Updated 6 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆53Updated 2 years ago
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)☆217Updated 2 years ago
- An tensorflow implementation of ghostvlad for speaker recognition☆15Updated 6 years ago
- Speaker identification using voice MFCCs and GMM☆54Updated 4 years ago
- Phoneme prediction from speech mel-spectrograms using RNN.☆15Updated 6 years ago
- Python toolkit for speech processing☆71Updated last month
- Transformer-based online speech recognition system with TensorFlow 2☆26Updated 4 years ago
- A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.☆91Updated 5 months ago
- Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家☆46Updated last year
- Urdu Language Speech Emotional Corpus☆46Updated 6 years ago