SkyDocs / speaker-identification
Speaker Identification using Neural Net.
☆20 Updated last year
Alternatives and similar repositories for speaker-identification
Users who are interested in speaker-identification are comparing it to the libraries listed below.
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorch ☆212 Updated 5 years ago
- Deep Learning - one shot learning for speaker recognition using Filter Banks ☆170 Updated last year
- This project is about performing Speaker diarization for Hindi Language. ☆58 Updated 4 years ago
- Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053 ☆146 Updated 3 years ago
- Analyzing and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models. ☆15 Updated 5 years ago
- The human speaks a language with an accent. A particular accent necessarily reflects a person's linguistic background. The model defines … ☆63 Updated 4 years ago
- ☆27 Updated 4 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database ☆113 Updated last month
- Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261] ☆26 Updated 4 years ago
- ☆49 Updated 2 years ago
- Speaker Identification System (up to 100% accuracy); built using Python 2.7 and python_speech_features library ☆212 Updated 5 years ago
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings' ☆140 Updated last year
- Speaker embedding (d-vector) trained with GE2E loss ☆286 Updated 2 years ago
- Identify the emotion of multiple speakers in an Audio Segment ☆178 Updated 2 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages. ☆80 Updated 2 years ago
- [Interspeech22] Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass… ☆34 Updated last year
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO ☆67 Updated 3 years ago
- ☆67 Updated 7 months ago
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at… ☆438 Updated 5 months ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus ☆183 Updated last year
- Rescoring methods for end-to-end Automatic Speech Recognition ☆26 Updated 5 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN ☆96 Updated 4 years ago
- Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. This model shows state-of-the-art results on the Speech Commands dataset V1 and V2. ☆109 Updated 3 years ago
- Urdu Language Speech Emotional Corpus ☆46 Updated 7 years ago
- End-to-End Neural Diarization ☆420 Updated 4 years ago
- Advanced data structures for handling temporal segments with attached labels. ☆124 Updated 4 months ago
- Target speaker extraction and verification for multi-talker speech ☆196 Updated 4 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x ☆71 Updated 3 years ago
- Spot the conversation: speaker diarisation in the wild ☆157 Updated 3 years ago
- Speaker identification using voice MFCCs and GMM ☆54 Updated 5 years ago