jim-schwoebel / voice_gender_detection
♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).
☆81Updated 8 months ago
Alternatives and similar repositories for voice_gender_detection:
Users that are interested in voice_gender_detection are comparing it to the libraries listed below
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆64Updated 3 years ago
- Speaker identification using voice MFCCs and GMM☆53Updated 4 years ago
- It uses GMM to train a speaker identification model. The training and testing has been done on subset (34 speakers) from VoxForge data co…☆56Updated 5 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆65Updated 4 years ago
- This project is about performing Speaker diarization for Hindi Language.☆48Updated 3 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Updated 5 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆72Updated 2 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- Various algorithms for voice activity detection☆22Updated 8 years ago
- speaker_diarization done on toy dataset and tested on timit dataset☆8Updated 3 years ago
- An tensorflow implementation of ghostvlad for speaker recognition☆15Updated 5 years ago
- Building a Deep learning model that predicts the gender of a speaker using TensorFlow 2☆120Updated last year
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Updated 5 years ago
- Speaker diarization python system based on binary key speaker modelling☆61Updated 3 years ago
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆49Updated 2 years ago
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorch☆211Updated 4 years ago
- Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.☆103Updated 2 years ago
- Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"☆101Updated 5 years ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆86Updated 2 years ago
- ☆18Updated 2 years ago
- ☆90Updated 2 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆39Updated 2 years ago
- Removes silence segments from wav audio files☆29Updated 4 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆96Updated last year
- Deep Convolution Text to Speech☆35Updated 7 years ago
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆22Updated 2 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆107Updated 11 months ago
- This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1☆111Updated 5 years ago
- Spot the conversation: speaker diarisation in the wild☆134Updated 2 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆46Updated 7 months ago