VaibhavBhapkar / Speaker-Identification-Using-Machine-LearningLinks
☆32Updated 3 years ago
Alternatives and similar repositories for Speaker-Identification-Using-Machine-Learning
Users that are interested in Speaker-Identification-Using-Machine-Learning are comparing it to the libraries listed below
Sorting:
- Building a Deep learning model that predicts the gender of a speaker using TensorFlow 2☆130Updated 2 years ago
- ☆117Updated 5 years ago
- ♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).☆90Updated last year
- Deep Learning - one shot learning for speaker recognition using Filter Banks☆170Updated last year
- Speaker Identification System (upto 100% accuracy); built using Python 2.7 and python_speech_features library☆212Updated 5 years ago
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)☆220Updated 2 years ago
- Identify the emotion of multiple speakers in an Audio Segment☆178Updated 2 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆154Updated last year
- How to create your own model for vosk☆75Updated 4 years ago
- This project is about performing Speaker diarization for Hindi Language.☆58Updated 4 years ago
- Voice Biometrics Authentication using GMM and Face Recognition Using Facenet and Tensorflow☆113Updated 5 years ago
- This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) wit…☆170Updated 5 years ago
- A live speech recognition using Facebooks wav2vec 2.0 model.☆375Updated last year
- ☆23Updated 3 months ago
- Speech Toolkit for Malaysian language, https://malaya-speech.readthedocs.io/☆276Updated 3 months ago
- Speech Emotion Recognition☆43Updated 2 years ago
- Text to Speech for Indic languages☆52Updated 3 years ago
- A multilingual text-to-speech synthesis system for ten lower-resourced Turkic languages: Azerbaijani, Bashkir, Kazakh, Kyrgyz, Sakha, Tat…☆77Updated 2 years ago
- A Docker image for a relatively light-weight full Arabic speech synthesis system☆31Updated 4 years ago
- 🐸STT integration examples☆130Updated 3 years ago
- Vosk ASR Docker images with GPU for Jetson boards, PCs, M1 laptops and GPC☆44Updated 3 years ago
- On-device noise suppression powered by deep learning☆80Updated 2 weeks ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆71Updated 3 years ago
- Model for recasing and repunctuating ASR transcripts☆143Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 2 years ago
- 🐸TTS recipes for different datasets☆86Updated 3 years ago
- Identifying people from small audio fragments☆171Updated 5 years ago
- Speaker identification using voice MFCCs and GMM☆54Updated 5 years ago
- This repository is a collection of TTS Models in TFLite☆201Updated 4 years ago
- NPTEL2020: Speech2Text dataset for Indian-English Accent☆79Updated 4 years ago