hernanrazo / human-voice-detection
Binary classification project that aims to detect human voices in audio recordings. Implemented using PyTorch and Librosa.
☆31 Updated 3 years ago
Related projects
Alternatives and complementary repositories for human-voice-detection
- This project performs speaker diarization for the Hindi language. ☆45 Updated 3 years ago
- Deep learning: one-shot learning for speaker recognition using filter banks ☆160 Updated 5 months ago
- [deprecated] Pretrained models for pyannote-audio 1.x ☆71 Updated 2 years ago
- Estimating the age, height, and gender of a speaker from their speech signal. https://arxiv.org/pdf/2110.13653.pdf ☆64 Updated 3 years ago
- Keras (TensorFlow) implementations of automatic speech recognition ☆22 Updated 2 years ago
- A collection of datasets for the purpose of emotion recognition/detection in speech. ☆296 Updated last month
- Extract frequency, power, width and dissonance of formants from wav files ☆25 Updated 2 years ago
- ☆28 Updated 3 years ago
- A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to those of native speech. ☆237 Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) databases ☆84 Updated last month
- Speaker identification system (up to 100% accuracy), built using Python 2.7 and the python_speech_features library ☆207 Updated 4 years ago
- Identify the emotion of multiple speakers in an audio segment ☆164 Updated last year
- Building a deep learning model that predicts the gender of a speaker using TensorFlow 2 ☆111 Updated last year
- Speaker embedding (d-vector) trained with GE2E loss ☆273 Updated 10 months ago
- Voice activity detection based on deep learning and TensorFlow ☆355 Updated last year
- PyTorch implementation of deep audio embedding calculation ☆99 Updated last year
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages. ☆81 Updated last year
- An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering, hyperparameter optimization, model evaluation, … ☆74 Updated 4 years ago
- Multispeaker and emotional TTS based on Tacotron 2 and Waveglow ☆128 Updated 3 years ago
- Simple d-vector based speaker recognition (verification and identification) using PyTorch ☆212 Updated 4 years ago
- ☆32 Updated last year
- Audio processing using deep neural networks. Speaker identification using voice embeddings. ☆13 Updated last year
- Phoneme prediction from speech mel-spectrograms using an RNN. ☆13 Updated 5 years ago
- Python speaker diarization system based on binary key speaker modelling ☆61 Updated 2 years ago
- Repository containing an experimentation platform for training and running inference on wav2vec2 models. ☆85 Updated 2 years ago
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo! ☆340 Updated 2 years ago
- Phoneme recognition using the pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three differen… ☆209 Updated 2 years ago
- Voice-based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM) ☆205 Updated last year
- Kaldi-based speaker verification ☆47 Updated 6 years ago
- Text to speech for Indic languages ☆48 Updated 2 years ago