hernanrazo / human-voice-detection
Binary classification problem that aims to classify human voices from audio recordings. Implemented using PyTorch and Librosa.
☆33Updated 3 years ago
Alternatives and similar repositories for human-voice-detection:
Users that are interested in human-voice-detection are comparing it to the libraries listed below
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆64Updated 3 years ago
- Simple real-time Sound Event Detector based on YAMNet and pyaudio.☆21Updated 5 years ago
- Speech Emotion Recognition (SER) in real-time, using Deep Neural Networks (DNN) of Long Short Memory Term (LSTM).☆102Updated 2 years ago
- Classify daily life events using audio data.☆50Updated 4 years ago
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)☆206Updated last year
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorch☆212Updated 4 years ago
- Predicting various emotion in human speech signal by detecting different speech components affected by human emotion.☆45Updated 5 months ago
- A multimodal approach on emotion recognition using audio and text.☆170Updated 4 years ago
- Speech Emotion Recognition☆39Updated last year
- Detecting emotions using MFCC features of human speech using Deep Learning☆125Updated 4 years ago
- Removing background noise in a sound file☆62Updated 5 years ago
- Deep Learning - one shot learning for speaker recognition using Filter Banks☆163Updated 7 months ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆72Updated 2 years ago
- [RAVDESS] Speech Emotion Recognition with Convolutional Attention based Bi-GRU. (Best test accuracy of 87%)☆27Updated last year
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆60Updated 2 years ago
- This project is about performing Speaker diarization for Hindi Language.☆48Updated 3 years ago
- The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)☆37Updated 2 years ago
- Voice Emotion Detector that detects emotion from audio speech using one dimensional CNNs (convolutional neural networks) using keras and …☆104Updated 6 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆94Updated 2 weeks ago
- PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,…☆295Updated 3 years ago
- A collection of datasets for the purpose of emotion recognition/detection in speech.☆308Updated 4 months ago
- Python library for audio augmentation☆83Updated last year
- This repository contains PyTorch implementation of 4 different models for classification of emotions of the speech.☆197Updated 2 years ago
- Analyzes signal, finds fundamental frequency, HNR etc☆15Updated 7 years ago
- A simple Python wrapper for audio noise reduction RNNoise. Simplifies work with it, adds new trained models and detailed instructions for…☆151Updated 8 months ago
- Identify the emotion of multiple speakers in an Audio Segment☆166Updated last year
- ☆47Updated last year
- Voice Activity Detection based on Deep Learning & TensorFlow☆358Updated last year
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆127Updated 3 weeks ago
- Voice Activity Detection (VAD) using deep learning.☆193Updated 5 years ago