hernanrazo / human-voice-detectionLinks
Binary classification problem that aims to classify human voices from audio recordings. Implemented using PyTorch and Librosa.
☆34Updated 3 years ago
Alternatives and similar repositories for human-voice-detection
Users that are interested in human-voice-detection are comparing it to the libraries listed below
Sorting:
- Voice Activity Detection based on Deep Learning & TensorFlow☆366Updated 2 years ago
- Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEE…☆189Updated last year
- Voice Activity Detection (VAD) using deep learning.☆196Updated 5 years ago
- Noise removal/ reducer from the audio file in python. De-noising is done using Wavelets and thresholding is done by VISU Shrink threshold…☆192Updated 2 years ago
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)☆214Updated last year
- The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)☆37Updated 2 years ago
- Identify the emotion of multiple speakers in an Audio Segment☆171Updated 2 years ago
- Classify daily life events using audio data.☆52Updated 5 years ago
- A collection of datasets for the purpose of emotion recognition/detection in speech.☆343Updated 8 months ago
- A simple audio feature extraction library☆80Updated 5 years ago
- ☆32Updated 2 years ago
- Speech Emotion Recognition☆42Updated last year
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆64Updated 2 years ago
- Building a Deep learning model that predicts the gender of a speaker using TensorFlow 2☆126Updated 2 years ago
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorch☆211Updated 4 years ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆249Updated 10 months ago
- PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised T…☆194Updated 2 years ago
- Multi class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to id…☆67Updated 4 years ago
- transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.☆275Updated 3 years ago
- Speaker embedding (d-vector) trained with GE2E loss☆282Updated last year
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!☆351Updated 3 years ago
- A python library for voice activity detection (VAD) for speech/non-speech segmentation.☆86Updated 2 years ago
- This repository contains PyTorch implementation of 4 different models for classification of emotions of the speech.☆203Updated 2 years ago
- The Emotional Voices Database: Towards Controlling the Emotional Expressiveness in Voice Generation Systems☆268Updated last year
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆65Updated 4 years ago
- Official pytorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)☆189Updated last year
- CNN 1D vs 2D audio classification☆104Updated 6 years ago
- Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement☆390Updated 2 weeks ago
- Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-R…☆324Updated last year
- Speech Emotion Recognition (SER) in real-time, using Deep Neural Networks (DNN) of Long Short Memory Term (LSTM).☆110Updated 3 years ago