hernanrazo / human-voice-detection
Binary classification problem that aims to classify human voices from audio recordings. Implemented using PyTorch and Librosa.
☆34Updated 3 years ago
Alternatives and similar repositories for human-voice-detection:
Users that are interested in human-voice-detection are comparing it to the libraries listed below
- Voice Activity Detection based on Deep Learning & TensorFlow☆363Updated 2 years ago
- Deep Learning - one shot learning for speaker recognition using Filter Banks☆167Updated 10 months ago
- Classify daily life events using audio data.☆51Updated 5 years ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆248Updated 9 months ago
- Speaker embedding (d-vector) trained with GE2E loss☆282Updated last year
- Voice Activity Detection (VAD) using deep learning.☆196Updated 5 years ago
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)☆213Updated last year
- Tools for Speech Enhancement integrated with Kaldi☆412Updated last year
- transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.☆272Updated 3 years ago
- A collection of datasets for the purpose of emotion recognition/detection in speech.☆331Updated 7 months ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆65Updated 3 years ago
- Python library for audio augmentation☆84Updated last year
- Identify the emotion of multiple speakers in an Audio Segment☆170Updated 2 years ago
- Noise removal/ reducer from the audio file in python. De-noising is done using Wavelets and thresholding is done by VISU Shrink threshold…☆192Updated 2 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆72Updated 2 years ago
- Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement☆383Updated last month
- An open source dataset for source separation☆417Updated last year
- Speech noise reduction which was generated using existing post-production techniques implemented in Python☆177Updated 3 years ago
- Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEE…☆188Updated last year
- Conformer-based Metric GAN for speech enhancement☆354Updated last year
- Deep neural network (DNN) for noise reduction, removal of background music, and speech separation☆172Updated 2 years ago
- This project is about performing Speaker diarization for Hindi Language.☆49Updated 4 years ago
- Real-time Speech Separation, Noise Suppression & Speaker Recognition☆18Updated 6 years ago
- The Real time Noise cancellation from Audio data signal . Like the construction noise with the denoising the signal .☆117Updated 2 years ago
- A didactic toolkit to rapidly prototype audio classifiers with pre-trained Tensorflow models and Scikit-learn☆143Updated 2 years ago
- Predicts the level of noise and reverberation on your audiofiles☆149Updated 11 months ago
- A PyTorch implementation of DNN-based source separation.☆300Updated 3 years ago
- Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper☆142Updated last year
- Implement Wave-U-Net by PyTorch, and migrate it to the speech enhancement.☆330Updated 2 years ago
- An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering, hyperparameter optimization, model evaluation, …☆75Updated 4 years ago