gveres / donateacry-corpusLinks
An infant cry audio corpus that's being built through the Donate-a-cry campaign - see http://donateacry.com
☆183Updated 5 years ago
Alternatives and similar repositories for donateacry-corpus
Users that are interested in donateacry-corpus are comparing it to the libraries listed below
Sorting:
- Recognition of baby cry audio signal☆274Updated 2 years ago
- Voice Activity Detection (VAD) using deep learning.☆197Updated 5 years ago
- Include some core functions and model to handle speech separation☆155Updated 4 years ago
- Deep Neural Network for Speaker Count Estimation☆155Updated 4 years ago
- A statistical model-based Voice Activity Detection☆192Updated 6 years ago
- Voice Activity Detection based on Deep Learning & TensorFlow☆368Updated 2 years ago
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)☆217Updated 2 years ago
- In this repository, we explore using a hybrid system consisting of a Convolutional Neural Network and a Support Vector Machine for Keywor…☆101Updated 2 years ago
- A Python toolbox for speech features extraction☆165Updated 2 years ago
- Train a Deep Learning model to classify audio embeddings on IBM's Deep Learning as a Service (DLaaS) platform - Watson Machine Learning☆101Updated 2 years ago
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorch☆211Updated 5 years ago
- Speech noise reduction which was generated using existing post-production techniques implemented in Python☆180Updated 3 years ago
- Environmental sound classification using Deep Learning with extracted features☆165Updated 5 years ago
- Audio classification with VGGish as feature extractor in TensorFlow☆130Updated 3 years ago
- Deep Learning - one shot learning for speaker recognition using Filter Banks☆170Updated last year
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆66Updated 4 years ago
- Audio data augmentation examples☆34Updated 7 years ago
- Classify daily life events using audio data.☆53Updated 5 years ago
- A python library for voice activity detection (VAD) for speech/non-speech segmentation.☆89Updated 2 years ago
- General purpose sound recognition demo☆158Updated last year
- This project is about performing Speaker diarization for Hindi Language.☆50Updated 4 years ago
- Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.☆108Updated 2 years ago
- This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1☆112Updated 6 years ago
- LogMMSE speech enhancement/noise reduction☆88Updated 5 years ago
- Paper: https://arxiv.org/abs/1702.02285☆64Updated 6 years ago
- PyTorch transcribed audioset classifier, including VGGish and YAMNet, along with utils to manipulate autioset category ontology.☆84Updated 4 months ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆110Updated last year
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networks☆451Updated 5 years ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆254Updated last year
- Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper☆142Updated 2 years ago